Re: Differences between HistoryServer and Yarn TimeLine server?

2014-04-25 Thread Ashwin Shankar
Thanks Zhijie !
I had few more questions  :
1. I played around with the timeline server ui today which showed the
generic application history details,
but I couldn't find any page for application specific data. Is the
expectation that every application
needs to build their own UI using the exposed REST apis and somehow install
it with timeline server ?
Or am I missing something.
2. Are there REST apis for accessing both generic and framework specific
data in 2.4.0 ?
3. Is there an approximate timeframe for timeline server to be feature
complete ?
4. Tez doesn't have any job history UI,is there any work being done to
integrate Tez with timeline server ?
If not,is the timeline server ready for such integration in case someone
wants to pick this up ?

Thanks,
Ashwin



On Thu, Apr 24, 2014 at 12:00 AM, Zhijie Shen zs...@hortonworks.com wrote:

 Ashwin,

 YARN-321 focuses on the issue in the scope of generic application history
 service, while YARN-1530 covers the framework specific data service. And
 yes, the timeline server is going to cover both.

 We've not such a Jira before, but it is described in YARN-321's design
 doc. Anyway, I open a Jira (MAPREDUCE-5858) to track this issue.


 On Wed, Apr 23, 2014 at 11:25 PM, Ashwin Shankar 
 ashwinshanka...@gmail.com wrote:

 Hi Zhijie,
 There seems to two umbrella jiras for this - YARN-321 and YARN-1530,can
 you please let me know what is the
 difference ? Is timeline server finally going to be YARN321+YARN1530 ?

 You mentioned that MR is going to integrated with timeline server,is
 there a jira I can watch ?

 Thanks,
 Ashwin


 On Wed, Apr 23, 2014 at 10:15 PM, Zhijie Shen zs...@hortonworks.comwrote:

 Sam,

 You're right. We can definitely integrate MapReduce to use the timeline
 server to store and serve its specific data, and this is actually our plan.

 However, it's a big move, and we still need time to get it done. In
 addition, not to disturb the users that are currently relying on JHS for MR
 job information, we cannot simply remove JHS from Hadoop.


 On Wed, Apr 23, 2014 at 8:15 PM, sam liu samliuhad...@gmail.com wrote:

 Zhijie,

 I am much clear now. Thanks a lot!

 As my understanding, besides previous Job History Server, hadoop now
 has a new timeline server which could restore both the generic YARN
 application history and the framework specific information. However, I
 think the timeline server also include the functions of Job History Server,
 because it can store the framework specific information(of course, include
 mapreduce framework). In another words, Job History Server is not necessary
 any more.* If that's the case, why hadoop still include Job History
 Server?*


 2014-04-23 12:56 GMT+08:00 Zhijie Shen zs...@hortonworks.com:

  In Hadoop 2.4, we have delivered the timeline server at a preview
 stage, which actually can serve some generic YARN application history as
 well as the framework specific information. Due to the development
 logistics, we have created the two concepts: History Server and Timeline
 Server. To be simple, you can consider the history server of the service 
 of
 the generic YARN application information, while consider the timeline
 server of the service of the framework specific information. Importantly,
 we just have one daemon, which includes both services, and which we'd like
 to call timeline server (unfortunately, the confusing thing is that the
 command to start the daemon is historyserver). We're going on working on
 the timeline server to integrate these two parts, including refactoring 
 the
 names.

 BTW, if you mean MapReduce JobHistoryServer by HistoryServer, it's a
 different daemon, which serves the historic information of MapReduce jobs
 only.


 On Tue, Apr 22, 2014 at 8:44 PM, sam liu samliuhad...@gmail.comwrote:

 Hi Experts,

 I am confusing on these two concepts. Could you help explain the
 differences?

 Thanks!




 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or
 entity to which it is addressed and may contain information that is
 confidential, privileged and exempt from disclosure under applicable law.
 If the reader of this message is not the intended recipient, you are 
 hereby
 notified that any printing, copying, dissemination, distribution,
 disclosure or forwarding of this communication is strictly prohibited. If
 you have received this communication in error, please contact the sender
 immediately and delete it from your system. Thank You.





 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or entity
 to which it is addressed and may contain information that is confidential,
 privileged and exempt from disclosure under applicable law. If the reader
 of this message is not the intended recipient, you are hereby notified that
 any printing, copying, dissemination, distribution, 

Re: Differences between HistoryServer and Yarn TimeLine server?

2014-04-25 Thread Zhijie Shen
1 and 4:
We have thought about that in addition to service application specific
data, the timeline server should accept the web UI plugin from the
application, install it and render the data on the web page according the
application's design, but still need to figure out the plan. Before that,
the application needs to take care of the data rendering itself, or make
use of third-party monitoring service, such as Ambari, which AFAIK, has
integration with the timeline server in the recent release (Tez is
leveraging it). And yes, it's always welcome if somebody want to contribute.

2:
REST APIs are available for accessing both the generic data and framework
specific data. For the API specification, you can temporally look at the
patch in YARN-1876.

3:
In terms of services, they're almost there. The next step would be about
the security, scalability and integration stuff.


On Thu, Apr 24, 2014 at 11:11 PM, Ashwin Shankar
ashwinshanka...@gmail.comwrote:

 Thanks Zhijie !
 I had few more questions  :
 1. I played around with the timeline server ui today which showed the
 generic application history details,
 but I couldn't find any page for application specific data. Is the
 expectation that every application
 needs to build their own UI using the exposed REST apis and somehow
 install it with timeline server ?
 Or am I missing something.
 2. Are there REST apis for accessing both generic and framework specific
 data in 2.4.0 ?
 3. Is there an approximate timeframe for timeline server to be feature
 complete ?
 4. Tez doesn't have any job history UI,is there any work being done to
 integrate Tez with timeline server ?
 If not,is the timeline server ready for such integration in case someone
 wants to pick this up ?

 Thanks,
 Ashwin



 On Thu, Apr 24, 2014 at 12:00 AM, Zhijie Shen zs...@hortonworks.comwrote:

 Ashwin,

 YARN-321 focuses on the issue in the scope of generic application history
 service, while YARN-1530 covers the framework specific data service. And
 yes, the timeline server is going to cover both.

 We've not such a Jira before, but it is described in YARN-321's design
 doc. Anyway, I open a Jira (MAPREDUCE-5858) to track this issue.


 On Wed, Apr 23, 2014 at 11:25 PM, Ashwin Shankar 
 ashwinshanka...@gmail.com wrote:

 Hi Zhijie,
 There seems to two umbrella jiras for this - YARN-321 and YARN-1530,can
 you please let me know what is the
 difference ? Is timeline server finally going to be YARN321+YARN1530 ?

 You mentioned that MR is going to integrated with timeline server,is
 there a jira I can watch ?

 Thanks,
 Ashwin


 On Wed, Apr 23, 2014 at 10:15 PM, Zhijie Shen zs...@hortonworks.comwrote:

 Sam,

 You're right. We can definitely integrate MapReduce to use the timeline
 server to store and serve its specific data, and this is actually our plan.

 However, it's a big move, and we still need time to get it done. In
 addition, not to disturb the users that are currently relying on JHS for MR
 job information, we cannot simply remove JHS from Hadoop.


 On Wed, Apr 23, 2014 at 8:15 PM, sam liu samliuhad...@gmail.comwrote:

 Zhijie,

 I am much clear now. Thanks a lot!

 As my understanding, besides previous Job History Server, hadoop now
 has a new timeline server which could restore both the generic YARN
 application history and the framework specific information. However, I
 think the timeline server also include the functions of Job History 
 Server,
 because it can store the framework specific information(of course, include
 mapreduce framework). In another words, Job History Server is not 
 necessary
 any more.* If that's the case, why hadoop still include Job History
 Server?*


 2014-04-23 12:56 GMT+08:00 Zhijie Shen zs...@hortonworks.com:

  In Hadoop 2.4, we have delivered the timeline server at a preview
 stage, which actually can serve some generic YARN application history as
 well as the framework specific information. Due to the development
 logistics, we have created the two concepts: History Server and Timeline
 Server. To be simple, you can consider the history server of the service 
 of
 the generic YARN application information, while consider the timeline
 server of the service of the framework specific information. Importantly,
 we just have one daemon, which includes both services, and which we'd 
 like
 to call timeline server (unfortunately, the confusing thing is that the
 command to start the daemon is historyserver). We're going on working 
 on
 the timeline server to integrate these two parts, including refactoring 
 the
 names.

 BTW, if you mean MapReduce JobHistoryServer by HistoryServer, it's a
 different daemon, which serves the historic information of MapReduce jobs
 only.


 On Tue, Apr 22, 2014 at 8:44 PM, sam liu samliuhad...@gmail.comwrote:

 Hi Experts,

 I am confusing on these two concepts. Could you help explain the
 differences?

 Thanks!




 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This 

Re: Differences between HistoryServer and Yarn TimeLine server?

2014-04-24 Thread Ashwin Shankar
Hi Zhijie,
There seems to two umbrella jiras for this - YARN-321 and YARN-1530,can you
please let me know what is the
difference ? Is timeline server finally going to be YARN321+YARN1530 ?

You mentioned that MR is going to integrated with timeline server,is there
a jira I can watch ?

Thanks,
Ashwin


On Wed, Apr 23, 2014 at 10:15 PM, Zhijie Shen zs...@hortonworks.com wrote:

 Sam,

 You're right. We can definitely integrate MapReduce to use the timeline
 server to store and serve its specific data, and this is actually our plan.

 However, it's a big move, and we still need time to get it done. In
 addition, not to disturb the users that are currently relying on JHS for MR
 job information, we cannot simply remove JHS from Hadoop.


 On Wed, Apr 23, 2014 at 8:15 PM, sam liu samliuhad...@gmail.com wrote:

 Zhijie,

 I am much clear now. Thanks a lot!

 As my understanding, besides previous Job History Server, hadoop now has
 a new timeline server which could restore both the generic YARN application
 history and the framework specific information. However, I think the
 timeline server also include the functions of Job History Server, because
 it can store the framework specific information(of course, include
 mapreduce framework). In another words, Job History Server is not necessary
 any more.* If that's the case, why hadoop still include Job History
 Server?*


 2014-04-23 12:56 GMT+08:00 Zhijie Shen zs...@hortonworks.com:

  In Hadoop 2.4, we have delivered the timeline server at a preview
 stage, which actually can serve some generic YARN application history as
 well as the framework specific information. Due to the development
 logistics, we have created the two concepts: History Server and Timeline
 Server. To be simple, you can consider the history server of the service of
 the generic YARN application information, while consider the timeline
 server of the service of the framework specific information. Importantly,
 we just have one daemon, which includes both services, and which we'd like
 to call timeline server (unfortunately, the confusing thing is that the
 command to start the daemon is historyserver). We're going on working on
 the timeline server to integrate these two parts, including refactoring the
 names.

 BTW, if you mean MapReduce JobHistoryServer by HistoryServer, it's a
 different daemon, which serves the historic information of MapReduce jobs
 only.


 On Tue, Apr 22, 2014 at 8:44 PM, sam liu samliuhad...@gmail.com wrote:

 Hi Experts,

 I am confusing on these two concepts. Could you help explain the
 differences?

 Thanks!




 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or entity
 to which it is addressed and may contain information that is confidential,
 privileged and exempt from disclosure under applicable law. If the reader
 of this message is not the intended recipient, you are hereby notified that
 any printing, copying, dissemination, distribution, disclosure or
 forwarding of this communication is strictly prohibited. If you have
 received this communication in error, please contact the sender immediately
 and delete it from your system. Thank You.





 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or entity
 to which it is addressed and may contain information that is confidential,
 privileged and exempt from disclosure under applicable law. If the reader
 of this message is not the intended recipient, you are hereby notified that
 any printing, copying, dissemination, distribution, disclosure or
 forwarding of this communication is strictly prohibited. If you have
 received this communication in error, please contact the sender immediately
 and delete it from your system. Thank You.




-- 
Thanks,
Ashwin


Re: Differences between HistoryServer and Yarn TimeLine server?

2014-04-24 Thread Zhijie Shen
Ashwin,

YARN-321 focuses on the issue in the scope of generic application history
service, while YARN-1530 covers the framework specific data service. And
yes, the timeline server is going to cover both.

We've not such a Jira before, but it is described in YARN-321's design doc.
Anyway, I open a Jira (MAPREDUCE-5858) to track this issue.


On Wed, Apr 23, 2014 at 11:25 PM, Ashwin Shankar
ashwinshanka...@gmail.comwrote:

 Hi Zhijie,
 There seems to two umbrella jiras for this - YARN-321 and YARN-1530,can
 you please let me know what is the
 difference ? Is timeline server finally going to be YARN321+YARN1530 ?

 You mentioned that MR is going to integrated with timeline server,is there
 a jira I can watch ?

 Thanks,
 Ashwin


 On Wed, Apr 23, 2014 at 10:15 PM, Zhijie Shen zs...@hortonworks.comwrote:

 Sam,

 You're right. We can definitely integrate MapReduce to use the timeline
 server to store and serve its specific data, and this is actually our plan.

 However, it's a big move, and we still need time to get it done. In
 addition, not to disturb the users that are currently relying on JHS for MR
 job information, we cannot simply remove JHS from Hadoop.


 On Wed, Apr 23, 2014 at 8:15 PM, sam liu samliuhad...@gmail.com wrote:

 Zhijie,

 I am much clear now. Thanks a lot!

 As my understanding, besides previous Job History Server, hadoop now has
 a new timeline server which could restore both the generic YARN application
 history and the framework specific information. However, I think the
 timeline server also include the functions of Job History Server, because
 it can store the framework specific information(of course, include
 mapreduce framework). In another words, Job History Server is not necessary
 any more.* If that's the case, why hadoop still include Job History
 Server?*


 2014-04-23 12:56 GMT+08:00 Zhijie Shen zs...@hortonworks.com:

  In Hadoop 2.4, we have delivered the timeline server at a preview
 stage, which actually can serve some generic YARN application history as
 well as the framework specific information. Due to the development
 logistics, we have created the two concepts: History Server and Timeline
 Server. To be simple, you can consider the history server of the service of
 the generic YARN application information, while consider the timeline
 server of the service of the framework specific information. Importantly,
 we just have one daemon, which includes both services, and which we'd like
 to call timeline server (unfortunately, the confusing thing is that the
 command to start the daemon is historyserver). We're going on working on
 the timeline server to integrate these two parts, including refactoring the
 names.

 BTW, if you mean MapReduce JobHistoryServer by HistoryServer, it's a
 different daemon, which serves the historic information of MapReduce jobs
 only.


 On Tue, Apr 22, 2014 at 8:44 PM, sam liu samliuhad...@gmail.comwrote:

 Hi Experts,

 I am confusing on these two concepts. Could you help explain the
 differences?

 Thanks!




 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or
 entity to which it is addressed and may contain information that is
 confidential, privileged and exempt from disclosure under applicable law.
 If the reader of this message is not the intended recipient, you are hereby
 notified that any printing, copying, dissemination, distribution,
 disclosure or forwarding of this communication is strictly prohibited. If
 you have received this communication in error, please contact the sender
 immediately and delete it from your system. Thank You.





 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or entity
 to which it is addressed and may contain information that is confidential,
 privileged and exempt from disclosure under applicable law. If the reader
 of this message is not the intended recipient, you are hereby notified that
 any printing, copying, dissemination, distribution, disclosure or
 forwarding of this communication is strictly prohibited. If you have
 received this communication in error, please contact the sender immediately
 and delete it from your system. Thank You.




 --
 Thanks,
 Ashwin





-- 
Zhijie Shen
Hortonworks Inc.
http://hortonworks.com/

-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank 

Re: Differences between HistoryServer and Yarn TimeLine server?

2014-04-23 Thread sam liu
Zhijie,

I am much clear now. Thanks a lot!

As my understanding, besides previous Job History Server, hadoop now has a
new timeline server which could restore both the generic YARN application
history and the framework specific information. However, I think the
timeline server also include the functions of Job History Server, because
it can store the framework specific information(of course, include
mapreduce framework). In another words, Job History Server is not necessary
any more.* If that's the case, why hadoop still include Job History Server?*


2014-04-23 12:56 GMT+08:00 Zhijie Shen zs...@hortonworks.com:

 In Hadoop 2.4, we have delivered the timeline server at a preview stage,
 which actually can serve some generic YARN application history as well as
 the framework specific information. Due to the development logistics, we
 have created the two concepts: History Server and Timeline Server. To be
 simple, you can consider the history server of the service of the generic
 YARN application information, while consider the timeline server of the
 service of the framework specific information. Importantly, we just have
 one daemon, which includes both services, and which we'd like to call
 timeline server (unfortunately, the confusing thing is that the command to
 start the daemon is historyserver). We're going on working on the
 timeline server to integrate these two parts, including refactoring the
 names.

 BTW, if you mean MapReduce JobHistoryServer by HistoryServer, it's a
 different daemon, which serves the historic information of MapReduce jobs
 only.


 On Tue, Apr 22, 2014 at 8:44 PM, sam liu samliuhad...@gmail.com wrote:

 Hi Experts,

 I am confusing on these two concepts. Could you help explain the
 differences?

 Thanks!




 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or entity
 to which it is addressed and may contain information that is confidential,
 privileged and exempt from disclosure under applicable law. If the reader
 of this message is not the intended recipient, you are hereby notified that
 any printing, copying, dissemination, distribution, disclosure or
 forwarding of this communication is strictly prohibited. If you have
 received this communication in error, please contact the sender immediately
 and delete it from your system. Thank You.


Re: Differences between HistoryServer and Yarn TimeLine server?

2014-04-23 Thread Zhijie Shen
Sam,

You're right. We can definitely integrate MapReduce to use the timeline
server to store and serve its specific data, and this is actually our plan.

However, it's a big move, and we still need time to get it done. In
addition, not to disturb the users that are currently relying on JHS for MR
job information, we cannot simply remove JHS from Hadoop.


On Wed, Apr 23, 2014 at 8:15 PM, sam liu samliuhad...@gmail.com wrote:

 Zhijie,

 I am much clear now. Thanks a lot!

 As my understanding, besides previous Job History Server, hadoop now has a
 new timeline server which could restore both the generic YARN application
 history and the framework specific information. However, I think the
 timeline server also include the functions of Job History Server, because
 it can store the framework specific information(of course, include
 mapreduce framework). In another words, Job History Server is not necessary
 any more.* If that's the case, why hadoop still include Job History
 Server?*


 2014-04-23 12:56 GMT+08:00 Zhijie Shen zs...@hortonworks.com:

 In Hadoop 2.4, we have delivered the timeline server at a preview stage,
 which actually can serve some generic YARN application history as well as
 the framework specific information. Due to the development logistics, we
 have created the two concepts: History Server and Timeline Server. To be
 simple, you can consider the history server of the service of the generic
 YARN application information, while consider the timeline server of the
 service of the framework specific information. Importantly, we just have
 one daemon, which includes both services, and which we'd like to call
 timeline server (unfortunately, the confusing thing is that the command to
 start the daemon is historyserver). We're going on working on the
 timeline server to integrate these two parts, including refactoring the
 names.

 BTW, if you mean MapReduce JobHistoryServer by HistoryServer, it's a
 different daemon, which serves the historic information of MapReduce jobs
 only.


 On Tue, Apr 22, 2014 at 8:44 PM, sam liu samliuhad...@gmail.com wrote:

 Hi Experts,

 I am confusing on these two concepts. Could you help explain the
 differences?

 Thanks!




 --
 Zhijie Shen
 Hortonworks Inc.
 http://hortonworks.com/

 CONFIDENTIALITY NOTICE
 NOTICE: This message is intended for the use of the individual or entity
 to which it is addressed and may contain information that is confidential,
 privileged and exempt from disclosure under applicable law. If the reader
 of this message is not the intended recipient, you are hereby notified that
 any printing, copying, dissemination, distribution, disclosure or
 forwarding of this communication is strictly prohibited. If you have
 received this communication in error, please contact the sender immediately
 and delete it from your system. Thank You.





-- 
Zhijie Shen
Hortonworks Inc.
http://hortonworks.com/

-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.


Re: Differences between HistoryServer and Yarn TimeLine server?

2014-04-22 Thread Zhijie Shen
In Hadoop 2.4, we have delivered the timeline server at a preview stage,
which actually can serve some generic YARN application history as well as
the framework specific information. Due to the development logistics, we
have created the two concepts: History Server and Timeline Server. To be
simple, you can consider the history server of the service of the generic
YARN application information, while consider the timeline server of the
service of the framework specific information. Importantly, we just have
one daemon, which includes both services, and which we'd like to call
timeline server (unfortunately, the confusing thing is that the command to
start the daemon is historyserver). We're going on working on the
timeline server to integrate these two parts, including refactoring the
names.

BTW, if you mean MapReduce JobHistoryServer by HistoryServer, it's a
different daemon, which serves the historic information of MapReduce jobs
only.


On Tue, Apr 22, 2014 at 8:44 PM, sam liu samliuhad...@gmail.com wrote:

 Hi Experts,

 I am confusing on these two concepts. Could you help explain the
 differences?

 Thanks!




-- 
Zhijie Shen
Hortonworks Inc.
http://hortonworks.com/

-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.