[DISCUSSION] WebUI query plan graphs

2017-08-10 Thread Karen Coppage
Hi all,

I’m working on a feature of the Hive WebUI Query Plan tab that would
provide the option to display the query plan as a nice graph (scroll down
for screenshots). If you click on one of the graph’s stages, the plan for
that stage appears as text below.

Stages are color-coded if they have a status (Success, Error, Running), and
the rest are grayed out. Coloring is based on status already available in
the WebUI, under the Stages tab.

There is an additional option to display stats for MapReduce tasks. This
includes the job’s ID, tracking URL (where the logs are found), and mapper
and reducer numbers/progress, among other info.

The library I’m using for the graph is called vis.js (http://visjs.org/).
It has an Apache license, and the only necessary file to be included from
this library is about 700 KB.

I tried to keep server-side changes minimal, and graph generation is taken
care of by the client. Plans with more than a given number of stages
(default: 25) won't be displayed in order to preserve resources.

I’d love to hear any and all input from the community about this feature:
do you think it’s useful, and is there anything important I’m missing?

Thanks,

Karen Coppage

*

A completely successful query:

[image: Inline image 1]


A MapReduce task selected, with MapReduce stats view on:

[image: Inline image 2]


Full MapReduce stats, lacking some information because the query was run in
local mode:

[image: Inline image 3]


A non-MapReduce stage selected:

[image: Inline image 4]


Last stage running:

[image: Inline image 5]


Last stage returns error:

[image: Inline image 6]


[DISCUSSION] WebUI query plan graphs

2017-08-10 Thread Karen Coppage
I'm resending this with a link to pictures, as they could not be embedded.

Screenshots here:
https://drive.google.com/drive/folders/0B0gDaJsjA3cxMUV5SW5VQnh4aGM



Hi all,

I’m working on a feature of the Hive WebUI Query Plan tab that would
provide the option to display the query plan as a nice graph (scroll down
for link to screenshots). If you click on one of the graph’s stages, the
plan for that stage appears as text below.

Stages are color-coded if they have a status (Success, Error, Running), and
the rest are grayed out. Coloring is based on status already available in
the WebUI, under the Stages tab.

There is an additional option to display stats for MapReduce tasks. This
includes the job’s ID, tracking URL (where the logs are found), and mapper
and reducer numbers/progress, among other info.

The library I’m using for the graph is called vis.js (http://visjs.org/).
It has an Apache license, and the only necessary file to be included from
this library is about 700 KB.

I tried to keep server-side changes minimal, and graph generation is taken
care of by the client. Plans with more than a given number of stages
(default: 25) won't be displayed in order to preserve resources.

I’d love to hear any and all input from the community about this feature:
do you think it’s useful, and is there anything important I’m missing?

Thanks,

Karen Coppage


Re: [DISCUSSION] WebUI query plan graphs

2017-08-10 Thread Xuefu Zhang
Hi Karen,

Thanks for reaching out. While your message doesn't seem showing any
images, I think the feature would be a great addition to Hive. (Hive
community always welcomes contributions like this.)

Please feel free to create an JIRA for easier discussion and tracking.

Thanks again for your interest.

--Xuefu

On Thu, Aug 10, 2017 at 6:25 AM, Karen Coppage 
wrote:

> Hi all,
>
> I’m working on a feature of the Hive WebUI Query Plan tab that would
> provide the option to display the query plan as a nice graph (scroll down
> for screenshots). If you click on one of the graph’s stages, the plan for
> that stage appears as text below.
>
> Stages are color-coded if they have a status (Success, Error, Running), and
> the rest are grayed out. Coloring is based on status already available in
> the WebUI, under the Stages tab.
>
> There is an additional option to display stats for MapReduce tasks. This
> includes the job’s ID, tracking URL (where the logs are found), and mapper
> and reducer numbers/progress, among other info.
>
> The library I’m using for the graph is called vis.js (http://visjs.org/).
> It has an Apache license, and the only necessary file to be included from
> this library is about 700 KB.
>
> I tried to keep server-side changes minimal, and graph generation is taken
> care of by the client. Plans with more than a given number of stages
> (default: 25) won't be displayed in order to preserve resources.
>
> I’d love to hear any and all input from the community about this feature:
> do you think it’s useful, and is there anything important I’m missing?
>
> Thanks,
>
> Karen Coppage
>
> *
>
> A completely successful query:
>
> [image: Inline image 1]
>
>
> A MapReduce task selected, with MapReduce stats view on:
>
> [image: Inline image 2]
>
>
> Full MapReduce stats, lacking some information because the query was run in
> local mode:
>
> [image: Inline image 3]
>
>
> A non-MapReduce stage selected:
>
> [image: Inline image 4]
>
>
> Last stage running:
>
> [image: Inline image 5]
>
>
> Last stage returns error:
>
> [image: Inline image 6]
>


Re: [DISCUSSION] WebUI query plan graphs

2017-08-11 Thread Karen Coppage
Hi Xuefu,

Thanks so much for your reply!

I created a JIRA as per your suggestion; it can be found here along with
actually visible screenshots:
https://issues.apache.org/jira/browse/HIVE-17300

Any and all input, comments, criticism will be very welcome.

Thanks again,

Karen



On Thu, Aug 10, 2017 at August 10, 2017 at 6:39 AM, Xuefu Zhang <
xu...@apache.org> wrote:

Hi Karen,

Thanks for reaching out. While your message doesn't seem showing any
images, I think the feature would be a great addition to Hive. (Hive
community always welcomes contributions like this.)

Please feel free to create an JIRA for easier discussion and tracking.

Thanks again for your interest.

--Xuefu


On Thu, Aug 10, 2017 at 6:25 AM, Karen Coppage 
wrote:

Hi all,

I’m working on a feature of the Hive WebUI Query Plan tab that would
provide the option to display the query plan as a nice graph (scroll down
for screenshots). If you click on one of the graph’s stages, the plan for
that stage appears as text below.

Stages are color-coded if they have a status (Success, Error, Running), and
the rest are grayed out. Coloring is based on status already available in
the WebUI, under the Stages tab.

There is an additional option to display stats for MapReduce tasks. This
includes the job’s ID, tracking URL (where the logs are found), and mapper
and reducer numbers/progress, among other info.

The library I’m using for the graph is called vis.js (http://visjs.org/).
It has an Apache license, and the only necessary file to be included from
this library is about 700 KB.

I tried to keep server-side changes minimal, and graph generation is taken
care of by the client. Plans with more than a given number of stages
(default: 25) won't be displayed in order to preserve resources.

I’d love to hear any and all input from the community about this feature:
do you think it’s useful, and is there anything important I’m missing?

Thanks,

Karen Coppage

*

A completely successful query:

[image: Inline image 1]


A MapReduce task selected, with MapReduce stats view on:

[image: Inline image 2]


Full MapReduce stats, lacking some information because the query was run in
local mode:

[image: Inline image 3]


A non-MapReduce stage selected:

[image: Inline image 4]


Last stage running:

[image: Inline image 5]


Last stage returns error:

[image: Inline image 6]