[jira] [Comment Edited] (HADOOP-12857) Rework hadoop-tools

2016-03-10 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-12857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15189620#comment-15189620
 ] 

Allen Wittenauer edited comment on HADOOP-12857 at 3/10/16 5:50 PM:


-02:
* documentation 
* eliminate HADOOP_TOOLS_PATH since it makes zero sense anymore with this 
layout and the other capabilities of the shell code in trunk
* rework to hopefully work with Windows. :D

Should I break this apart to send through Jenkins or ... ?


was (Author: aw):
-02:
* documentation 
* eliminate HADOOP_TOOLS_PATH since it makes zero sense anymore with this 
layout and the other capabilities of the shell code in trunk

Should I break this apart to send through Jenkins or ... ?

> Rework hadoop-tools
> ---
>
> Key: HADOOP-12857
> URL: https://issues.apache.org/jira/browse/HADOOP-12857
> Project: Hadoop Common
> Issue Type: Improvement
> Components: build
> Affects Versions: 3.0.0
> Reporter: Allen Wittenauer
> Assignee: Allen Wittenauer
> Attachments: HADOOP-12857.00.patch, HADOOP-12857.01.patch, 
> HADOOP-12857.02.patch
>
>
> As hadoop-tools grows bigger and bigger, it's becoming evident that having a 
> single directory that gets sucked in is starting to become a big burden as 
> the number of tools grows.  Let's rework this to be smarter.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (HADOOP-12857) Rework hadoop-tools

2016-03-10 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-12857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15189620#comment-15189620
 ] 

Allen Wittenauer edited comment on HADOOP-12857 at 3/10/16 5:50 PM:


-02:
* documentation 
* eliminate HADOOP_TOOLS_PATH since it makes zero sense anymore with this 
layout and the other capabilities of the shell code in trunk

Should I break this apart to send through Jenkins or ... ?


was (Author: aw):
-02:
* documentation 
* eliminate HADOOP_TOOLS_PATH since it makes zero sense anymore with this 
layout and the other capabilities of the shell code in trunk

> Rework hadoop-tools
> ---
>
> Key: HADOOP-12857
> URL: https://issues.apache.org/jira/browse/HADOOP-12857
> Project: Hadoop Common
> Issue Type: Improvement
> Components: build
> Affects Versions: 3.0.0
> Reporter: Allen Wittenauer
> Assignee: Allen Wittenauer
> Attachments: HADOOP-12857.00.patch, HADOOP-12857.01.patch, 
> HADOOP-12857.02.patch
>
>
> As hadoop-tools grows bigger and bigger, it's becoming evident that having a 
> single directory that gets sucked in is starting to become a big burden as 
> the number of tools grows.  Let's rework this to be smarter.





[jira] [Comment Edited] (HADOOP-12857) Rework hadoop-tools-dist

2016-03-02 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-12857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177090#comment-15177090
 ] 

Allen Wittenauer edited comment on HADOOP-12857 at 3/3/16 3:25 AM:

Why does the hdfs haadmin command require hadoop-tools in the classpath?  Is 
this actually a long-standing bug, or a misunderstanding of where ToolRunner 
comes from?


was (Author: aw):
Why does the hdfs haadmin command require hadoop-tools in the classpath?  Is 
this actually a long standing bug?

> Rework hadoop-tools-dist
> 
>
> Key: HADOOP-12857
> URL: https://issues.apache.org/jira/browse/HADOOP-12857
> Project: Hadoop Common
> Issue Type: Improvement
> Components: build
> Affects Versions: 3.0.0
> Reporter: Allen Wittenauer
> Assignee: Allen Wittenauer
>
> As hadoop-tools grows bigger and bigger, it's becoming evident that having a 
> single directory that gets sucked in is starting to become a big burden as 
> the number of tools grows.  Let's rework this to be smarter.





[jira] [Comment Edited] (HADOOP-12857) Rework hadoop-tools-dist

2016-02-29 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-12857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15172799#comment-15172799
 ] 

Allen Wittenauer edited comment on HADOOP-12857 at 2/29/16 10:57 PM:

FWIW, I've got some stupid/simple shell code that takes the output of mvn 
dependency:list and builds a shell profile script.

Some random notes:

* It currently looks for *ALL* of the depended-upon jars in the tools dir.  
This is inefficient for what are hopefully obvious reasons.
* HADOOP-10115 pretty much means that the shell profiles will need to be built 
well after we've processed the hadoop-tools dir in order to know what is/isn't 
already bundled via hadoop-common.


So I'm contemplating two approaches to make the latter work:

# Try to trigger mvn dependency:list in the build stage for those modules that 
need it.  Push the output through the build process up until hadoop-dist gets 
triggered. Take that output and generate the profiles then.
# In hadoop-dist, run mvn dependency:list for all (except some blacklisted 
ones) modules under hadoop-tools (and thus effectively having mvn running mvn), 
and then generate profiles as in #1.

To make matters more complicated, I've been informed over the weekend that 
Bigtop-based distributions stupidly merge all of hadoop-tools into 
hadoop-common's lib dir.  So they'll always have the perf hit and the other 
issues that a flat dir structure causes.
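
For readers who want to picture the idea, here is a rough sketch of turning mvn dependency:list output into a shell profile. It is not the shell code mentioned in this comment and it is written in Python rather than shell. The function names parse_dependency_list and build_profile, the HADOOP_TOOLS_JAR_DIR variable, and the choice to keep only compile and runtime scopes are all assumptions made for illustration. The generated script follows the hadoop_add_profile / _NAME_hadoop_classpath / hadoop_add_classpath convention of trunk's shell code as I understand it; treat that shape as an assumption too.

```python
def parse_dependency_list(text, scopes=("compile", "runtime")):
    """Return jar file names found in `mvn dependency:list` output.

    Dependency lines look like
        [INFO]    group:artifact:jar:1.0:compile
    or, with a classifier,
        [INFO]    group:artifact:jar:tests:1.0:test
    Every other [INFO] line is ignored.
    """
    jars = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 2 or parts[0] != "[INFO]":
            continue
        fields = parts[1].split(":")
        if len(fields) == 5:
            _group, artifact, dep_type, version, scope = fields
            classifier = None
        elif len(fields) == 6:
            _group, artifact, dep_type, classifier, version, scope = fields
        else:
            continue  # banner, summary, or other non-dependency line
        # Assumption: only jars needed at run time belong in the profile.
        if dep_type != "jar" or scope not in scopes:
            continue
        suffix = f"-{classifier}" if classifier else ""
        jars.append(f"{artifact}-{version}{suffix}.jar")
    return jars


def build_profile(name, jars, bundled=frozenset(),
                  jar_dir="${HADOOP_TOOLS_JAR_DIR}"):
    """Return shell profile text that adds each jar to the classpath.

    `bundled` holds jar names already shipped via hadoop-common, which
    are skipped. `jar_dir` is a hypothetical variable name.
    """
    wanted = [jar for jar in jars if jar not in bundled]
    lines = [
        f'hadoop_add_profile "{name}"',
        "",
        f"function _{name}_hadoop_classpath",
        "{",
    ]
    if wanted:
        lines += [f'  hadoop_add_classpath "{jar_dir}/{jar}"' for jar in wanted]
    else:
        lines.append("  true")  # keep the shell function body non-empty
    lines.append("}")
    return "\n".join(lines) + "\n"
```

The bundled parameter is where the HADOOP-10115 concern shows up: the set of jars already shipped with hadoop-common has to be known before the profile can be written, which is why the profiles cannot be generated until late in the build.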


was (Author: aw):
FWIW, I've got some stupid/simple shell code that takes the output of mvn 
dependency:list and builds a shell profile script.

Some random notes:

* It currently looks for *ALL* of the jars in the tools dir.  This is less than 
efficient for what are hopefully obvious reasons.
* HADOOP-10115 pretty much means that the shell profiles will need to be built 
well after we've processed the hadoop-tools dir in order to know what is/isn't 
already bundled via hadoop-common.


So contemplating two approaches in order to make the latter option work:

# Try to trigger mvn dependency:list in the build stage for those modules that 
need it.  Push the output through the build process up until hadoop-dist gets 
triggered. Take that output and generate the profiles then.
# In hadoop-dist, run mvn dependency:list for all (except some blacklisted 
ones) modules under hadoop-tools (and thus effectively having mvn running mvn), 
and then generate profiles as in #1.

To make matters more complicated, I've been informed over the weekend that 
Bigtop-based distributions stupidly merge all of hadoop-tools into 
hadoop-common's lib dir.  So they'll always have the perf hit and the other 
issues that a flat dir structure causes.

> Rework hadoop-tools-dist
> 
>
> Key: HADOOP-12857
> URL: https://issues.apache.org/jira/browse/HADOOP-12857
> Project: Hadoop Common
> Issue Type: Improvement
> Components: build
> Affects Versions: 3.0.0
> Reporter: Allen Wittenauer
>
> As hadoop-tools grows bigger and bigger, it's becoming evident that having a 
> single directory that gets sucked in is starting to become a big burden as 
> the number of tools grows.  Let's rework this to be smarter.


