Julien Nioche created NUTCH-1805:
------------------------------------

             Summary: Remove unnecessary transitive dependencies from Hadoop 
core 
                 Key: NUTCH-1805
                 URL: https://issues.apache.org/jira/browse/NUTCH-1805
             Project: Nutch
          Issue Type: Improvement
          Components: build
    Affects Versions: 1.8
            Reporter: Julien Nioche


The Hadoop libs are not included in the job file as a Hadoop cluster must 
already be available in order to use it, however some of its transitive 
dependencies make it to the job file. We already prevent some but could extend 
that to :
                        <exclude org="org.mortbay.jetty"/>
                        <exclude org="com.sun.jersey"/>
                        <exclude org="tomcat"/>
Note that we need some of the Hadoop classes and dependencies in order to run 
Nutch in local mode.

Alternatively we could have a separate Ivy profile only for Hadoop and store 
the dependencies in a separate location so that they do not get copied to the 
job jar, however this is probably an overkill if the dependencies above are not 
needed when running in local mode.






--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to