Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "RunNutchInEclipse" page has been changed by kiranchitturi:
http://wiki.apache.org/nutch/RunNutchInEclipse?action=diff&rev1=38&rev2=39

   * Ubuntu Release 11.04 (natty)
    . Kernel Linux 2.6.38-10-generic GNOME 2.32.1
   * Windows Vista (Service Edition 2)
+ 
+ The tutorial here works fine for Nutch 1.6 and 2.x series as well with couple 
of changes and fixing dependencies. Check the bottom section for suggestions to 
fixes.
  
  == Before you start ==
  Setting up Nutch to run into Eclipse can be tricky, and most of the time you 
are much faster if you edit Nutch in Eclipse but run the scripts from the 
command line. However, it's very useful to be able to debug Nutch in Eclipse 
and is also extremely useful when applying and testing patches as it enables 
you to see them working in a larger context. This being said, you will still 
benefit greatly by looking at the hadoop.log output.
@@ -32, +34 @@

  Use the Subclipse plugin to check out the latest Nutch Trunk development.
  
   * File > New > Project > SVN > Checkout Projects from SVN
-  * Create new repository location > 
https://svn.apache.org/repos/asf/nutch/trunk
+  * Create new repository location > 
https://svn.apache.org/repos/asf/nutch/trunk  
+ {{{
+ Use https://svn.apache.org/repos/asf/nutch/branches/2.1/ for 2.1 version. The 
trunk is 1.6 version now.
+ }}}
   * Subclipse will ask some additional configuration options, at this stage 
checkout the trunk source as a project configured using the '''New Project 
Wizard'''. Ensure that you're checking out the HEAD revision, then progress to 
Finish.
   * The Wizard will prompt you to choose a project, so navigate to Java > Java 
Project > next
   * Enter your Project name (trunk) and ensure that the '''create separate 
folders for sources and class files''' option is activated.
@@ -43, +48 @@

   * Ensure that you're in the Package Explorer > right click on Trunk Project 
folder.
   * The only Source folder will be trunk/src > '''Remove''' this folder > Add 
Folder > expand trunk/src and check src/bin, src/java, src/test & 
src/testresources.
   * In addition, we must manually add '''EVERY''' individual plugin src/java 
and src/test folder, although this takes some time it is absolutely essential 
that this is done.
-  * In the Libraries tab, click Add Class Folder and add src/conf to the 
classpath.
+  * In the Libraries tab, click Add Class Folder and add /conf to the 
classpath.
   * Still in the Libraries tab add JARs > 
src/plugin/urlfilter-automaton/lib/automaton.jar & 
src/plugin/parse-swf/lib/javaswf.jar
   * Remaining in the Libraries tab Add Library > IvyDE Managed Dependencies > 
browse to trunk/ivy/ivy.xml > ensure '''ALL''' configuration boxes are included.
   * Go to "Order and Export" tab, find the entry for added "conf" folder (it 
will most likely be at the bottom of the list) and move it to the top (by 
checking it and clicking the "Top" button). This is required so Eclipse will 
take config (nutch-default.xml, etc.) resources from our "conf" folder and not 
from somewhere else.
@@ -121, +126 @@

  == If things do not work... ==
  Yes, Nutch and Eclipse can be a difficult companionship sometimes ;-)
  
+ === Missing dependencies ===
+ I have found the following dependencies missing. 
+ 1) [[http://sourceforge.net/projects/jsch/?source=dlp|jsch]]
+ 2) [[http://sourceforge.net/projects/nekohtml/|nekohtml]]
+ 3) 
[[http://grepcode.com/snapshot/repository.springsource.com/com.sun.syndication/com.springsource.com.sun.syndication/1.0.0|com.sun.syndication]]
+ 4) [[http://www.jarfinder.com/index.php/jars/versionInfo/2828|tagsoup]]
+ Adding the above jar files to the buildPath using 'external jars' option has 
resolved the errors.
+ 
  === eclipse: Cannot create project content in workspace ===
  The Nutch source code must be out of the workspace folder. Alternatively you 
can download the code with eclipse (svn) under your workspace rather than try 
to create the project using existing code, eclipse sometimes doesn't let you do 
it from source code into the workspace.
  
@@ -147, +160 @@

   * add the Hadoop project as a dependent project of Nutch project
   * you can now also set break points within Hadoop classes like inputformat 
implementations etc.
  
+ === Non-ported Plugins to 2.x ===
+ Few plugins were not ported to Nutch 2.x series yet. If you are following the 
above tutorial for building Nutch 2.x, please check [[Nutch2Plugins]] for more 
information
+ 
+ == Other Resources ==
+ http://florianhartl.com/nutch-installation.html.
+ http://techvineyard.blogspot.com/2010/12/build-nutch-20.html
+ 

Reply via email to