How to set up Nutch in Eclipse IDE

2009-01-14 Thread Pradeep Pujari

Hi All,

How to set up Nutch code in Eclipse IDE? I studied the wiki page, but looks
like the instructions are very old. Any help is highly appreciated.

Thanks/Regards
Pradeep Pujari
415-422-1678



Re: How to set up Nutch in Eclipse IDE

2009-01-15 Thread Pradeep Pujari
  TestRegexURLNormalizer.java   line 1311232003049175 7381
ArrayList is a raw type. References to generic type ArrayList should be
parameterized nutch/trunk/src/test/org/apache/nutch/crawl
TestLinkDbMerger.java   line 1271232003049515 7416
ArrayList is a raw type. References to generic type ArrayList should be
parameterized nutch/trunk/src/test/org/apache/nutch/crawl
TestLinkDbMerger.java   line 1271232003049515 7417
Class is a raw type. References to generic type Class should be
parameterized
nutch/trunk/contrib/web2/src/main/java/org/apache/nutch/webapp/common
PluginResourceLoader.java line 92 1232003044077 6367
Class is a raw type. References to generic type Class should be
parameterized
nutch/trunk/contrib/web2/src/main/java/org/apache/nutch/webapp/common
PluginResourceLoader.java line 2241232003044077 6388

Thanks/Regards
Pradeep Pujari
415-422-1678


   
 Edwin Chu 
  To 
   "nutch-dev@lucene.apache.org"   
 01/14/2009 09:09 
 PM cc 
   
   Subject 
 Please respond to Re: How to set up Nutch in Eclipse  
 nutch-...@lucene. IDE 
apache.org  
  
   
   
   
   




Hi

I have just set up eclipse environment following the wiki without much
trouble. Can you describe the problems you have?

Warm wishes,
Edwin Chu

• mobile: (+852) 9834 0556
(Sent from iPhone. Sorry for curt text.)

On 15 Jan 2009, at 12:56 PM, Pradeep Pujari 
wrote:

>
> Hi All,
>
> How to set up Nutch code in Eclipse IDE? I studied the wiki page,
> but looks
> like the instructions are very old. Any help is highly appreciated.
>
> Thanks/Regards
> Pradeep Pujari
> 415-422-1678
>



Re: How to set up Nutch in Eclipse IDE

2009-01-15 Thread Pradeep Pujari
How to disable them?

Thanks/Regards
Pradeep Pujari
415-422-1678


   
 Pau   
   To 
   nutch-dev@lucene.apache.org 
 01/15/2009 11:23   cc 
 AM
   Subject 
   Re: How to set up Nutch in Eclipse  
 Please respond to IDE 
 nutch-...@lucene. 
apache.org 
   
   
   
   




I see that the first errors are about the RTFParseFactory.java
In http://wiki.apache.org/nutch/RunNutchInEclipse explain that the
parse-rtf, parse-mp3, and contrib/ plugins are not built correctly. Have
you disabled them?

On Thu, Jan 15, 2009 at 7:21 PM, Pradeep Pujari 
wrote:
  Hi Edwin,

  I used to subEclipse plug-in to check out from trunk. Then, I created
  Java
  Project, so the steps are not exactly the same as described in wiki. Now
  I
  am getting compilation errors. Here is what I did.

  1. created a java project

  2. Imported from SVN option /../trunk/src directory.

  3. When compiled,

  Severity and Description  Path  ResourceLocationCreation Time
  Id
  e cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParseFactory.javaline 66 1232003048273 7125
  IN_DOCUMENT cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParserDelegateImpl.javaline 77 1232003048263 7114
  IN_DOCUMENT cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParserDelegateImpl.javaline 1011232003048263 7116
  IN_INFO cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParserDelegateImpl.javaline 74 1232003048263 7113
  IN_INFO cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParserDelegateImpl.javaline 93 1232003048263 7115
  ParseException cannot be resolved to a type
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParseFactory.javaline 63 1232003048273 7124
  RTFParser cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParseFactory.javaline 57 1232003048273 7123
  RTFParser cannot be resolved to a type
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParseFactory.javaline 56 1232003048273 7122
  RTFParserDelegate cannot be resolved to a type
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParserDelegateImpl.javaline 37 1232003048263 7110
  The constructor Hit(int, int) is undefined
  nutch/trunk/contrib/web2/src/main/java/org/apache/nutch/webapp/controller
  AnchorsController.java  line 63 1232003043937 6310
  The constructor Hit(int, int) is undefined
  nutch/trunk/contrib/web2/src/main/java/org/apache/nutch/webapp/controller
  CachedController.java   line 45 1232003043917 6304
  The constructor Hit(int, int) is undefined
  nutch/trunk/contrib/web2/src/main/java/org/apache/nutch/webapp/controller
  ExplainController.java  line 42 1232003043917 6303
  The constructor Hit(int, int) is undefined
  nutch/trunk/contrib/web2/src/main/java/org/apache/nutch/webapp/servlet
  CachedServlet.java  line 60 1232003043877 6287
  The constructor Outlink(String, String, Configuration) is undefined
  nutch/trunk/src/plugin/parse-mp3/src/java/org/apache/nutch/parse/mp3
  MetadataCollector.java  line 54 1232003047903 7023
  The import com.etranslate cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParseFactory.javaline 39 1232003048273 7119
  The import com.etranslate cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParseFactory.javaline 40 1232003048273 7120
  The import com.etranslate cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParserDelegateImpl.javaline 21 1232003048263 7109
  The method getIndexDocNo() is undefined for the type Hit
  nutch/trunk/contrib/web2/src/main

Re: How to set up Nutch in Eclipse IDE

2009-01-16 Thread Pradeep Pujari
I am able to compile with Java 5.0. I am getting following error when I
execute this below command. Please help.


p139...@ft000xd3149416
/cygdrive/c/eclipse/eclipse/workspace/nutch_trunk/bin
$ nutch crawl urls -dir crawl -depth 3 -topN 50


Exception in thread "main" java.lang.UnsupportedClassVersionError:
org/apache/ha
doop/util/PlatformName (Unsupported major.minor version 50.0)
at java.lang.ClassLoader.defineClass0(Native Method)
at java.lang.ClassLoader.defineClass(ClassLoader.java:509)
at
java.security.SecureClassLoader.defineClass(SecureClassLoader.java:12
3)
at java.net.URLClassLoader.defineClass(URLClassLoader.java:246)
at java.net.URLClassLoader.access$100(URLClassLoader.java:54)
at java.net.URLClassLoader$1.run(URLClassLoader.java:193)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:186)
at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:265)
at java.lang.ClassLoader.loadClass(ClassLoader.java:262)
at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:322)
Exception in thread "main" java.lang.NoClassDefFoundError:
org/apache/nutch/craw
l/Crawl


Thanks/Regards
Pradeep Pujari
415-422-1678


   
 Pradeep   
 Pujari/MCOM/MACYS 
To 
 01/15/2009 11:26  nutch-dev@lucene.apache.org 
 AM cc 
   nutch-dev@lucene.apache.org 
   Subject 
   Re: How to set up Nutch in Eclipse  
   IDE(Document link: Pradeep Pujari)  
   
   
   
   
   
   



How to disable them?

Thanks/Regards
Pradeep Pujari
415-422-1678


   
 Pau   
   To 
   nutch-dev@lucene.apache.org 
 01/15/2009 11:23   cc 
 AM
   Subject 
   Re: How to set up Nutch in Eclipse  
 Please respond to IDE 
 nutch-...@lucene. 
apache.org 
   
   
   
   




I see that the first errors are about the RTFParseFactory.java
In http://wiki.apache.org/nutch/RunNutchInEclipse explain that the
parse-rtf, parse-mp3, and contrib/ plugins are not built correctly. Have
you disabled them?

On Thu, Jan 15, 2009 at 7:21 PM, Pradeep Pujari 
wrote:
  Hi Edwin,

  I used to subEclipse plug-in to check out from trunk. Then, I created
  Java
  Project, so the steps are not exactly the same as described in wiki. Now
  I
  am getting compilation errors. Here is what I did.

  1. created a java project

  2. Imported from SVN option /../trunk/src directory.

  3. When compiled,

  Severity and Description  Path  ResourceLocationCreation Time
  Id
  e cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParseFactory.javaline 66 1232003048273 7125
  IN_DOCUMENT cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParserDelegateImpl.javaline 77 1232003048263 7114
  IN_DOCUMENT cannot be resolved
  nutch/trunk/src/plugin/parse-rtf/src/java/org/apache/nutch/parse/rtf
  RTFParserDelegateImpl.javaline 1011232003048263 7116

Re: NTCH-635 LinkAnalysis Tool for Nutch

2009-02-12 Thread Pradeep Pujari
Where is the document, can you please send me?

Thanks/Regards
Pradeep Pujari



   
 "Eric J.  
 Christeson"   
  nutch-dev@lucene.apache.org 
cc 
 02/12/2009 04:05  
 PMSubject 
   NTCH-635 LinkAnalysis Tool for  
   Nutch   
 Please respond to 
 nutch-...@lucene. 
apache.org 
   
   
   




I went through org.apache.nutch.scoring.webgraph.* found all the
config settings I could, threw them into nutch-default.xml and tried
to document them.  Who wants the patches?

Eric
--
Eric J. Christeson

Enterprise Computing and Infrastructure(701) 231-8693 (Voice)
North Dakota State University