[ 
https://issues.apache.org/jira/browse/NUTCH-2202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16341852#comment-16341852
 ] 

ASF GitHub Bot commented on NUTCH-2202:
---------------------------------------

HansBrende commented on issue #97: NUTCH-2202 Integration of Anthelion (Focused 
Crawling Module) into Nutch
URL: https://github.com/apache/nutch/pull/97#issuecomment-360942425
 
 
   @lewismc I can't get your NUTCH-2202 branch to build.
   
   I'm doing:
   ```
   git clone https://github.com/lewismc/nutch
   cd nutch
   git checkout NUTCH-2202
   ant
   ```
   
   which is giving me:
   <pre>
   Buildfile: /Users/hansbrende/nutch/build.xml
   Trying to override old definition of task javac
     [taskdef] Could not load definitions from resource 
org/sonar/ant/antlib.xml. It could not be found.
   
   ivy-probe-antlib:
   
   ivy-download:
     [taskdef] Could not load definitions from resource 
org/sonar/ant/antlib.xml. It could not be found.
   </pre>
   .
   .
   .
   <pre>
   init:
       [mkdir] Created dir: /Users/hansbrende/nutch/build/anthelion
       [mkdir] Created dir: /Users/hansbrende/nutch/build/anthelion/classes
       [mkdir] Created dir: /Users/hansbrende/nutch/build/anthelion/test
       [mkdir] Created dir: /Users/hansbrende/nutch/build/anthelion/test/lib
       [mkdir] Created dir: /Users/hansbrende/nutch/build/plugins/anthelion
   
   init-plugin:
   
   deps-jar:
   
   init:
   
   init-plugin:
   
   clean-lib:
   
   resolve-default:
   [ivy:resolve] :: loading settings :: file = 
/Users/hansbrende/nutch/ivy/ivysettings.xml
   
   compile:
   
   jar:
   
   clean-lib:
   
   resolve-default:
   [ivy:resolve] :: loading settings :: file = 
/Users/hansbrende/nutch/ivy/ivysettings.xml
   
   compile:
        [echo] Compiling plugin: anthelion
       [javac] Compiling 34 source files to 
/Users/hansbrende/nutch/build/anthelion/classes
       [javac] 
/Users/hansbrende/nutch/src/plugin/anthelion/src/java/org/apache/nutch/anthelion/classifier/NutchOnlineClassifier.java:35:
 error: cannot find symbol
       [javac] import moa.core.InstancesHeader;
       [javac]                ^
       [javac]   symbol:   class InstancesHeader
       [javac]   location: package moa.core
       [javac] 
/Users/hansbrende/nutch/src/plugin/anthelion/src/java/org/apache/nutch/anthelion/framework/AnthOnlineClassifier.java:33:
 error: cannot find symbol
       [javac] import moa.core.InstancesHeader;
       [javac]                ^
       [javac]   symbol:   class InstancesHeader
       [javac]   location: package moa.core
       [javac] 
/Users/hansbrende/nutch/src/plugin/anthelion/src/java/org/apache/nutch/anthelion/mao/DataManipulationFilter.java:19:
 error: cannot find symbol
       [javac] import moa.core.InstancesHeader;
       [javac]                ^
       [javac]   symbol:   class InstancesHeader
       [javac]   location: package moa.core
   </pre>
   .
   .
   .
   <pre>
       [javac] 46 errors
   
   BUILD FAILED
   /Users/hansbrende/nutch/build.xml:116: The following error occurred while 
executing this line:
   /Users/hansbrende/nutch/src/plugin/build.xml:37: The following error 
occurred while executing this line:
   /Users/hansbrende/nutch/src/plugin/build-plugin.xml:133: Compile failed; see 
the compiler error output for details.
   </pre>
   
   Am I doing something wrong?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Integration of Anthelion (Focused Crawling Module) into Nutch
> -------------------------------------------------------------
>
>                 Key: NUTCH-2202
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2202
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser, scoring
>            Reporter: Robert Meusel
>            Assignee: Lewis John McGibbney
>            Priority: Major
>              Labels: any23, online_learning
>
> We have recently released anthelion, which is a focused crawler plugin for 
> structured data which can be extracted with any23. 
> (https://github.com/yahoo/anthelion) As proposed by Lewis (Lewis John 
> McGibbney) we think the integration of the parser (any23) and the scoring 
> function based on the online learner could be a good improvement for nutch. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to