[ 
https://issues.apache.org/jira/browse/NUTCH-859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894051#action_12894051
 ] 

Julien Nioche commented on NUTCH-859:
-------------------------------------

{quote}

1. Build.xml file
org/apache/nutch/scoring/webgraph/**/*.java,org/apache/nutch/tools/compat/**/*.java 
no longer exist

(target compile-core-test, compile-core)

{quote}

The webgraph has not been ported to NutchBase yet. However, since there is no 
code in these packages, this line in build.xml has no effect and can be 
removed.

{quote}
2. solrindex-mapping.xml is not added
{quote}

Indeed, well spotted.

{quote}
We should convert SolrIndexer.java to run the solrindex command in local mode
{quote}

What do you mean by that? Embedding SOLR in the reducers in order to generate 
an index locally? You can file that as a new feature for 2.0; what we are 
trying to do here is make sure we are not missing any of the improvements made 
on the trunk in the last 12+ months.

{quote}
3. /java/org/apache/nutch/crawl/FetchScheduleFactory.java

No need to use Class<?>, just Class is enough
{quote}

ok - not really relevant here (see comment above)
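
For reference, a minimal Java sketch of the two forms being compared. The class 
and method names are hypothetical and are not taken from 
FetchScheduleFactory.java. Both forms behave the same at runtime; the raw form 
is the one that current javac flags when run with -Xlint:rawtypes.

```java
public class ClassTypeSketch {

    /** Wildcard form: accepts any Class and compiles without warnings. */
    static String nameOf(Class<?> clazz) {
        return clazz.getName();
    }

    /** Raw form: also compiles, but is reported as a raw type by -Xlint:rawtypes. */
    @SuppressWarnings("rawtypes")
    static String nameOfRaw(Class clazz) {
        return clazz.getName();
    }

    public static void main(String[] args) {
        // Both calls return the same fully qualified class name.
        System.out.println(nameOf(String.class));
        System.out.println(nameOfRaw(String.class));
    }
}
```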

{quote}
4. nutch-default.xml

query-(basic|site|url)|summary-basic is not used
{quote}

True. We'll also need to remove the params in nutch-default.xml that won't be 
used anymore. This can be done after we move to trunk, though. 

{quote}
5. nutchbase uses fetcher.threads.per.queue; why not use other features like 
fetcher.threads.per.host, fetcher.threads.per.host.by.ip?
{quote}

Things are handled differently in the code, but the feature is the same. Look 
at the code for more details.
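
One way a single per-queue limit could cover both the per-host and per-IP 
cases is to vary only the key that assigns a URL to a queue. The sketch below 
illustrates that idea under stated assumptions; it is not the NutchBase 
fetcher code, and the names QueueKeySketch, Mode, queueKey and group are 
hypothetical. The resolver argument stands in for a DNS lookup so the example 
runs without network access.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

public class QueueKeySketch {

    /** Hypothetical grouping modes; these are not Nutch configuration values. */
    enum Mode { BY_HOST, BY_IP }

    /** Returns the ID of the queue a URL would be placed in. */
    static String queueKey(String url, Mode mode, Function<String, String> resolver) {
        String host = URI.create(url).getHost();
        // BY_IP maps the host through the resolver; BY_HOST uses the host name itself.
        return mode == Mode.BY_IP ? resolver.apply(host) : host;
    }

    /** Counts how many URLs land in each queue under the given mode. */
    static Map<String, Integer> group(String[] urls, Mode mode,
                                      Function<String, String> resolver) {
        Map<String, Integer> sizes = new HashMap<>();
        for (String url : urls) {
            sizes.merge(queueKey(url, mode, resolver), 1, Integer::sum);
        }
        return sizes;
    }
}
```

With this shape, a threads-per-queue limit applies per host or per IP 
depending only on which key is chosen.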

{quote}
6. Some Java source files contain $Id and some do not, so the style is not consistent

// $Id: PrefixURLFilter.java 823614 2009-10-09 17:02:32Z ab $
// $Id$
{quote}

ok, can be part of the code cleanup after the move to trunk

{quote}
We should define standard formatting rules for the source code, then use 
Eclipse to format the whole source code in the same style
{quote}

agreed - can you create a separate JIRA for this please?

Thanks for your comments!



> Diff trunk and NutchBase 
> -------------------------
>
>                 Key: NUTCH-859
>                 URL: https://issues.apache.org/jira/browse/NUTCH-859
>             Project: Nutch
>          Issue Type: Task
>    Affects Versions: nutchbase
>            Reporter: Julien Nioche
>            Priority: Blocker
>             Fix For: nutchbase
>
>
> Before we turn NutchBase into trunk we need to make sure that all (more or 
> less) recent changes in the trunk have been ported to NutchBase. I have done 
> that recently but given that there is a very large number of changes I might 
> have missed a few things here and there.  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
