I may be loosing all and every credit ... it's still in the same state -
reindex doesn't change the subcollection field! 

I did a REFETCH by mistake (before reindex), and I was happy to notice that
subcollections were changed - but I assumed it happened only due to reindex.

However I am looking for REINDEX only - and subcollection field looks that
it doesn't change (on corresponding changes on subcollection.xml file).

Any help in debugging would be greatly appreciated... however I'm not
acquinted to java to pursue this by myself.

thanks


liv wrote:
> 
> I have no ideea why this hapened - probably due to luke, because of it not
> re-reading the indexes? very strange!
> 
> Anyway, it works as it should - after a reindex the subcollection field is
> populated with latest data.
> 
> Please excuse my insistence and my clumsiness, and thanks for your
> answers.
> 
> 
> 
> liv wrote:
>> 
>> Unfortunately my java knowledge is too poor to debug this one. However I
>> doubt that the file "subcollections.xml" from inside the nutch-xxx.job is
>> used. This because the file nutchxxx.job is old enough - has the date
>> since the day I made he nutch installation.
>> 
>> 
>> Sami Siren-2 wrote:
>>> 
>>> liv wrote:
>>>> - I reindex the db: delete folder "indexes", run the command:
>>>> 
>>>> bin/nutch index crawl/indexes crawl/crawldb crawl/linkdb
>>>> crawl/segments/*
>>>> 
>>>> - then I inspect the resulting db with luke again
>>>> 
>>>> Unfortunately nothing has changed. Maybe I am missing something...
>>>> Please
>>>> tell me if you see anything wrong.
>>> 
>>> If you did exactly those steps then what happens is that the
>>> subcollections.xml is read from inside the .job file. You need to
>>> rebuild the .job to put new file inside of it.
>>> 
>>> simply do "ant" and rerun indexing and it should work as expected.
>>> 
>>> --
>>>  Sami Siren
>>> 
>>> 
>>> 
>> 
>> 
> 
> 

-- 
View this message in context: 
http://www.nabble.com/subcollections-tf2821188.html#a7935139
Sent from the Nutch - User mailing list archive at Nabble.com.


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to