Hi, this question has been asked by other posters to this list, but I haven't seen an answer yet; hopefully someone can help.
I have recrawling working for v0.71, but I can't get it working on v0.9 using the v0.8 wiki scripts. They appear to do a recrawl, but no new documents appear in the index.

I have been able to merge two separate indexes into one. However, with an index of 500,000 documents, I am concerned about how efficient that will be if, on a daily basis, I want to add 100 or so new documents and reindex 300. The source material is from a document management system accessed by URLs, and I will know exactly which documents are new and which have been updated and require reindexing.

Do the scripts work, so that I need to check again how I am using them, or do I need to look at something else?

Regards,
John Reidy
