Dear Andrzej Bialecki I added some words to the list in NutchAnalysis.java and tried to crawl some sites. When I searched for the original stop words, I got zero results. When I tried the added words, there were lots of them in the results. What is going wrong? Thank you
----- Original Message ----- From: "Andrzej Bialecki" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Thursday, May 10, 2007 7:14 AM Subject: Re: Stop words > Naess, Ronny wrote: >> Hi. >> >> I am living in Norway and I would like to add a stop word list. >> >> I found this https://issues.apache.org/jira/browse/NUTCH-453 in JIRA >> saying something about "moveing stop words from code to config file", >> but nothing has happend in this area it seems. >> > > Correct. Patches are welcome ;) > >> How can I add stop words with current version (0.9)? > > For now, you can simply replace the list that you can find in > NutchAnalysis.java. > > > > -- > Best regards, > Andrzej Bialecki <>< > ___. ___ ___ ___ _ _ __________________________________ > [__ || __|__/|__||\/| Information Retrieval, Semantic Web > ___|||__|| \| || | Embedded Unix, System Integration > http://www.sigram.com Contact: info at sigram dot com > > > > -- > No virus found in this incoming message. > Checked by AVG Free Edition. Version: 7.5.467 / Virus Database: > 269.6.6/795 - Release Date: 9/5/2007 15:07 > > ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
