I suggest you first open the index with Luke and check that the encoding is detected correctly, then run a search from Luke to see whether you get any results. Next, you can invoke org.apache.nutch.searcher.Query to see whether your query is parsed and translated correctly. Finally, check whether Tomcat is using UTF-8 encoding.
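
To illustrate why the Tomcat check matters, here is a minimal sketch of what probably happens to the query, assuming the browser sends it as UTF-8 and something on the server side decodes it with a single-byte charset. The class and method names (MojibakeDemo, misdecode, repair) are made up for this example and are not part of Nutch, and windows-1252 is my guess based on the garbled text in the search field.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of Nutch.
public class MojibakeDemo {
    // Assumption: the wrong charset is windows-1252, inferred from the garbled output.
    static final Charset WRONG = Charset.forName("windows-1252");

    // Encode as UTF-8 (what the browser sends), decode with the wrong charset.
    static String misdecode(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, WRONG);
    }

    // Reverse the damage: recover the raw bytes and decode them as UTF-8.
    static String repair(String garbled) {
        byte[] raw = garbled.getBytes(WRONG);
        return new String(raw, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // \u0142 is the Polish letter l with stroke; escaped to avoid source-encoding issues.
        String query = "Materia\u0142y dydaktyczne";
        String garbled = misdecode(query);
        // The two UTF-8 bytes of \u0142 (0xC5 0x82) become two separate characters.
        System.out.println(garbled.equals("Materia\u00C5\u201Ay dydaktyczne"));
        System.out.println(repair(garbled).equals(query));
    }
}
```

If this matches what you see, the index itself is probably fine and the query is being damaged before it reaches the searcher. In that case the URIEncoding attribute on the Connector element in Tomcat's server.xml is one place to look, since older Tomcat versions decode request URIs as ISO-8859-1 by default.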

Karol Rybak wrote:
> Hello, i have set up nutch to do some more testing, after indexing couple
> thousand pages i tried to do some searching. Everything works fine, however
> there's one problem, i cannot search using polish characters. I tried
> searching for a query like "Materiały dydaktyczne" i got no results, and the
> text in the search field changed into "MateriaÅ‚y dydaktyczne". Everything
> else is fine when i search for "dydaktyczne" results show up and the
> encoding is ok. Do you have any idea what could be wrong ?

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
