i suggest you first open the index with luke and check that the encoding 
is detected correct, and make a search from luke to see if you get any 
answers. Then you may invoke org.apache.nutch.searcher.Query to see if 
you query is parsed and translated correctly. Finally, you may check 
tomcat whether it uses utf-8 encoding.

Karol Rybak wrote:
> Hello, i have set up nutch to do some more testing, after indexing couple
> thousand pages i tried to do some searching. Everything works fine, 
> however
> there's one problem, i cannot search using polish characters. I tried
> searching for a query like "Materiały dydaktyczne" i got no results, 
> and the
> text in the search field changed into "Materiały dydaktyczne". 
> Everything
> else is fine when i search for "dydaktyczne" results show up and the
> encoding is ok. Do you have any idea what could be wrong ?

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to