http://wiki.apache.org/nutch/GettingNutchRunningWithUtf8 suggests changing 
the configuration of the application server in order to handle UTF-8 
characters in GET parameters. Why can't the webapp just switch the 
"request" object to UTF-8 encoding, e.g. by placing the following in the 
<head> section of the query form page:

<jsp:scriptlet> request.setCharacterEncoding("UTF-8"); </jsp:scriptlet>

I'm doing this with a Lucene-based custom web application running on Tomcat 
5.5, and it appears to work fine.
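
To illustrate what is at stake, here is a minimal standalone sketch (not 
Nutch or Tomcat code; the class name QueryDecodingDemo and the sample value 
are made up for the example). It decodes the same percent-encoded GET 
parameter value once as UTF-8 and once as ISO-8859-1, using only 
java.net.URLDecoder. Whichever charset the container ends up applying to 
the query string decides which of the two results the webapp sees.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;

public class QueryDecodingDemo {

    // Decode a percent-encoded parameter value using the named charset.
    public static String decode(String rawValue, String charsetName) {
        try {
            return URLDecoder.decode(rawValue, charsetName);
        } catch (UnsupportedEncodingException e) {
            throw new IllegalArgumentException("Unknown charset: " + charsetName, e);
        }
    }

    public static void main(String[] args) {
        // Hypothetical sample: "caf" plus e-acute, percent-encoded as UTF-8 bytes C3 A9.
        String raw = "caf%C3%A9";

        // Decoded as UTF-8, the two bytes form one character: 4 characters in total.
        System.out.println(decode(raw, "UTF-8").length());      // prints 4

        // Decoded as ISO-8859-1, each byte becomes its own character: 5 in total.
        System.out.println(decode(raw, "ISO-8859-1").length()); // prints 5
    }
}
```

The lengths are printed rather than the strings themselves so that the 
output does not depend on the console encoding.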

Enzo



_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
