I found a fix, apparently it is the default Tomcat install which is incorrect and creates problems in the forms
http://www.bedework.org/trac/bedework/ticket/243 Add : URIEncoding="UTF-8" in the appropriate connector in the server.xml Restart Tomcat / nutch and it works. Maybe worth a modification in the Wiki Install Guide. 2009/7/18 MilleBii <[email protected]> > I'm testing NUTCH in a French setup, I just came accross an accent problem > when doing search > > Debugging I found : > + search.jsp works all over in UTF-8, so the query string is UTF-8 encoded > + yet it calls Entities.encode which assumes strings 8bits encoding, > probably iso-latin-1 > > Anyone came across this issue or having a patch for this ? > > Just to make sure I replaced all utf-8 declaration in search.jsp, by > iso-8859-1, works fine & perfect. > > Looks like a bug to me. > > -- > -MilleBii- > -- -MilleBii-
