Hi,
Does anybody know how to set another character
encoding than UTF-8, which seems to be the default in
Nutch 0.8.1 on Tomcat 5 ? (Ubuntu 6.10 / Tomcat 5.0)
What I have tried :
In <tomcat_root>/conf/web.xml :
(in jsp section) :
Added :
<init-param>
<param-name>javaEncoding</param-name>
<param-value>ISO-8859-1</param-value>
</init-param>
In <tomcat_root>/webapps/ROOT/WEB-INF/web.xml :
(in <servlet-name>Cached</servlet-name> section)
Added :
<init-param>
<param-name>javaEncoding</param-name>
<param-value>ISO-8859-1</param-value>
</init-param>
Stopped and restarted Tomcat (from the crawldir folder
of Nutch)
The browser keeps showing UTF-8 encoded pages, and
french special characters are being replaced with
wrong characters.
Any idea ?
Thanks
___________________________________________________________________________
Découvrez une nouvelle façon d'obtenir des réponses à toutes vos questions !
Profitez des connaissances, des opinions et des expériences des internautes sur
Yahoo! Questions/Réponses
http://fr.answers.yahoo.com
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general