[ 
http://issues.apache.org/jira/browse/NUTCH-138?page=comments#action_12361546 ] 

KuroSaka TeruHiko commented on NUTCH-138:
-----------------------------------------

You are right.  WIth this Tomcat config, UTF-8 characters can be passed.
Also works is having:   useBodyEncodingForURI="true"
in the <Connector> tag within $TOMCAT/conf/service.xml
This is documented in:
http://issues.apache.org/bugzilla/show_bug.cgi?id=29900

What I suggest is to add this note to:
http://lucene.apache.org/nutch/i18n.html
(which currently explains the GUI localization issue only, rather than 
internationalization proper),
or perhaps creating a new page:
http://wiki.apache.org/nutch/GettingNutchRunningUTF8Tomcat5

I am willing to write a draft if someone tell me where to submit.

Feel free to close this bug.


> non-Latin-1 characters cannot be submitted for search
> -----------------------------------------------------
>
>          Key: NUTCH-138
>          URL: http://issues.apache.org/jira/browse/NUTCH-138
>      Project: Nutch
>         Type: Bug
>   Components: web gui
>     Versions: 0.7.1
>  Environment: Windows XP, Tomcat 5.5.12
>     Reporter: KuroSaka TeruHiko
>     Priority: Minor

>
> The search.html currently specifies GET method for query submission.
> Tomcat 5.x only allows ISO-8859-1 (aka Latin-1) code set to be submitted over 
> GET because of some restrictions of HTML or HTTP spec they discovered. (If my 
> memory is correct, non ISO-8859-1 characters were woking OK over GET with 
> older versions of Tomcat as far as setCharacterEncoding() is called properly.)
> To allow proper transmission of non-ISO-8859-1, POST method should be used.  
> Here's a proposed patch:
> *** search.html       Tue Dec 13 15:02:15 2005
> --- search-org.html   Tue Dec 13 15:02:07 2005
> ***************
> *** 59,65 ****
>   </span><span class="bodytext">
>   <center>
>   
> ! <form name="search" action="../search.jsp" method="post"> 
>   <input name="query" size="44">&nbsp;<input type="submit" value="Search">
>   <a href="help.html">help</a>
>   
> --- 59,65 ----
>   </span><span class="bodytext">
>   <center>
>   
> ! <form name="search" action="../search.jsp" method="get"> 
>   <input name="query" size="44">&nbsp;<input type="submit" value="Search">
>   <a href="help.html">help</a>
>   
> BTW, I am aware that Nutch and Lucene won't hanlde non Western languages well 
> as packaged.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira



-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to