[ 
https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13855625#comment-13855625
 ] 

Sebastian Nagel commented on NUTCH-1681:
----------------------------------------

Shouldn't we keep all parts of the URL, especially the port and query, and 
possibly also the anchor/fragment and user info?
{code}
assertEquals("http://uni-tübingen.de:8080/search.php?q=abc",
    URLUtil.toUNICODE("http://xn--uni-tbingen-xhb.de:8080/search.php?q=abc"));
{code}
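
To illustrate what "keeping all parts" could look like, here is a minimal sketch. It is not the code from any of the attached patches. The class name ToUnicodeSketch and the method toUnicode are hypothetical, and returning the input unchanged for a malformed URL is an assumption made only for this example. Only the host is passed through java.net.IDN; user info, port, path, query and fragment are copied as they are.

```java
import java.net.IDN;
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical helper, for illustration only (not Nutch's URLUtil).
class ToUnicodeSketch {

    // Converts a Punycode host to Unicode and keeps every other URL part.
    static String toUnicode(String url) {
        URL u;
        try {
            u = new URL(url);
        } catch (MalformedURLException e) {
            // Assumption for this sketch: leave unparsable input untouched.
            return url;
        }
        String host = u.getHost();
        if (host == null || host.isEmpty()) {
            return url; // nothing to convert
        }
        StringBuilder sb = new StringBuilder();
        sb.append(u.getProtocol()).append("://");
        if (u.getUserInfo() != null) {
            sb.append(u.getUserInfo()).append('@');
        }
        // Only the host is IDN-decoded.
        sb.append(IDN.toUnicode(host));
        if (u.getPort() != -1) {
            sb.append(':').append(u.getPort());
        }
        // getFile() is the path plus "?query" when a query is present.
        sb.append(u.getFile());
        if (u.getRef() != null) {
            sb.append('#').append(u.getRef());
        }
        return sb.toString();
    }
}
```

With this approach, a call such as ToUnicodeSketch.toUnicode("http://xn--uni-tbingen-xhb.de:8080/search.php?q=abc") keeps ":8080/search.php?q=abc" after the decoded host, and a URL with a plain ASCII host comes back unchanged.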


> In URLUtil.java, toUNICODE method does not work correctly
> ---------------------------------------------------------
>
>                 Key: NUTCH-1681
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1681
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 2.2.1
>            Reporter: İlhami KALKAN
>             Fix For: 1.9, 2.2.1
>
>         Attachments: NUTCH-1681-1.8-2.patch, NUTCH-1681-1.8.patch, 
> NUTCH-1681-1.8.patch, NUTCH-1681-1.8.patch, toUnicode.patch
>
>
> This method throws java.net.URISyntaxException when the parameter contains a 
> non-ASCII character, as in http://www.çevir.com.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
