[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13855625#comment-13855625 ]
Sebastian Nagel commented on NUTCH-1681: ---------------------------------------- Shouldn't we keep all parts of the URL, esp. port and query, ev. also anchor/fragment and user info? {code} assertEquals("http://uni-tübingen.de:8080/search.php?q=abc", URLUtil.toUNICODE("http://xn--uni-tbingen-xhb.de:8080/search.php?q=abc")); {code} > In URLUtil.java, toUNICODE method does not work correctly > --------------------------------------------------------- > > Key: NUTCH-1681 > URL: https://issues.apache.org/jira/browse/NUTCH-1681 > Project: Nutch > Issue Type: Bug > Affects Versions: 2.2.1 > Reporter: İlhami KALKAN > Fix For: 1.9, 2.2.1 > > Attachments: NUTCH-1681-1.8-2.patch, NUTCH-1681-1.8.patch, > NUTCH-1681-1.8.patch, NUTCH-1681-1.8.patch, toUnicode.patch > > > This method returns java.net.URISyntaxException when non-ascii character does > in parameter like http://www.çevir.com. -- This message was sent by Atlassian JIRA (v6.1.5#6160)