[jira] [Commented] (NUTCH-1690) IndexClean: mark url as unindexed after clean to not delete again

2013-12-23 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855496#comment-13855496 ] Sebastian Nagel commented on NUTCH-1690: Patch depends on STATUS_DUPLICATED

[jira] [Commented] (NUTCH-1685) URLUtil.toUNICODE fails on IDNs

2013-12-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855518#comment-13855518 ] Markus Jelsma commented on NUTCH-1685: -- Looks like a duplicate. URLUtil.toUNICODE

[jira] [Closed] (NUTCH-1685) URLUtil.toUNICODE fails on IDNs

2013-12-23 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel closed NUTCH-1685. -- Resolution: Duplicate You are right, [~markus17]. URLUtil.toUNICODE fails on IDNs

[jira] [Commented] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1381#comment-1381 ] Sebastian Nagel commented on NUTCH-1681: Hi [~markus17], the solution fails for

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] İlhami KALKAN updated NUTCH-1681: - Attachment: NUTCH-1681-1.8-2.patch Hi Sebastian, Thanks. I fix the toUNICODE method and add

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1681: - Attachment: NUTCH-1681-1.8.patch Yes that works İlhami. Final patch to get rid of the system.out

[jira] [Commented] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855625#comment-13855625 ] Sebastian Nagel commented on NUTCH-1681: Shouldn't we keep all parts of the URL,

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1681: --- Attachment: NUTCH-1681-1.8-5.patch Patch which keeps (hopefully :)!) all info from original

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1681: - Attachment: NUTCH-1681-1.8.patch Yes, put parts of original code back. If you build URI up like

[jira] [Commented] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855654#comment-13855654 ] Sebastian Nagel commented on NUTCH-1681: Can you commit? (to avoid any further

[jira] [Comment Edited] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855660#comment-13855660 ] Markus Jelsma edited comment on NUTCH-1681 at 12/23/13 2:29 PM:

[jira] [Updated] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2013-12-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1360: Attachment: NUTCH-1360-trunkv3.patch Patch for trunk. This adds the host IP to the

[jira] [Commented] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855669#comment-13855669 ] Lewis John McGibbney commented on NUTCH-1681: - I'll look right now Markus and

Jenkins build is back to normal : Nutch-trunk #2460

2013-12-23 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-trunk/2460/changes

[jira] [Comment Edited] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855669#comment-13855669 ] Lewis John McGibbney edited comment on NUTCH-1681 at 12/23/13 2:49 PM:

[jira] [Commented] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855670#comment-13855670 ] Hudson commented on NUTCH-1681: --- SUCCESS: Integrated in Nutch-trunk #2460 (See

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1681: Attachment: NUTCH-1681-2.x.patch patch for 2.x HEAD Committed @revision 1553125 in

[jira] [Resolved] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1681. - Resolution: Fixed Fix Version/s: (was: 2.2.1) 2.2

[jira] [Commented] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855682#comment-13855682 ] Markus Jelsma commented on NUTCH-1681: -- I see. Thanks mate! In URLUtil.java,

[jira] [Updated] (NUTCH-1321) IDNNormalizer

2013-12-23 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] İlhami KALKAN updated NUTCH-1321: - Attachment: (was: idnNormalizer.patch) IDNNormalizer - Key:

[jira] [Commented] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855689#comment-13855689 ] Hudson commented on NUTCH-1681: --- SUCCESS: Integrated in Nutch-nutchgora #857 (See

[jira] [Updated] (NUTCH-1321) IDNNormalizer

2013-12-23 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] İlhami KALKAN updated NUTCH-1321: - Attachment: idnNormalizer.patch Hi Sebastian, I dont know enough information about 1.x so I

[jira] [Updated] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2013-12-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1360: Attachment: NUTCH-1360-trunkv4.patch Hi [~jnioche] thank you for review. Yes you

[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2013-12-23 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855745#comment-13855745 ] Julien Nioche commented on NUTCH-1360: -- Looks good mate, +1 to commit Suport the

[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2013-12-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855780#comment-13855780 ] Lewis John McGibbney commented on NUTCH-1360: - Thanks [~jnioche] Committed

[jira] [Resolved] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2013-12-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1360. - Resolution: Fixed Fix Version/s: (was: 1.8) 1.9

[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2013-12-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13855794#comment-13855794 ] Hudson commented on NUTCH-1360: --- SUCCESS: Integrated in Nutch-nutchgora #858 (See

Re: Step Through Nutch 1.7 Inside Eclipse Missing Argument

2013-12-23 Thread Tejas Patil
Hi Bin Wang, You are welcome to edit the wiki and add your observations to it. Thanks for your contribution. ~tejas On Mon, Dec 23, 2013 at 8:19 AM, Bin Wang binwang...@gmail.com wrote: Hi Tejas, Thanks a lot for your confirmation! And it is working for me now! I will take you as the

[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2013-12-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856108#comment-13856108 ] Hudson commented on NUTCH-1360: --- FAILURE: Integrated in Nutch-trunk #2462 (See