[jira] [Commented] (NUTCH-2083) Implement functionality to shadow nutch-selenium-grid-plugin from Mo Omer

2015-08-25 Thread Kim Whitehall (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14711670#comment-14711670 ] Kim Whitehall commented on NUTCH-2083: -- Hey Lewis, Nicely done! I've got a few

[GitHub] nutch pull request: [DO NOT MERGE/DISCUSSION] add cleaned up versi...

2015-08-25 Thread eivindveg
Github user eivindveg commented on a diff in the pull request: https://github.com/apache/nutch/pull/50#discussion_r37885151 --- Diff: src/plugin/protocol-selenium/src/java/org/apache/nutch/protocol/selenium/HttpResponse.java --- @@ -0,0 +1,107 @@ +/** + * Licensed to the

[GitHub] nutch pull request: [DO NOT MERGE/DISCUSSION] add cleaned up versi...

2015-08-25 Thread eivindveg
Github user eivindveg closed the pull request at: https://github.com/apache/nutch/pull/50 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[jira] [Commented] (NUTCH-2083) Implement functionality to shadow nutch-selenium-grid-plugin from Mo Omer

2015-08-25 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14712399#comment-14712399 ] Hudson commented on NUTCH-2083: --- SUCCESS: Integrated in Nutch-trunk #3262 (See

[jira] [Resolved] (NUTCH-2083) Implement functionality to shadow nutch-selenium-grid-plugin from Mo Omer

2015-08-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2083. - Resolution: Fixed Committed @ revision 1697808 in trunk Implement functionality

[DISCUSS] Release Nutch trunk 1.11

2015-08-25 Thread Lewis John Mcgibbney
Hi Folks, What do you all think about getting a release candidate out for Nutch 1.11? I am happy to do RM role. Thanks Lewis -- *Lewis*

Moving Nutch conf to etc/nutch

2015-08-25 Thread Lewis John Mcgibbney
Hi Folks, What are the thoughts about shadowing the directory structure present across projects within the Hadoop ecosystem. This is a trivial change e.g. involves moving conf to etc/nutch and fiddling with some of our build targets. Lewis -- *Lewis*

[jira] [Updated] (NUTCH-2084) Track changes in input dirs for SegmentMerger

2015-08-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2084: - Description: When merging 1000's of segments, and one is corrupt, broken, whatever, the merge

[jira] [Updated] (NUTCH-2084) Track changes in input dirs for SegmentMerger

2015-08-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2084: - Attachment: NUTCH-2084.patch Patch for trunk. Track changes in input dirs for SegmentMerger

[jira] [Commented] (NUTCH-2084) Track changes in input dirs for SegmentMerger

2015-08-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14710828#comment-14710828 ] Markus Jelsma commented on NUTCH-2084: -- Well, this immediately helped me track down

[jira] [Created] (NUTCH-2084) Track changes in input dirs for SegmentMerger

2015-08-25 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-2084: Summary: Track changes in input dirs for SegmentMerger Key: NUTCH-2084 URL: https://issues.apache.org/jira/browse/NUTCH-2084 Project: Nutch Issue Type: Bug

[jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2015-08-25 Thread Alexander Kingson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14711774#comment-14711774 ] Alexander Kingson commented on NUTCH-1679: -- Hi, It seems to me that in this case

[jira] [Updated] (NUTCH-2083) Implement functionality to shadow nutch-selenium-grid-plugin from Mo Omer

2015-08-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2083: Attachment: NUTCH-2083v2.patch Hi [~kwhitehall] please see attached patch * It

[jira] [Updated] (NUTCH-1741) Support of Sitemaps in Nutch 2.x

2015-08-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1741: Attachment: NUTCH-1741v5.patch Patch for 2.X HEAD which adds missing license

[jira] [Commented] (NUTCH-2085) Upgrade Guava

2015-08-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14712134#comment-14712134 ] Lewis John McGibbney commented on NUTCH-2085: - +1 Upgrade Guava

[jira] [Commented] (NUTCH-2083) Implement functionality to shadow nutch-selenium-grid-plugin from Mo Omer

2015-08-25 Thread Kim Whitehall (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14712132#comment-14712132 ] Kim Whitehall commented on NUTCH-2083: -- [~lewismc] Thanks for the clarifications and

[jira] [Commented] (NUTCH-2083) Implement functionality to shadow nutch-selenium-grid-plugin from Mo Omer

2015-08-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14712136#comment-14712136 ] Lewis John McGibbney commented on NUTCH-2083: - OK will commit EoB today unless

[jira] [Created] (NUTCH-2086) Nutch 1.X Webui

2015-08-25 Thread Sujen Shah (JIRA)
Sujen Shah created NUTCH-2086: - Summary: Nutch 1.X Webui Key: NUTCH-2086 URL: https://issues.apache.org/jira/browse/NUTCH-2086 Project: Nutch Issue Type: New Feature Components:

[jira] [Created] (NUTCH-2085) Upgrade Guava

2015-08-25 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-2085: Summary: Upgrade Guava Key: NUTCH-2085 URL: https://issues.apache.org/jira/browse/NUTCH-2085 Project: Nutch Issue Type: Task Affects Versions: 1.10

[jira] [Updated] (NUTCH-2085) Upgrade Guava

2015-08-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2085: - Attachment: NUTCH-2085.patch Patch for trunk. Tests pass except for ParserFactory, which fails

[jira] [Updated] (NUTCH-2085) Upgrade Guava

2015-08-25 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2085: - Patch Info: Patch Available Upgrade Guava - Key: NUTCH-2085