[jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2050: Component/s: (was: build) docker > Upgrade HBase and Hadoop ver

[jira] [Updated] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2018: Component/s: docker > Ensure that the Docker containers for Nutch 2.X are part of th

[jira] [Created] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1

2015-09-16 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2105: --- Summary: Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1 Key: NUTCH-2105 URL: https://issues.apache.org/jira/browse/NUTCH-2105 Project:

[jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2050: Attachment: NUTCH-2050.patch Patch for 2.X HEAD blocker by NUTCH-1946. This patch ai

[jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2050: Flags: Patch,Important Patch Info: Patch Available > Upgrade HBase and Hado

[jira] [Updated] (NUTCH-1709) Generated classes o.a.n.storage.Host and o.a.n.storage.ProtocolStatus contain methods not defined in source .avsc

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1709: Fix Version/s: (was: 2.3.1) 2.4 > Generated classes o.a.n.sto

NUTCH-1946 Upgrade to Gora 0.6.1

2015-09-16 Thread Lewis John Mcgibbney
Hi user@ and dev@, Quick message to ask kindly for a call to arms. I pushed a patch to NUTCH-1946 [0] for Nutch 2.X HEAD [1] This includes - Upgrade to Gora 0.6.1 - Upgrade to Hadoop 2.5.1 (which Gora supports fully) see NUTCH-2101 -

[jira] [Updated] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1946: Flags: Patch,Important Patch Info: Patch Available Priority: Critic

[jira] [Updated] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1946: Attachment: NUTCH-1946v4.patch Patch for 2.X HEAD This includes * Upgrade to Gora

[jira] [Commented] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14791633#comment-14791633 ] Lewis John McGibbney commented on NUTCH-1946: - As We've fixed a deal of things

[jira] [Commented] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14791632#comment-14791632 ] Lewis John McGibbney commented on NUTCH-1946: - These are intrinsically linked.

[jira] [Updated] (NUTCH-2101) Upgrade Nutch 2.X to Hadoop 2.5.1

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2101: Summary: Upgrade Nutch 2.X to Hadoop 2.5.1 (was: Upgrade Nutch 2.X to Hadoop 2.4.0)

[jira] [Created] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-16 Thread Kim Whitehall (JIRA)
Kim Whitehall created NUTCH-2104: Summary: Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation Key: NUTCH-2104 URL: https://issues.apache.org/jira/browse/NUTCH-2104

Re: unsubscribe

2015-09-16 Thread Michael Joyce
Please see the instructions on the project website regarding how to unsubscribe https://nutch.apache.org/mailing_lists.html Namely, you need to email dev-unsubscr...@nutch.apache.org instead of the actual list. Hope that helps -- Jimmy On Wed, Sep 16, 2015 at 10:01 AM, Mohit Raman wrote: > >

[jira] [Commented] (NUTCH-2099) Refactoring the REST endpoints for integration with webui

2015-09-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790807#comment-14790807 ] Lewis John McGibbney commented on NUTCH-2099: - [~sujenshah], can you show me a

[jira] [Commented] (NUTCH-2099) Refactoring the REST endpoints for integration with webui

2015-09-16 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790802#comment-14790802 ] ASF GitHub Bot commented on NUTCH-2099: --- Github user lewismc commented on a diff in

[GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah

2015-09-16 Thread lewismc
Github user lewismc commented on a diff in the pull request: https://github.com/apache/nutch/pull/59#discussion_r39662837 --- Diff: src/java/org/apache/nutch/metadata/Nutch.java --- @@ -80,4 +80,11 @@ public static final String STAT_PROGRESS = "progress"; /**Used by

[GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah

2015-09-16 Thread lewismc
Github user lewismc commented on a diff in the pull request: https://github.com/apache/nutch/pull/59#discussion_r39662639 --- Diff: src/java/org/apache/nutch/crawl/CrawlDb.java --- @@ -261,30 +262,68 @@ public int run(String[] args) throws Exception { additionsAllowed = f

[jira] [Commented] (NUTCH-2099) Refactoring the REST endpoints for integration with webui

2015-09-16 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790799#comment-14790799 ] ASF GitHub Bot commented on NUTCH-2099: --- Github user lewismc commented on a diff in

[GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah

2015-09-16 Thread lewismc
Github user lewismc commented on a diff in the pull request: https://github.com/apache/nutch/pull/59#discussion_r39662447 --- Diff: src/java/org/apache/nutch/crawl/CrawlDb.java --- @@ -236,10 +237,10 @@ public int run(String[] args) throws Exception { * Used for Nutch REST s

[jira] [Commented] (NUTCH-2099) Refactoring the REST endpoints for integration with webui

2015-09-16 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790797#comment-14790797 ] ASF GitHub Bot commented on NUTCH-2099: --- Github user lewismc commented on a diff in

unsubscribe

2015-09-16 Thread Mohit Raman

[jira] [Created] (NUTCH-2103) Nutch 2.3 has an old version of hbase jar in runtime/lib folder

2015-09-16 Thread Mobin Ranjbar (JIRA)
Mobin Ranjbar created NUTCH-2103: Summary: Nutch 2.3 has an old version of hbase jar in runtime/lib folder Key: NUTCH-2103 URL: https://issues.apache.org/jira/browse/NUTCH-2103 Project: Nutch

[jira] [Comment Edited] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14747327#comment-14747327 ] Julien Nioche edited comment on NUTCH-2102 at 9/16/15 11:21 AM:

[jira] [Commented] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14747327#comment-14747327 ] Julien Nioche commented on NUTCH-2102: -- Hi Markus > I believe this warc format is t

[jira] [Commented] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14747316#comment-14747316 ] Markus Jelsma commented on NUTCH-2102: -- Hello Julien! I believe this warc format is t

[jira] [Commented] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14747301#comment-14747301 ] Julien Nioche commented on NUTCH-2102: -- Please review > WARC Exporter >

[jira] [Updated] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-2102: - Description: This patch adds a WARC exporter [http://bibnum.bnf.fr/warc/WARC_ISO_28500_version1_l

[jira] [Commented] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14747300#comment-14747300 ] Julien Nioche commented on NUTCH-2102: -- The only modification to existing code is in

[jira] [Updated] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-2102: - Attachment: NUTCH-2102.patch > WARC Exporter > - > > Key: NUTCH-2102 >

[jira] [Updated] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-2102: - Attachment: (was: NUTCH-2102.patch) > WARC Exporter > - > > Key: N

[jira] [Updated] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-2102: - Description: This patch adds a WARC exporter [http://bibnum.bnf.fr/warc/WARC_ISO_28500_version1_l

[jira] [Updated] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-2102: - Attachment: NUTCH-2102.patch > WARC Exporter > - > > Key: NUTCH-2102 >

[jira] [Created] (NUTCH-2102) WARC Exporter

2015-09-16 Thread Julien Nioche (JIRA)
Julien Nioche created NUTCH-2102: Summary: WARC Exporter Key: NUTCH-2102 URL: https://issues.apache.org/jira/browse/NUTCH-2102 Project: Nutch Issue Type: Improvement Components: com

[jira] [Updated] (NUTCH-1932) Automatically remove orphaned pages

2015-09-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1932: - Attachment: NUTCH-1932.patch Probably the final patch. It now includes: * moving reducer code to