[jira] [Updated] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2159: Flags: Patch,Important Patch Info: Patch Available > Ensure that

[jira] [Updated] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2159: Attachment: NUTCH-2159.patch Patch fro trunk. This resolves the issue and also

[jira] [Updated] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2159: Attachment: Screen Shot 2015-11-03 at 10.36.44 PM.png Nice WebApp for us to improve

[jira] [Created] (NUTCH-2160) Upgrade Selenium Java to 2.48.2

2015-11-03 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2160: --- Summary: Upgrade Selenium Java to 2.48.2 Key: NUTCH-2160 URL: https://issues.apache.org/jira/browse/NUTCH-2160 Project: Nutch Issue Type: Bug

[jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1800: Issue Type: New Feature (was: Bug) > Documentation for Nutch 1.X and 2.X REST A

[jira] [Resolved] (NUTCH-1988) Make nested output directory dump optional

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1988. - Resolution: Fixed Fix Version/s: 1.11 Committed @ revision 1711366

[jira] [Closed] (NUTCH-1988) Make nested output directory dump optional

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1988. --- > Make nested output directory dump optio

[jira] [Created] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings

2015-10-29 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2157: --- Summary: Parent Issue for Addressing Miredot REST API Warnings Key: NUTCH-2157 URL: https://issues.apache.org/jira/browse/NUTCH-2157 Project: Nutch

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14981186#comment-14981186 ] Lewis John McGibbney commented on NUTCH-1800: - [~sujenshah] did you see that this comments out

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14981251#comment-14981251 ] Lewis John McGibbney commented on NUTCH-1800: - Agreed! Committed @revision 1711359 in trunk

[jira] [Reopened] (NUTCH-1988) Make nested output directory dump optional

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1988: - Assignee: Lewis John McGibbney (was: Chris A. Mattmann) This issue seems

[jira] [Updated] (NUTCH-1988) Make nested output directory dump optional

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1988: Attachment: NUTCH-1988v2.patch Patch for trunk which reintroduces this patch. I

[RESULT] WAS Re: [VOTE] Release Apache Nutch 2.3.1

2015-10-29 Thread Lewis John Mcgibbney
(please state why) Lewis John Mcgibbney* (this release candidate has a flaw in the crawl script) Sherban Drulea (cannot get it to run... this release candidate has a flaw in the crawl script)) Sebastian Nagel* (this release candidate has a flaw in the crawl script) *PMC Binding This VOTE therefore fails

[jira] [Assigned] (NUTCH-2143) GeneratorJob ignores batch id passed as argument

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2143: --- Assignee: Lewis John McGibbney > GeneratorJob ignores batch id pas

Re: [VOTE] Release Apache Nutch 2.3.1

2015-10-29 Thread Lewis John Mcgibbney
this is good to go. I will send a RESULT thread then work on getting 2.3.1 RC #2 shipped. Thanks On Tue, Sep 22, 2015 at 6:45 PM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi user@ & dev@,This thread is a VOTE for releasing Apache Nutch 2.3.1 RC#1. > > We

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979831#comment-14979831 ] Lewis John McGibbney commented on NUTCH-1800: - For those who want to see the docs you can see

Re: MireDot user activation

2015-10-28 Thread lewis john mcgibbney
requested one for the Apache Tika project a while back and we now release kick ass REST documentation with each release. It would be my intention to do the same with Apache Nutch if possible. Thank you in advance for any feedback you have on this one, it is greatly appreciated. Lewis John McGibbney

[jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1800: Flags: Patch Patch Info: Patch Available > Documentation for Nutch

[jira] [Assigned] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1800: --- Assignee: Lewis John McGibbney > Documentation for Nutch 1.X and 2.X R

[jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1800: Fix Version/s: (was: 2.4) 2.3.1 1.11

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979828#comment-14979828 ] Lewis John McGibbney commented on NUTCH-1800: - [~sujenshah] > Documentation for Nutch

[jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1800: Attachment: NUTCH-1800.patch Patch for trunk. Currently uses my own license key

[jira] [Commented] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch

2015-10-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14974565#comment-14974565 ] Lewis John McGibbney commented on NUTCH-2147: - Hi [~markus17], I never took a look

[jira] [Updated] (NUTCH-2147) MetadataScoringFilter for Nutch

2015-10-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2147: Summary: MetadataScoringFilter for Nutch (was: LanguagePreferenceScoringFilter

[jira] [Updated] (NUTCH-2147) MetadataScoringFilter for Nutch

2015-10-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2147: Description: This issue originally started by envisioning an implementation

[jira] [Commented] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14967986#comment-14967986 ] Lewis John McGibbney commented on NUTCH-2148: - These ones also {code} 15/10/21 21:30:44 INFO

[jira] [Updated] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2148: Fix Version/s: (was: 2.3.1) > Review and update mapred --> mapreduce

[jira] [Updated] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2148: Attachment: NUTCH-2148v2.patch Updated patch for trunk, this deals with the parse

[jira] [Resolved] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2148. - Resolution: Fixed Committed @revision 1709943 in trunk > Review and upd

[jira] [Closed] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-2148. --- > Review and update mapred --> mapreduce config params in crawl

[jira] [Created] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-20 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2148: --- Summary: Review and update mapred --> mapreduce config params in crawl script Key: NUTCH-2148 URL: https://issues.apache.org/jira/browse/NUTCH-2

[jira] [Created] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch

2015-10-20 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2147: --- Summary: LanguagePreferenceScoringFilter for Nutch Key: NUTCH-2147 URL: https://issues.apache.org/jira/browse/NUTCH-2147 Project: Nutch Issue

[jira] [Assigned] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch

2015-10-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2147: --- Assignee: Lewis John McGibbney > LanguagePreferenceScoringFilter for Nu

[jira] [Updated] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2148: Attachment: NUTCH-2148.patch Patch for trunk > Review and update map

[jira] [Commented] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14966183#comment-14966183 ] Lewis John McGibbney commented on NUTCH-2148: - Seeing as the crawl script for 2.3.1 needs

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-10-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14942846#comment-14942846 ] Lewis John McGibbney commented on NUTCH-2086: - Nice Aron -- *Lewis* > Nutch 1.X We

Re: [VOTE] Release Apache Nutch 2.3.1

2015-09-30 Thread Lewis John Mcgibbney
Hi Folks, Is anyone else able to test and run the release candidate for 2.3.1? It would be great to get a release if we can get the VOTE's and the RC is suitable. Thanks in advance. Best Lewis On Wed, Sep 23, 2015 at 9:46 PM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: >

[jira] [Created] (NUTCH-2130) copyField rawcontent creates error within schema.xml

2015-09-30 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2130: --- Summary: copyField rawcontent creates error within schema.xml Key: NUTCH-2130 URL: https://issues.apache.org/jira/browse/NUTCH-2130 Project: Nutch

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-09-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14933808#comment-14933808 ] Lewis John McGibbney commented on NUTCH-2086: - Folks this is committed @revision 1705744

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-09-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14909345#comment-14909345 ] Lewis John McGibbney commented on NUTCH-2086: - I would also be +1 based on this adding new

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-09-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14909103#comment-14909103 ] Lewis John McGibbney commented on NUTCH-2086: - OK, so far I've found that src/bin/nutch

[jira] [Updated] (NUTCH-2086) Nutch 1.X Webui

2015-09-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2086: Attachment: NUTCH-2086.patch Patch for trunk. [~sujenshah] this was starting

[jira] [Updated] (NUTCH-2086) Nutch 1.X Webui

2015-09-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2086: Attachment: (was: NUTCH-2086.patch) > Nutch 1.X We

[jira] [Updated] (NUTCH-2086) Nutch 1.X Webui

2015-09-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2086: Attachment: NUTCH-2086.patch Updated patch to accomodate src/bin/nutch logic

[jira] [Commented] (NUTCH-1644) Should have a parser that uses xpath

2015-09-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14907644#comment-14907644 ] Lewis John McGibbney commented on NUTCH-1644: - [~bipin] bq. will this be fixed. Yes certainly

[jira] [Created] (NUTCH-2120) Remove MapWritable from trunk codebase

2015-09-24 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2120: --- Summary: Remove MapWritable from trunk codebase Key: NUTCH-2120 URL: https://issues.apache.org/jira/browse/NUTCH-2120 Project: Nutch Issue

[jira] [Created] (NUTCH-2122) Implement Javadoc package.html for service packages

2015-09-24 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2122: --- Summary: Implement Javadoc package.html for service packages Key: NUTCH-2122 URL: https://issues.apache.org/jira/browse/NUTCH-2122 Project: Nutch

[jira] [Assigned] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2111: --- Assignee: Lewis John McGibbney > Delete temporary files locat

[jira] [Resolved] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2111. - Resolution: Fixed Committed @revision 1704896 in trunk > Delete temporary fi

[jira] [Updated] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2111: Assignee: Kim Whitehall (was: Lewis John McGibbney) > Delete temporary fi

[jira] [Updated] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2111: Summary: Delete temporary files location for selenium tmp files after driver quits

[jira] [Commented] (NUTCH-2111) Set temporary file location for selenium tmp files

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904070#comment-14904070 ] Lewis John McGibbney commented on NUTCH-2111: - Hi [~kwhitehall] bq. The patch submitted does

[jira] [Created] (NUTCH-2116) NutchServer and NutchApp should contain shutdown hooks

2015-09-23 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2116: --- Summary: NutchServer and NutchApp should contain shutdown hooks Key: NUTCH-2116 URL: https://issues.apache.org/jira/browse/NUTCH-2116 Project: Nutch

[jira] [Resolved] (NUTCH-2115) Add total counts to dump stats

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2115. - Resolution: Fixed Assignee: Michael Joyce Nice patch Mike Committed

[jira] [Resolved] (NUTCH-2117) NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2117. - Resolution: Fixed Committed @revision 1704972 in trunk. > NutchServer CLI Opt

[jira] [Created] (NUTCH-2117) NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST

2015-09-23 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2117: --- Summary: NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST Key: NUTCH-2117 URL: https://issues.apache.org/jira/browse/NUTCH-2117

Re: [VOTE] Release Apache Nutch 2.3.1

2015-09-23 Thread Lewis John Mcgibbney
Hi Folks, It turns out the formatting for the original email below was terrible. Sorry about that. I've hopefully corrected formatting now. Please VOTE away! On Tue, Sep 22, 2015 at 6:45 PM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi user@ & dev@, > > Th

[VOTE] Release Apache Nutch 2.3.1

2015-09-22 Thread Lewis John Mcgibbney
Hi user@ & dev@,This thread is a VOTE for releasing Apache Nutch 2.3.1 RC#1. We addressed 32 issues in all which can been see at the release report http://s.apache.org/nutch_2.3.1 The release candidate comprises the following components. * A staging repository [0] containing various Maven

[jira] [Resolved] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation

2015-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2018. - Resolution: Fixed Added to release management HOWTO > Ensure that the Doc

[jira] [Resolved] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1

2015-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2105. - Resolution: Fixed Committed @revision 1704754 in 2.X HEAD > Update Nu

[jira] [Updated] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1

2015-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2105: Attachment: NUTCH-2105.patch Patch for 2.X HEAD Would like to commit today and get

[jira] [Updated] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation

2015-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2018: Issue Type: Improvement (was: Bug) > Ensure that the Docker containers for Nutc

[jira] [Resolved] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1946. - Resolution: Fixed Committed @revision 1704128 in 2.X HEAD > Upgrade to G

[jira] [Resolved] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1286. - Resolution: Won't Fix > Refactoring/reimplementing crawling API (Nutch

[jira] [Resolved] (NUTCH-2101) Upgrade Nutch 2.X to Hadoop 2.5.1

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2101. - Resolution: Fixed resolved in NUTCH-1946 > Upgrade Nutch 2.X to Hadoop 2.

[jira] [Commented] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14899916#comment-14899916 ] Lewis John McGibbney commented on NUTCH-2018: - I'll deal with this tomorrow folks. > Ens

[jira] [Resolved] (NUTCH-2028) java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2028. - Resolution: Fixed This issue has been resolved in upgrade to Gora 0.6.1

[jira] [Commented] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14899915#comment-14899915 ] Lewis John McGibbney commented on NUTCH-2105: - I will work on this tomorrow then push an RC

[jira] [Resolved] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X HBase Docker

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2050. - Resolution: Fixed Committed @revision 1704129 in 2.X HEAD > Upgrade HB

[jira] [Resolved] (NUTCH-1572) Nutch 2.x should use o.a.g.mem.store.MemStore for testing

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1572. - Resolution: Fixed Resolved with NUTCH-1946 > Nutch 2.x should

[jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X HBase Docker

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2050: Summary: Upgrade HBase and Hadoop versioning on 2.X HBase Docker (was: Upgrade

[jira] [Commented] (NUTCH-2111) Set temporary file location for selenium tmp files

2015-09-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14877248#comment-14877248 ] Lewis John McGibbney commented on NUTCH-2111: - lol > Set temporary file location for selen

[jira] [Commented] (NUTCH-2106) Runtime to contain Selenium and dependencies only once

2015-09-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14877251#comment-14877251 ] Lewis John McGibbney commented on NUTCH-2106: - +1 for commit [~wastl-nagel] > Runt

[jira] [Commented] (NUTCH-2106) Runtime to contain Selenium and dependencies only once

2015-09-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14805366#comment-14805366 ] Lewis John McGibbney commented on NUTCH-2106: - [~kwhitehall] lets touch base on this and try

NUTCH-1946 Upgrade to Gora 0.6.1

2015-09-17 Thread Lewis John Mcgibbney
Hi user@ and dev@, Quick message to ask kindly for a call to arms. I pushed a patch to NUTCH-1946 [0] for Nutch 2.X HEAD [1] This includes - Upgrade to Gora 0.6.1 - Upgrade to Hadoop 2.5.1 (which Gora supports fully) see NUTCH-2101

[jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2050: Flags: Patch,Important Patch Info: Patch Available > Upgrade HB

[jira] [Created] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1

2015-09-17 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2105: --- Summary: Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1 Key: NUTCH-2105 URL: https://issues.apache.org/jira/browse/NUTCH-2105 Project

[jira] [Updated] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1286: Fix Version/s: (was: 2.4) 2.3.1 > Refactoring/reimplement

[jira] [Updated] (NUTCH-1169) Write JUnit tests for urlfilter-prefix

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1169: Assignee: Talat UYARER > Write JUnit tests for urlfilter-pre

[jira] [Reopened] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1286: - > Refactoring/reimplementing crawling API (Nutch

[jira] [Updated] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1946: Attachment: NUTCH-1946v4.patch Patch for 2.X HEAD This includes * Upgrade to Gora

[jira] [Updated] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1946: Flags: Patch,Important Patch Info: Patch Available Priority

[jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2050: Attachment: NUTCH-2050.patch Patch for 2.X HEAD blocker by NUTCH-1946. This patch

[jira] [Updated] (NUTCH-1893) Parse-tika fails to parse feed files

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1893: Fix Version/s: (was: 2.4) 2.3.1 > Parse-tika fails to pa

[jira] [Updated] (NUTCH-1886) Review and update default.properties

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1886: Fix Version/s: (was: 2.4) 2.3.1 > Review and upd

[jira] [Updated] (NUTCH-1709) Generated classes o.a.n.storage.Host and o.a.n.storage.ProtocolStatus contain methods not defined in source .avsc

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1709: Fix Version/s: (was: 2.3.1) 2.4 > Generated clas

[jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2050: Component/s: (was: build) docker > Upgrade HBase and Had

[jira] [Updated] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2018: Component/s: docker > Ensure that the Docker containers for Nutch 2.X are p

[jira] [Commented] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791680#comment-14791680 ] Lewis John McGibbney commented on NUTCH-2104: - Hi [~kwhitehall] if you think you can get

[jira] [Updated] (NUTCH-1920) Upgrade Nutch to use Java 1.7

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1920: Fix Version/s: (was: 2.4) 2.3.1 > Upgrade Nutch to use J

[jira] [Updated] (NUTCH-1981) Upgrade icu4j

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1981: Fix Version/s: (was: 2.4) 2.3.1 > Upgrade ic

[jira] [Updated] (NUTCH-1941) Optional rolling http.agent.name's

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1941: Fix Version/s: (was: 2.4) 2.3.1 > Optional roll

[jira] [Updated] (NUTCH-1169) Write JUnit tests for urlfilter-prefix

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1169: Fix Version/s: (was: 2.4) 2.3.1 > Write JUnit te

[jira] [Commented] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791632#comment-14791632 ] Lewis John McGibbney commented on NUTCH-1946: - These are intrinsically linked. > Upgr

[jira] [Updated] (NUTCH-2101) Upgrade Nutch 2.X to Hadoop 2.5.1

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2101: Summary: Upgrade Nutch 2.X to Hadoop 2.5.1 (was: Upgrade Nutch 2.X to Hadoop 2.4.0

[jira] [Commented] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791633#comment-14791633 ] Lewis John McGibbney commented on NUTCH-1946: - As We've fixed a deal of things over

[jira] [Updated] (NUTCH-1062) Migrate BasicURLNormalizer from Apache ORO to java.util.regex

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1062: Fix Version/s: (was: 2.4) 2.3.1 > Migrate BasicURLNormali

[jira] [Updated] (NUTCH-1990) Use URI.normalise() in BasicURLNormalizer

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1990: Fix Version/s: 2.3.1 > Use URI.normalise() in BasicURLNormali

[jira] [Closed] (NUTCH-1936) GSoC 2015 - Move Nutch to Hadoop 2.X

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1936. --- Resolution: Fixed > GSoC 2015 - Move Nutch to Hadoop

[jira] [Reopened] (NUTCH-1936) GSoC 2015 - Move Nutch to Hadoop 2.X

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1936: - > GSoC 2015 - Move Nutch to Hadoop

<    5   6   7   8   9   10   11   12   13   14   >