[jira] [Comment Edited] (NUTCH-2165) FileDumper Util hard codes part-# folder name

2015-11-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15002598#comment-15002598 ] Lewis John McGibbney edited comment on NUTCH-2165 at 11/12/15 6:39 PM: -

[jira] [Commented] (NUTCH-2165) FileDumper Util hard codes part-# folder name

2015-11-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15002598#comment-15002598 ] Lewis John McGibbney commented on NUTCH-2165: - +1 [~mjoyce] verified on small

[jira] [Resolved] (NUTCH-2160) Upgrade Selenium Java to 2.48.2

2015-11-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2160. - Resolution: Fixed Committed revision 1714071 > Upgrade Selenium Java to 2.48.2 >

[jira] [Updated] (NUTCH-2160) Upgrade Selenium Java to 2.48.2

2015-11-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2160: Issue Type: Improvement (was: Bug) > Upgrade Selenium Java to 2.48.2 >

[jira] [Closed] (NUTCH-2120) Remove MapWritable from trunk codebase

2015-11-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-2120. --- Committed revision 1714068 > Remove MapWritable from trunk codebase > ---

[jira] [Resolved] (NUTCH-2120) Remove MapWritable from trunk codebase

2015-11-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2120. - Resolution: Fixed Fix Version/s: (was: 1.12) 1.11 >

[jira] [Updated] (NUTCH-2120) Remove MapWritable from trunk codebase

2015-11-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2120: Issue Type: Task (was: Bug) > Remove MapWritable from trunk codebase >

[jira] [Updated] (NUTCH-2120) Remove MapWritable from trunk codebase

2015-11-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2120: Flags: Patch Patch Info: Patch Available > Remove MapWritable from trunk co

[jira] [Commented] (NUTCH-2160) Upgrade Selenium Java to 2.48.2

2015-11-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15001105#comment-15001105 ] Lewis John McGibbney commented on NUTCH-2160: - Will commit by EoB today unless

[jira] [Updated] (NUTCH-2120) Remove MapWritable from trunk codebase

2015-11-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2120: Attachment: NUTCH-2120.patch Patch which removes this class from Trunk. > Remove Ma

[jira] [Commented] (NUTCH-2167) Backport TableUtil from 2.x for URL reversing

2015-11-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15000912#comment-15000912 ] Lewis John McGibbney commented on NUTCH-2167: - Yes, an example of this being u

[jira] [Commented] (NUTCH-2165) FileDumper Util hard codes part-# folder name

2015-11-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15000658#comment-15000658 ] Lewis John McGibbney commented on NUTCH-2165: - It means that the remaining dat

[jira] [Updated] (NUTCH-2163) Utilize current JVM threads to augment URLClassLoader with newly discovered classes

2015-11-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2163: Summary: Utilize current JVM threads to augment URLClassLoader with newly discovered

[jira] [Created] (NUTCH-2163) Utilize current JVM threads to augment URLClassLoader with newlt discovered classes

2015-11-06 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2163: --- Summary: Utilize current JVM threads to augment URLClassLoader with newlt discovered classes Key: NUTCH-2163 URL: https://issues.apache.org/jira/browse/NUTCH-2163

[jira] [Commented] (NUTCH-2162) Nutch Webapp Crawl fails as it tries to index

2015-11-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14994168#comment-14994168 ] Lewis John McGibbney commented on NUTCH-2162: - Ack. I also got it working well

[jira] [Commented] (NUTCH-2162) Nutch Webapp Crawl fails as it tries to index

2015-11-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14993376#comment-14993376 ] Lewis John McGibbney commented on NUTCH-2162: - In all honesty a work around fo

[jira] [Updated] (NUTCH-2162) Nutch Webapp Crawl fails as it tries to index

2015-11-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2162: Attachment: nutch_webapp.log Example log output from initiating a Crawl from the Web

[jira] [Created] (NUTCH-2162) Nutch Webapp Crawl fails as it tries to index

2015-11-05 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2162: --- Summary: Nutch Webapp Crawl fails as it tries to index Key: NUTCH-2162 URL: https://issues.apache.org/jira/browse/NUTCH-2162 Project: Nutch Iss

[jira] [Created] (NUTCH-2161) Interrupted failed and/or killed tasks fail to clean up temp directories in HDFS

2015-11-05 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2161: --- Summary: Interrupted failed and/or killed tasks fail to clean up temp directories in HDFS Key: NUTCH-2161 URL: https://issues.apache.org/jira/browse/NUTCH-2161

[jira] [Updated] (NUTCH-2129) Track Protocol Status in Crawl Datum

2015-11-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2129: Fix Version/s: (was: 2.4) > Track Protocol Status in Crawl Datum > -

[jira] [Commented] (NUTCH-2160) Upgrade Selenium Java to 2.48.2

2015-11-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991063#comment-14991063 ] Lewis John McGibbney commented on NUTCH-2160: - I was under the impression that

[jira] [Resolved] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2159. - Resolution: Fixed Committed @revision 1712705 in trunk > Ensure that all WebApp f

[jira] [Commented] (NUTCH-2160) Upgrade Selenium Java to 2.48.2

2015-11-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991031#comment-14991031 ] Lewis John McGibbney commented on NUTCH-2160: - Thanks Kim. I have it working w

[jira] [Updated] (NUTCH-2160) Upgrade Selenium Java to 2.48.2

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2160: Attachment: NUTCH-2160.patch Patch for trunk. [~kwhitehall] hopefully this will mean

[jira] [Created] (NUTCH-2160) Upgrade Selenium Java to 2.48.2

2015-11-03 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2160: --- Summary: Upgrade Selenium Java to 2.48.2 Key: NUTCH-2160 URL: https://issues.apache.org/jira/browse/NUTCH-2160 Project: Nutch Issue Type: Bug

[jira] [Commented] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14989009#comment-14989009 ] Lewis John McGibbney commented on NUTCH-2159: - [~sujenshah], please scope if y

[jira] [Updated] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2159: Attachment: Screen Shot 2015-11-03 at 10.36.44 PM.png Nice WebApp for us to improve

[jira] [Updated] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2159: Attachment: NUTCH-2159.patch Patch for trunk. This resolves the issue and also remov

[jira] [Updated] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2159: Flags: Patch,Important Patch Info: Patch Available > Ensure that all WebApp

[jira] [Created] (NUTCH-2159) Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp

2015-11-03 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2159: --- Summary: Ensure that all WebApp files are copied into generated artifacts for 1.X Webapp Key: NUTCH-2159 URL: https://issues.apache.org/jira/browse/NUTCH-2159

[jira] [Resolved] (NUTCH-2086) Nutch 1.X Webui

2015-11-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2086. - Resolution: Fixed Fix Version/s: (was: 1.12) 1.11 >

[jira] [Assigned] (NUTCH-2143) GeneratorJob ignores batch id passed as argument

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2143: --- Assignee: Lewis John McGibbney > GeneratorJob ignores batch id passed as argu

[jira] [Resolved] (NUTCH-1988) Make nested output directory dump optional

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1988. - Resolution: Fixed Fix Version/s: 1.11 Committed @ revision 1711366 in trunk

[jira] [Closed] (NUTCH-1988) Make nested output directory dump optional

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1988. --- > Make nested output directory dump optional > --

[jira] [Updated] (NUTCH-1988) Make nested output directory dump optional

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1988: Attachment: NUTCH-1988v2.patch Patch for trunk which reintroduces this patch. I have

[jira] [Reopened] (NUTCH-1988) Make nested output directory dump optional

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reopened NUTCH-1988: - Assignee: Lewis John McGibbney (was: Chris A. Mattmann) This issue seems to hav

[jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1800: Issue Type: New Feature (was: Bug) > Documentation for Nutch 1.X and 2.X REST APIs

[jira] [Created] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings

2015-10-29 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2157: --- Summary: Parent Issue for Addressing Miredot REST API Warnings Key: NUTCH-2157 URL: https://issues.apache.org/jira/browse/NUTCH-2157 Project: Nutch

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14981251#comment-14981251 ] Lewis John McGibbney commented on NUTCH-1800: - Agreed! Committed @revision 171

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14981186#comment-14981186 ] Lewis John McGibbney commented on NUTCH-1800: - [~sujenshah] did you see that t

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979834#comment-14979834 ] Lewis John McGibbney commented on NUTCH-1800: - Improvements can be seen here

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979831#comment-14979831 ] Lewis John McGibbney commented on NUTCH-1800: - For those who want to see the d

[jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1800: Flags: Patch Patch Info: Patch Available > Documentation for Nutch 1.X and

[jira] [Commented] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979828#comment-14979828 ] Lewis John McGibbney commented on NUTCH-1800: - [~sujenshah] > Documentation f

[jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1800: Attachment: NUTCH-1800.patch Patch for trunk. Currently uses my own license key (whi

[jira] [Assigned] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-1800: --- Assignee: Lewis John McGibbney > Documentation for Nutch 1.X and 2.X REST API

[jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs

2015-10-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1800: Fix Version/s: (was: 2.4) 2.3.1 1.11 > Doc

[jira] [Updated] (NUTCH-2147) MetadataScoringFilter for Nutch

2015-10-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2147: Description: This issue originally started by envisioning an implementation of a La

[jira] [Updated] (NUTCH-2147) MetadataScoringFilter for Nutch

2015-10-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2147: Summary: MetadataScoringFilter for Nutch (was: LanguagePreferenceScoringFilter for

[jira] [Commented] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch

2015-10-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14974565#comment-14974565 ] Lewis John McGibbney commented on NUTCH-2147: - Hi [~markus17], I never took a

[jira] [Closed] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-2148. --- > Review and update mapred --> mapreduce config params in crawl script > -

[jira] [Resolved] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2148. - Resolution: Fixed Committed @revision 1709943 in trunk > Review and update mapred

[jira] [Updated] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2148: Fix Version/s: (was: 2.3.1) > Review and update mapred --> mapreduce config para

[jira] [Updated] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2148: Attachment: NUTCH-2148v2.patch Updated patch for trunk, this deals with the parse co

[jira] [Commented] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14967986#comment-14967986 ] Lewis John McGibbney commented on NUTCH-2148: - These ones also {code} 15/10/21

[jira] [Commented] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14966183#comment-14966183 ] Lewis John McGibbney commented on NUTCH-2148: - Seeing as the crawl script for

[jira] [Updated] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2148: Attachment: NUTCH-2148.patch Patch for trunk > Review and update mapred --> mapredu

[jira] [Created] (NUTCH-2148) Review and update mapred --> mapreduce config params in crawl script

2015-10-20 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2148: --- Summary: Review and update mapred --> mapreduce config params in crawl script Key: NUTCH-2148 URL: https://issues.apache.org/jira/browse/NUTCH-2148 Proj

[jira] [Created] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch

2015-10-20 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2147: --- Summary: LanguagePreferenceScoringFilter for Nutch Key: NUTCH-2147 URL: https://issues.apache.org/jira/browse/NUTCH-2147 Project: Nutch Issue T

[jira] [Assigned] (NUTCH-2147) LanguagePreferenceScoringFilter for Nutch

2015-10-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2147: --- Assignee: Lewis John McGibbney > LanguagePreferenceScoringFilter for Nutch >

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-10-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14942846#comment-14942846 ] Lewis John McGibbney commented on NUTCH-2086: - Nice Aron -- *Lewis* > N

[jira] [Created] (NUTCH-2130) copyField rawcontent creates error within schema.xml

2015-09-30 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2130: --- Summary: copyField rawcontent creates error within schema.xml Key: NUTCH-2130 URL: https://issues.apache.org/jira/browse/NUTCH-2130 Project: Nutch

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-09-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14933808#comment-14933808 ] Lewis John McGibbney commented on NUTCH-2086: - Folks this is committed @revisi

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-09-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14909345#comment-14909345 ] Lewis John McGibbney commented on NUTCH-2086: - I would also be +1 based on thi

[jira] [Updated] (NUTCH-2086) Nutch 1.X Webui

2015-09-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2086: Attachment: NUTCH-2086.patch Updated patch to accommodate src/bin/nutch logic but the

[jira] [Updated] (NUTCH-2086) Nutch 1.X Webui

2015-09-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2086: Attachment: (was: NUTCH-2086.patch) > Nutch 1.X Webui > > >

[jira] [Commented] (NUTCH-2086) Nutch 1.X Webui

2015-09-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14909103#comment-14909103 ] Lewis John McGibbney commented on NUTCH-2086: - OK, so far I've found that src/

[jira] [Updated] (NUTCH-2086) Nutch 1.X Webui

2015-09-25 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2086: Attachment: NUTCH-2086.patch Patch for trunk. [~sujenshah] this was starting to get

[jira] [Commented] (NUTCH-1644) Should have a parser that uses xpath

2015-09-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14907644#comment-14907644 ] Lewis John McGibbney commented on NUTCH-1644: - [~bipin] bq. will this be fixed

[jira] [Created] (NUTCH-2122) Implement Javadoc package.html for service packages

2015-09-24 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2122: --- Summary: Implement Javadoc package.html for service packages Key: NUTCH-2122 URL: https://issues.apache.org/jira/browse/NUTCH-2122 Project: Nutch

[jira] [Created] (NUTCH-2120) Remove MapWritable from trunk codebase

2015-09-24 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2120: --- Summary: Remove MapWritable from trunk codebase Key: NUTCH-2120 URL: https://issues.apache.org/jira/browse/NUTCH-2120 Project: Nutch Issue Type

[jira] [Resolved] (NUTCH-2117) NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2117. - Resolution: Fixed Committed @revision 1704972 in trunk. > NutchServer CLI Option

[jira] [Created] (NUTCH-2117) NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST

2015-09-23 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2117: --- Summary: NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST Key: NUTCH-2117 URL: https://issues.apache.org/jira/browse/NUTCH-2117 P

[jira] [Resolved] (NUTCH-2115) Add total counts to dump stats

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2115. - Resolution: Fixed Assignee: Michael Joyce Nice patch Mike Committed revision

[jira] [Created] (NUTCH-2116) NutchServer and NutchApp should contain shutdown hooks

2015-09-23 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2116: --- Summary: NutchServer and NutchApp should contain shutdown hooks Key: NUTCH-2116 URL: https://issues.apache.org/jira/browse/NUTCH-2116 Project: Nutch

[jira] [Resolved] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2111. - Resolution: Fixed Committed @revision 1704896 in trunk > Delete temporary files l

[jira] [Updated] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2111: Assignee: Kim Whitehall (was: Lewis John McGibbney) > Delete temporary files locati

[jira] [Updated] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2111: Summary: Delete temporary files location for selenium tmp files after driver quits

[jira] [Assigned] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2111: --- Assignee: Lewis John McGibbney > Delete temporary files location for selenium

[jira] [Commented] (NUTCH-2111) Set temporary file location for selenium tmp files

2015-09-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14904070#comment-14904070 ] Lewis John McGibbney commented on NUTCH-2111: - Hi [~kwhitehall] bq. The patch

[jira] [Resolved] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation

2015-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2018. - Resolution: Fixed Added to release management HOWTO > Ensure that the Docker cont

[jira] [Resolved] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1

2015-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2105. - Resolution: Fixed Committed @revision 1704754 in 2.X HEAD > Update Nutch Cassandr

[jira] [Updated] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation

2015-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2018: Issue Type: Improvement (was: Bug) > Ensure that the Docker containers for Nutch 2.

[jira] [Updated] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1

2015-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2105: Attachment: NUTCH-2105.patch Patch for 2.X HEAD Would like to commit today and get a

[jira] [Commented] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14899915#comment-14899915 ] Lewis John McGibbney commented on NUTCH-2105: - I will work on this tomorrow th

[jira] [Resolved] (NUTCH-2028) java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2028. - Resolution: Fixed This issue has been resolved in upgrade to Gora 0.6.1. > java.l

[jira] [Commented] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14899916#comment-14899916 ] Lewis John McGibbney commented on NUTCH-2018: - I'll deal with this tomorrow fo

[jira] [Resolved] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X HBase Docker

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2050. - Resolution: Fixed Committed @revision 1704129 in 2.X HEAD > Upgrade HBase and Had

[jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X HBase Docker

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2050: Summary: Upgrade HBase and Hadoop versioning on 2.X HBase Docker (was: Upgrade HBa

[jira] [Resolved] (NUTCH-1572) Nutch 2.x should use o.a.g.mem.store.MemStore for testing

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1572. - Resolution: Fixed Resolved with NUTCH-1946 > Nutch 2.x should use o.a.g.mem.store

[jira] [Resolved] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp)

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1286. - Resolution: Won't Fix > Refactoring/reimplementing crawling API (NutchApp) > -

[jira] [Resolved] (NUTCH-2101) Upgrade Nutch 2.X to Hadoop 2.5.1

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2101. - Resolution: Fixed resolved in NUTCH-1946 > Upgrade Nutch 2.X to Hadoop 2.5.1 > --

[jira] [Resolved] (NUTCH-1946) Upgrade to Gora 0.6.1

2015-09-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1946. - Resolution: Fixed Committed @revision 1704128 in 2.X HEAD > Upgrade to Gora 0.6.1

[jira] [Commented] (NUTCH-2106) Runtime to contain Selenium and dependencies only once

2015-09-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14877251#comment-14877251 ] Lewis John McGibbney commented on NUTCH-2106: - +1 for commit [~wastl-nagel] >

[jira] [Commented] (NUTCH-2111) Set temporary file location for selenium tmp files

2015-09-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14877248#comment-14877248 ] Lewis John McGibbney commented on NUTCH-2111: - lol > Set temporary file locat

[jira] [Commented] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI

2015-09-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876902#comment-14876902 ] Lewis John McGibbney commented on NUTCH-2094: - Grand thanks Chris for committi

[jira] [Commented] (NUTCH-2106) Runtime to contain Selenium and dependencies only once

2015-09-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14805366#comment-14805366 ] Lewis John McGibbney commented on NUTCH-2106: - [~kwhitehall] let's touch base o

[jira] [Commented] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14803379#comment-14803379 ] Lewis John McGibbney commented on NUTCH-2050: - ACK. We are on it and will have

[jira] [Commented] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14803311#comment-14803311 ] Lewis John McGibbney commented on NUTCH-2050: - Hi [~stack] I agree, GORA-443 w

[jira] [Updated] (NUTCH-1169) Write JUnit tests for urlfilter-prefix

2015-09-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1169: Fix Version/s: (was: 2.4) 2.3.1 > Write JUnit tests for urlfi
