[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16391430#comment-16391430 ] Lewis John McGibbney commented on NUTCH-2517: - Hi [~mebbinghaus] I ran it from

[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16388718#comment-16388718 ] Lewis John McGibbney commented on NUTCH-2517: - Should be noted that I didn't r

[jira] [Assigned] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2517: --- Assignee: Lewis John McGibbney > mergesegs corrupts segment data > --

[jira] [Comment Edited] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16388650#comment-16388650 ] Lewis John McGibbney edited comment on NUTCH-2517 at 3/6/18 11:09 PM: --

[jira] [Comment Edited] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16388650#comment-16388650 ] Lewis John McGibbney edited comment on NUTCH-2517 at 3/6/18 10:50 PM: --

[jira] [Comment Edited] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16388650#comment-16388650 ] Lewis John McGibbney edited comment on NUTCH-2517 at 3/6/18 10:49 PM: --

[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16388650#comment-16388650 ] Lewis John McGibbney commented on NUTCH-2517: - I cannot reproduce this... see

[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16386469#comment-16386469 ] Lewis John McGibbney commented on NUTCH-2517: - Thank you [~mebbinghaus] for re

[jira] [Updated] (NUTCH-2517) mergesegs corrupts segment data

2018-03-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2517: Priority: Blocker (was: Major) > mergesegs corrupts segment data >

[jira] [Updated] (NUTCH-2517) mergesegs corrupts segment data

2018-03-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2517: Fix Version/s: 1.15 > mergesegs corrupts segment data >

[jira] [Updated] (NUTCH-2516) Hadoop imports use wildcards

2018-02-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2516: Description: Right now the Hadoop imports use wildcards all over the place. We want

[jira] [Created] (NUTCH-2516) Hadoop imports use wildcards

2018-02-27 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2516: --- Summary: Hadoop imports use wildcards Key: NUTCH-2516 URL: https://issues.apache.org/jira/browse/NUTCH-2516 Project: Nutch Issue Type: Improvem

[jira] [Resolved] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce

2018-02-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2375. - Resolution: Fixed > Upgrade the code base from org.apache.hadoop.mapred to > org.

[jira] [Assigned] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce

2018-02-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2375: --- Assignee: Lewis John McGibbney > Upgrade the code base from org.apache.hadoop

[jira] [Commented] (NUTCH-2512) Nutch 1.14 does not work under JDK9

2018-02-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16373173#comment-16373173 ] Lewis John McGibbney commented on NUTCH-2512: - Hi [~Bl4ck1c3] thanks for loggi

[jira] [Updated] (NUTCH-2512) Nutch 1.14 does not work under JDK9

2018-02-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2512: Fix Version/s: 1.15 > Nutch 1.14 does not work under JDK9 >

[jira] [Resolved] (NUTCH-2489) Dependency collision with lucene-analyzers-common in scoring-similarity plugin

2018-02-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2489. - Resolution: Fixed Thank you [~yossi] > Dependency collision with lucene-analyzers

[jira] [Updated] (NUTCH-2489) Dependency collision with lucene-analyzers-common in scoring-similarity plugin

2018-02-07 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2489: Fix Version/s: 1.15 > Dependency collision with lucene-analyzers-common in scoring-s

[jira] [Resolved] (NUTCH-2508) Misleading documentation about http.proxy.exception.list

2018-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2508. - Resolution: Fixed Thank you [~mfeltscher] > Misleading documentation about http.p

[jira] [Updated] (NUTCH-2508) Misleading documentation about http.proxy.exception.list

2018-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2508: Fix Version/s: 1.15 > Misleading documentation about http.proxy.exception.list > ---

[jira] [Commented] (NUTCH-2369) Create a new GraphGenerator Tool for writing Nutch Records as a Full Web Graph

2018-01-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16341326#comment-16341326 ] Lewis John McGibbney commented on NUTCH-2369: - Hi [~markus17] the idea here wa

[jira] [Updated] (NUTCH-2369) Create a new GraphGenerator Tool for writing Nutch Records as a Full Web Graph

2018-01-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2369: Labels: gsoc2017 gsoc2018 (was: gsoc2017) > Create a new GraphGenerator Tool for wr

[jira] [Resolved] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering

2018-01-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2502. - Resolution: Fixed Thank you [~mfeltscher] > Any23 Plugin: Add Content-Type filter

[jira] [Updated] (NUTCH-2502) Any23 Plugin: Add Content-Type filtering

2018-01-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2502: Fix Version/s: 1.15 > Any23 Plugin: Add Content-Type filtering > ---

[jira] [Updated] (NUTCH-2499) Elastic REST Indexer: Duplicate values

2018-01-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2499: Fix Version/s: 1.15 > Elastic REST Indexer: Duplicate values > -

[jira] [Resolved] (NUTCH-2499) Elastic REST Indexer: Duplicate values

2018-01-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2499. - Resolution: Fixed Thank you [~mfeltscher]   > Elastic REST Indexer: Duplicate va

[jira] [Resolved] (NUTCH-2503) Add option to run tests for a single plugin

2018-01-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2503. - Resolution: Fixed Thank you [~mfeltscher] > Add option to run tests for a single

[jira] [Updated] (NUTCH-2503) Add option to run tests for a single plugin

2018-01-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2503: Fix Version/s: 1.15 > Add option to run tests for a single plugin >

[jira] [Resolved] (NUTCH-2441) ARG_SEGMENT usage

2018-01-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2441. - Resolution: Fixed Thank you [~semyon.semyo...@mail.com] > ARG_SEGMENT usage > ---

[jira] [Resolved] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts

2018-01-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2497. - Resolution: Fixed Thank you [~mfeltscher] > Elastic REST Indexer: Allow multiple

[jira] [Updated] (NUTCH-2497) Elastic REST Indexer: Allow multiple hosts

2018-01-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2497: Fix Version/s: 1.15 > Elastic REST Indexer: Allow multiple hosts > -

[jira] [Resolved] (NUTCH-2461) Generate passes the data to when maxCount == 0

2018-01-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2461. - Resolution: Fixed Thank you [~semyon.semyo...@mail.com] > Generate passes the dat

[jira] [Commented] (NUTCH-2321) Indexing filter checker leaks threads

2018-01-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16326453#comment-16326453 ] Lewis John McGibbney commented on NUTCH-2321: - Thank you [~jurian] > Indexing

[jira] [Resolved] (NUTCH-2321) Indexing filter checker leaks threads

2018-01-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2321. - Resolution: Fixed > Indexing filter checker leaks threads > --

[jira] [Updated] (NUTCH-1129) Any23 Nutch plugin

2018-01-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1129: Fix Version/s: (was: 2.5) 1.15 > Any23 Nutch plugin > ---

[jira] [Resolved] (NUTCH-1129) Any23 Nutch plugin

2018-01-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1129. - Resolution: Fixed Thank you [~mfeltscher] this is great > Any23 Nutch plugin > --

[jira] [Resolved] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script

2018-01-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2493. - Resolution: Fixed Thank you [~mfeltscher] > Add configuration parameter for sitem

[jira] [Updated] (NUTCH-2493) Add configuration parameter for sitemap processing to crawler script

2018-01-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2493: Fix Version/s: 1.15 > Add configuration parameter for sitemap processing to crawler

[jira] [Resolved] (NUTCH-2324) Issue in setting default linkdb path

2018-01-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2324. - Resolution: Fixed Thank you [~sachin] > Issue in setting default linkdb path > -

[jira] [Updated] (NUTCH-2324) Issue in setting default linkdb path

2018-01-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2324: Fix Version/s: 1.15 > Issue in setting default linkdb path > --

[jira] [Resolved] (NUTCH-2492) Add more configuration parameters to crawl script

2018-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2492. - Resolution: Fixed Thank you [~mfeltscher] > Add more configuration parameters to

[jira] [Updated] (NUTCH-2492) Add more configuration parameters to crawl script

2018-01-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2492: Fix Version/s: 1.15 > Add more configuration parameters to crawl script > -

[jira] [Updated] (NUTCH-2490) Sitemap processing: Sitemap index files not working

2018-01-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2490: Fix Version/s: 1.15 > Sitemap processing: Sitemap index files not working >

[jira] [Resolved] (NUTCH-2490) Sitemap processing: Sitemap index files not working

2018-01-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2490. - Resolution: Fixed Thank you [~mfeltscher] > Sitemap processing: Sitemap index fil

[jira] [Resolved] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script

2018-01-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2491. - Resolution: Fixed Thank you [~mfeltscher] > Integrate sitemap processing and Host

[jira] [Updated] (NUTCH-2491) Integrate sitemap processing and HostDB into crawl script

2018-01-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2491: Fix Version/s: 1.15 > Integrate sitemap processing and HostDB into crawl script > --

[jira] [Resolved] (NUTCH-2454) REST API fix for usage of hostdb in generator

2018-01-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2454. - Resolution: Fixed Thank you [~semyon.semyo...@mail.com] > REST API fix for usage

[jira] [Resolved] (NUTCH-2486) Compiler Warning: Unchecked / unsafe operations in MimeTypeIndexingFilter

2017-12-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2486. - Resolution: Fixed > Compiler Warning: Unchecked / unsafe operations in MimeTypeInd

[jira] [Updated] (NUTCH-2486) Compiler Warning: Unchecked / unsafe operations in MimeTypeIndexingFilter

2017-12-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2486: Fix Version/s: 1.14 > Compiler Warning: Unchecked / unsafe operations in MimeTypeInd

[jira] [Resolved] (NUTCH-2358) HostInjectorJob doesn't work

2017-12-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2358. - Resolution: Fixed > HostInjectorJob doesn't work > >

[jira] [Commented] (NUTCH-2358) HostInjectorJob doesn't work

2017-12-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16294236#comment-16294236 ] Lewis John McGibbney commented on NUTCH-2358: - Thank you [~cloudysunny14] patc

[jira] [Updated] (NUTCH-2358) HostInjectorJob doesn't work

2017-12-17 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2358: Fix Version/s: 2.4 > HostInjectorJob doesn't work > > >

[jira] [Resolved] (NUTCH-2484) Extend indexer-elastic-rest to support languages

2017-12-16 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2484. - Resolution: Fixed > Extend indexer-elastic-rest to support languages > ---

[jira] [Created] (NUTCH-2484) Extend indexer-elastic-rest to support languages

2017-12-16 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2484: --- Summary: Extend indexer-elastic-rest to support languages Key: NUTCH-2484 URL: https://issues.apache.org/jira/browse/NUTCH-2484 Project: Nutch

[jira] [Commented] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings

2017-12-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292830#comment-16292830 ] Lewis John McGibbney commented on NUTCH-2157: - There are still many warnings.

[jira] [Updated] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings

2017-12-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2157: Fix Version/s: (was: 1.14) 1.15 > Parent Issue for Addressing

[jira] [Resolved] (NUTCH-2181) Add Webpage for 3rd Party Connectors/Libraries to Apache Nutch

2017-12-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2181. - Resolution: Won't Fix Fix Version/s: 1.14 These are never kept up-to-date

[jira] [Updated] (NUTCH-2185) protocol-soda-consumer plugin

2017-12-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2185: Fix Version/s: (was: 1.15) 1.14 > protocol-soda-consumer plug

[jira] [Resolved] (NUTCH-2185) protocol-soda-consumer plugin

2017-12-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2185. - Resolution: Won't Fix This was a very limited use case and is not worth integratio

[jira] [Resolved] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong depenency

2017-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2473. - Resolution: Fixed > Elasticsearch REST Indexer broken due to wrong depenency > ---

[jira] [Updated] (NUTCH-2473) Elasticsearch REST Indexer broken due to wrong depenency

2017-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2473: Fix Version/s: 1.14 > Elasticsearch REST Indexer broken due to wrong depenency > ---

[jira] [Assigned] (NUTCH-2414) Allow LanguageIndexingFilter to actually filter documents by language.

2017-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2414: --- Assignee: Lewis John McGibbney > Allow LanguageIndexingFilter to actually fil

[jira] [Updated] (NUTCH-2414) Allow LanguageIndexingFilter to actually filter documents by language.

2017-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2414: Fix Version/s: 1.14 > Allow LanguageIndexingFilter to actually filter documents by l

[jira] [Resolved] (NUTCH-2414) Allow LanguageIndexingFilter to actually filter documents by language.

2017-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2414. - Resolution: Fixed > Allow LanguageIndexingFilter to actually filter documents by l

[jira] [Resolved] (NUTCH-2438) Upgrade Nutch 2.X to Gora 0.8

2017-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2438. - Resolution: Fixed > Upgrade Nutch 2.X to Gora 0.8 > -

[jira] [Assigned] (NUTCH-2438) Upgrade Nutch 2.X to Gora 0.8

2017-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2438: --- Assignee: Lewis John McGibbney > Upgrade Nutch 2.X to Gora 0.8 >

[jira] [Resolved] (NUTCH-2437) gora mongodb mapping file error

2017-10-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2437. - Resolution: Fixed > gora mongodb mapping file error >

[jira] [Updated] (NUTCH-2374) Upgrade Nutch 2.X to Gora 0.7

2017-10-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2374: Issue Type: Improvement (was: Bug) > Upgrade Nutch 2.X to Gora 0.7 > --

[jira] [Resolved] (NUTCH-2374) Upgrade Nutch 2.X to Gora 0.7

2017-10-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2374. - Resolution: Fixed > Upgrade Nutch 2.X to Gora 0.7 > -

[jira] [Resolved] (NUTCH-2436) Remove empty comment, and redundant semicolon from CommandRunner

2017-09-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2436. - Resolution: Fixed Thank you [~kpm1985] > Remove empty comment, and redundant semi

[jira] [Updated] (NUTCH-2436) Remove empty comment, and redundant semicolon from CommandRunner

2017-09-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2436: Fix Version/s: 1.14 > Remove empty comment, and redundant semicolon from CommandRunn

[jira] [Resolved] (NUTCH-2235) Classpath discrepancy with protocol-selenium in deploy mode

2017-09-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2235. - Resolution: Fixed NUTCH-2378 > Classpath discrepancy with protocol-selenium in de

[jira] [Resolved] (NUTCH-2399) indexer-elastic does not index multi-value fields (only the first value is indexed)

2017-08-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2399. - Resolution: Fixed > indexer-elastic does not index multi-value fields (only the fi

[jira] [Updated] (NUTCH-2399) indexer-elastic does not index multi-value fields (only the first value is indexed)

2017-08-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2399: Fix Version/s: 1.14 > indexer-elastic does not index multi-value fields (only the fi

[jira] [Resolved] (NUTCH-2400) Solr 6.6.0 compatibility

2017-08-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2400. - Resolution: Fixed > Solr 6.6.0 compatibility > > >

[jira] [Resolved] (NUTCH-2405) jsoup-extractor structure correction, typo fixed

2017-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2405. - Resolution: Fixed Thank you [~kaidul] > jsoup-extractor structure correction, typ

[jira] [Updated] (NUTCH-2406) Sum up constants, make minor changes

2017-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2406: Fix Version/s: 1.14 > Sum up constants, make minor changes > ---

[jira] [Assigned] (NUTCH-2406) Sum up constants, make minor changes

2017-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2406: --- Assignee: kenneth mcfarland > Sum up constants, make minor changes >

[jira] [Resolved] (NUTCH-2406) Sum up constants, make minor changes

2017-08-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2406. - Resolution: Fixed > Sum up constants, make minor changes > ---

[jira] [Resolved] (NUTCH-2404) Failed Jenkin Build #1588 error in unit test resolved

2017-07-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2404. - Resolution: Fixed Thank you [~kaidul] > Failed Jenkin Build #1588 error in unit t

[jira] [Resolved] (NUTCH-2389) Precise data parsing using Jsoup CSS selectors

2017-07-30 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2389. - Resolution: Fixed Thank you [~kaidul] > Precise data parsing using Jsoup CSS sele

[jira] [Commented] (NUTCH-1129) Any23 Nutch plugin

2017-07-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16102898#comment-16102898 ] Lewis John McGibbney commented on NUTCH-1129: - We need some sort of reasonable

[jira] [Assigned] (NUTCH-2403) Nutch Selenium: Wrong documentation about PhantomJS

2017-07-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2403: --- Assignee: Moreno Feltscher > Nutch Selenium: Wrong documentation about Phanto

[jira] [Updated] (NUTCH-2403) Nutch Selenium: Wrong documentation about PhantomJS

2017-07-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2403: Affects Version/s: 1.13 > Nutch Selenium: Wrong documentation about PhantomJS >

[jira] [Updated] (NUTCH-2403) Nutch Selenium: Wrong documentation about PhantomJS

2017-07-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2403: Fix Version/s: 1.14 > Nutch Selenium: Wrong documentation about PhantomJS >

[jira] [Resolved] (NUTCH-2403) Nutch Selenium: Wrong documentation about PhantomJS

2017-07-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2403. - Resolution: Fixed > Nutch Selenium: Wrong documentation about PhantomJS >

[jira] [Updated] (NUTCH-2403) Nutch Selenium: Wrong documentation about PhantomJS

2017-07-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2403: Component/s: plugin documentation > Nutch Selenium: Wrong documenta

[jira] [Updated] (NUTCH-2400) Solr 6.6.0 compatibility

2017-07-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2400: Attachment: managed-schema This is the managed-schema generated from schema.xml, ple

[jira] [Created] (NUTCH-2400) Solr 6.6.0 compatibility

2017-07-12 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2400: --- Summary: Solr 6.6.0 compatibility Key: NUTCH-2400 URL: https://issues.apache.org/jira/browse/NUTCH-2400 Project: Nutch Issue Type: Improvement

[jira] [Commented] (NUTCH-1465) Support sitemaps in Nutch

2017-07-04 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16074046#comment-16074046 ] Lewis John McGibbney commented on NUTCH-1465: - [~markus17] can we also update

[jira] [Commented] (NUTCH-1465) Support sitemaps in Nutch

2017-07-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16072959#comment-16072959 ] Lewis John McGibbney commented on NUTCH-1465: - [~markus17] when attempting to

[jira] [Commented] (NUTCH-1465) Support sitemaps in Nutch

2017-07-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16072954#comment-16072954 ] Lewis John McGibbney commented on NUTCH-1465: - Hi [~markus17] I went ahead and

[jira] [Commented] (NUTCH-1465) Support sitemaps in Nutch

2017-06-30 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16070468#comment-16070468 ] Lewis John McGibbney commented on NUTCH-1465: - Fantastic [~markus17] is this w

[jira] [Commented] (NUTCH-2184) Enable IndexingJob to function with no crawldb

2017-06-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16068597#comment-16068597 ] Lewis John McGibbney commented on NUTCH-2184: - Hi [~markus17] I need to finish

[jira] [Commented] (NUTCH-2389) Precise data parsing using Jsoup CSS selectors

2017-06-01 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16034162#comment-16034162 ] Lewis John McGibbney commented on NUTCH-2389: - [~kaidul], i think that the plu

[jira] [Resolved] (NUTCH-2388) bin/crawl indexing only webpages containing batchID instead of all in 2.x

2017-05-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2388. - Resolution: Fixed > bin/crawl indexing only webpages containing batchID instead of

[jira] [Commented] (NUTCH-2382) indexer-hbase Nutch 1.x branch

2017-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16020207#comment-16020207 ] Lewis John McGibbney commented on NUTCH-2382: - I am +1 for committing this to

[jira] [Resolved] (NUTCH-2373) Indexer for Hbase

2017-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2373. - Resolution: Fixed > Indexer for Hbase > - > > Key:

[jira] [Resolved] (NUTCH-2353) Create seed file with metadata using the REST API

2017-05-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2353. - Resolution: Fixed Nice work [~jorgelbg] > Create seed file with metadata using th

[jira] [Commented] (NUTCH-1465) Support sitemaps in Nutch

2017-04-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15979078#comment-15979078 ] Lewis John McGibbney commented on NUTCH-1465: - I'm going to take this on. We w

<    1   2   3   4   5   6   7   8   9   10   >