[jira] [Commented] (NUTCH-2412) Exchange component for indexing job

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519856#comment-16519856 ] ASF GitHub Bot commented on NUTCH-2412: --- r0ann3l commented on issue #340: Fixes for

[jira] [Commented] (NUTCH-2601) Elasticsearch Rest and Amazon CloudSearch have the same implementation class in indexer-writers.xml

2018-06-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519502#comment-16519502 ] Hudson commented on NUTCH-2601: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3537 (Se

[jira] [Commented] (NUTCH-2565) MergeDB incorrectly handles unfetched CrawlDatums

2018-06-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519500#comment-16519500 ] Hudson commented on NUTCH-2565: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3537 (Se

[jira] [Commented] (NUTCH-2597) NPE in updatehostdb

2018-06-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519501#comment-16519501 ] Hudson commented on NUTCH-2597: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3537 (Se

[jira] [Commented] (NUTCH-2600) Refactoring indexer-solr

2018-06-21 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519503#comment-16519503 ] Hudson commented on NUTCH-2600: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3537 (Se

[jira] [Created] (NUTCH-2609) urlnormalizer-basic to normalize path of file: URLs

2018-06-21 Thread Sebastian Nagel (JIRA)
Sebastian Nagel created NUTCH-2609: -- Summary: urlnormalizer-basic to normalize path of file: URLs Key: NUTCH-2609 URL: https://issues.apache.org/jira/browse/NUTCH-2609 Project: Nutch Issue T

[jira] [Resolved] (NUTCH-2601) Elasticsearch Rest and Amazon CloudSearch have the same implementation class in indexer-writers.xml

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2601. Resolution: Fixed. Merged. Thanks, [~roannel]. > Elasticsearch Rest and Amazon CloudSearch

[jira] [Commented] (NUTCH-2601) Elasticsearch Rest and Amazon CloudSearch have the same implementation class in indexer-writers.xml

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519455#comment-16519455 ] ASF GitHub Bot commented on NUTCH-2601: --- sebastian-nagel closed pull request #350:

[jira] [Commented] (NUTCH-2600) Refactoring indexer-solr

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519449#comment-16519449 ] ASF GitHub Bot commented on NUTCH-2600: --- sebastian-nagel commented on issue #351: f

[jira] [Resolved] (NUTCH-2600) Refactoring indexer-solr

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2600. Resolution: Implemented > Refactoring indexer-solr

[jira] [Commented] (NUTCH-2600) Refactoring indexer-solr

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519448#comment-16519448 ] ASF GitHub Bot commented on NUTCH-2600: --- sebastian-nagel closed pull request #351:

[jira] [Commented] (NUTCH-2565) MergeDB incorrectly handles unfetched CrawlDatums

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519443#comment-16519443 ] ASF GitHub Bot commented on NUTCH-2565: --- sebastian-nagel closed pull request #311:

[jira] [Updated] (NUTCH-2565) MergeDB incorrectly handles unfetched CrawlDatums

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2565: --- Fix Version/s: 1.15 > MergeDB incorrectly handles unfetched CrawlDatums > ---

[jira] [Resolved] (NUTCH-2565) MergeDB incorrectly handles unfetched CrawlDatums

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2565. Resolution: Fixed. Merged. Thanks, [~jurian]! > MergeDB incorrectly handles unfetched Crawl

[jira] [Updated] (NUTCH-2565) MergeDB incorrectly handles unfetched CrawlDatums

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2565: --- Component/s: crawldb > MergeDB incorrectly handles unfetched CrawlDatums > --

[jira] [Resolved] (NUTCH-2597) NPE in updatehostdb

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2597. Resolution: Fixed. Tested/verified solution and merged. Thanks, [~jurian]! > NPE in updateh

[jira] [Updated] (NUTCH-2597) NPE in updatehostdb

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2597: --- Fix Version/s: 1.15 > NPE in updatehostdb > --- > > Key: NUTC

[jira] [Commented] (NUTCH-2597) NPE in updatehostdb

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519437#comment-16519437 ] ASF GitHub Bot commented on NUTCH-2597: --- sebastian-nagel closed pull request #349:

[jira] [Commented] (NUTCH-2597) NPE in updatehostdb

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519438#comment-16519438 ] ASF GitHub Bot commented on NUTCH-2597: --- sebastian-nagel commented on issue #349: N

[jira] [Commented] (NUTCH-2412) Exchange component for indexing job

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519417#comment-16519417 ] ASF GitHub Bot commented on NUTCH-2412: --- sebastian-nagel commented on issue #340: F

[jira] [Commented] (NUTCH-2576) HTTP protocol plugin based on okhttp

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519281#comment-16519281 ] Sebastian Nagel commented on NUTCH-2576: Sharing some metrics from testing protoc

[jira] [Created] (NUTCH-2608) Reduce size of Nutch job file and package

2018-06-21 Thread Sebastian Nagel (JIRA)
Sebastian Nagel created NUTCH-2608: -- Summary: Reduce size of Nutch job file and package Key: NUTCH-2608 URL: https://issues.apache.org/jira/browse/NUTCH-2608 Project: Nutch Issue Type: Impro

Re: [ANNOUNCE] New Nutch committer and PMC - Omkar Reddy

2018-06-21 Thread Omkar Reddy
Thank you very much, Sebastian. Glad to be on board.
Cheers, Omkar
On 21 June 2018 at 13:48, Sebastian Nagel wrote:
> Dear all,
>
> it is my pleasure to announce that Omkar Reddy has joined us
> as a committer and member of the Nutch PMC. Omkar has worked
> on upgrading Nutch to use the new Map

[jira] [Updated] (NUTCH-2607) ParserChecker should call ScoringFilters.passScoreAfterParsing() on all parses

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2607: --- Labels: patch-available (was: ) > ParserChecker should call ScoringFilters.passScoreAfterPar

[jira] [Updated] (NUTCH-2607) ParserChecker should call ScoringFilters.passScoreAfterParsing() on all parses

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2607: --- Component/s: parser > ParserChecker should call ScoringFilters.passScoreAfterParsing() on all

[jira] [Commented] (NUTCH-2607) ParserChecker should call ScoringFilters.passScoreAfterParsing() on all parses

2018-06-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519121#comment-16519121 ] ASF GitHub Bot commented on NUTCH-2607: --- sebastian-nagel opened a new pull request

[jira] [Assigned] (NUTCH-2607) ParserChecker should call ScoringFilters.passScoreAfterParsing() on all parses

2018-06-21 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2607: -- Assignee: Sebastian Nagel > ParserChecker should call ScoringFilters.passScoreAfterPar

[jira] [Created] (NUTCH-2607) ParserChecker should call ScoringFilters.passScoreAfterParsing() on all parses

2018-06-21 Thread Sebastian Nagel (JIRA)
Sebastian Nagel created NUTCH-2607: -- Summary: ParserChecker should call ScoringFilters.passScoreAfterParsing() on all parses Key: NUTCH-2607 URL: https://issues.apache.org/jira/browse/NUTCH-2607 Proj

[ANNOUNCE] New Nutch committer and PMC - Omkar Reddy

2018-06-21 Thread Sebastian Nagel
Dear all, it is my pleasure to announce that Omkar Reddy has joined us as a committer and member of the Nutch PMC. Omkar has worked on upgrading Nutch to use the new MapReduce API as part of his Google Summer of Code project last year. Thanks, Omkar, and congratulations on your new role within th