Jenkins build is back to normal : Nutch-trunk #3624

2019-05-06 Thread Apache Jenkins Server
See

[jira] [Commented] (NUTCH-2708) urlfilter-automaton: update library dependency (dk.brics.automaton)

2019-05-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16834003#comment-16834003 ] Hudson commented on NUTCH-2708: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3624 (Se

[jira] [Commented] (NUTCH-2716) protocol-http: Response headers are not stored for a compressed response

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833983#comment-16833983 ] Sebastian Nagel commented on NUTCH-2716: Thanks, [~yossi]! In case, you already w

[jira] [Resolved] (NUTCH-2708) urlfilter-automaton: update library dependency (dk.brics.automaton)

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2708. Resolution: Fixed Ok, updated the plugin.xml > urlfilter-automaton: update library depende

[jira] [Reopened] (NUTCH-2708) urlfilter-automaton: update library dependency (dk.brics.automaton)

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reopened NUTCH-2708: Ok, even for a small change in a plugin should have run the full set of unit tests. The feed p

[jira] [Commented] (NUTCH-2708) urlfilter-automaton: update library dependency (dk.brics.automaton)

2019-05-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833944#comment-16833944 ] Hudson commented on NUTCH-2708: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3623 (Se

[jira] [Commented] (NUTCH-2626) bin/crawl: remove option -noParsing from fetch command

2019-05-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833948#comment-16833948 ] Hudson commented on NUTCH-2626: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3623 (Se

[jira] [Commented] (NUTCH-2709) Remove unused properties and code related to HTTP protocol

2019-05-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833945#comment-16833945 ] Hudson commented on NUTCH-2709: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3623 (Se

Build failed in Jenkins: Nutch-trunk #3623

2019-05-06 Thread Apache Jenkins Server
See Changes: [sebastian] NUTCH-2690 Configurable and fast URL filter - performs fast exact [sebastian] NUTCH-2708 urlfilter-automaton: update library dependency [snagel] NUTCH-2709 Remove unused properties and code

[jira] [Commented] (NUTCH-2585) NPE in TrieStringMatcher

2019-05-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833947#comment-16833947 ] Hudson commented on NUTCH-2585: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3623 (Se

[jira] [Commented] (NUTCH-2690) Configurable and fast URL filter

2019-05-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833943#comment-16833943 ] Hudson commented on NUTCH-2690: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3623 (Se

[jira] [Resolved] (NUTCH-2690) Configurable and fast URL filter

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2690. Resolution: Implemented > Configurable and fast URL filter > --

[jira] [Work started] (NUTCH-2690) Configurable and fast URL filter

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2690 started by Sebastian Nagel. -- > Configurable and fast URL filter > > >

[jira] [Assigned] (NUTCH-2690) Configurable and fast URL filter

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2690: -- Assignee: Sebastian Nagel > Configurable and fast URL filter > ---

[jira] [Commented] (NUTCH-2690) Configurable and fast URL filter

2019-05-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833919#comment-16833919 ] ASF GitHub Bot commented on NUTCH-2690: --- sebastian-nagel commented on pull request

[jira] [Resolved] (NUTCH-2626) bin/crawl: remove option -noParsing from fetch command

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2626. Resolution: Fixed Committed in [290e3cb|https://github.com/apache/nutch/commit/290e3cb0071

[jira] [Assigned] (NUTCH-2626) bin/crawl: remove option -noParsing from fetch command

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2626: -- Assignee: Sebastian Nagel > bin/crawl: remove option -noParsing from fetch command > -

[jira] [Commented] (NUTCH-2716) protocol-http: Response headers are not stored for a compressed response

2019-05-06 Thread Yossi Tamari (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833917#comment-16833917 ] Yossi Tamari commented on NUTCH-2716: - OK, I'll try to submit a patch tomorrow along

[jira] [Resolved] (NUTCH-2709) Remove unused properties and code related to HTTP protocol

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2709. Resolution: Implemented > Remove unused properties and code related to HTTP protocol >

[jira] [Commented] (NUTCH-2709) Remove unused properties and code related to HTTP protocol

2019-05-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833914#comment-16833914 ] ASF GitHub Bot commented on NUTCH-2709: --- sebastian-nagel commented on pull request

[jira] [Assigned] (NUTCH-2709) Remove unused properties and code related to HTTP protocol

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2709: -- Assignee: Sebastian Nagel > Remove unused properties and code related to HTTP protocol

[jira] [Assigned] (NUTCH-2708) urlfilter-automaton: update library dependency (dk.brics.automaton)

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2708: -- Assignee: Sebastian Nagel > urlfilter-automaton: update library dependency (dk.brics.a

[jira] [Resolved] (NUTCH-2708) urlfilter-automaton: update library dependency (dk.brics.automaton)

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2708. Resolution: Implemented > urlfilter-automaton: update library dependency (dk.brics.automato

[jira] [Commented] (NUTCH-2708) urlfilter-automaton: update library dependency (dk.brics.automaton)

2019-05-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833911#comment-16833911 ] ASF GitHub Bot commented on NUTCH-2708: --- sebastian-nagel commented on pull request

[jira] [Commented] (NUTCH-2716) protocol-http: Response headers are not stored for a compressed response

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833907#comment-16833907 ] Sebastian Nagel commented on NUTCH-2716: I think removing the headers is not idea

[jira] [Resolved] (NUTCH-2585) NPE in TrieStringMatcher

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2585. Resolution: Fixed Merged/committed. Thanks, [~markus17]! > NPE in TrieStringMatcher >

[jira] [Commented] (NUTCH-2585) NPE in TrieStringMatcher

2019-05-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833898#comment-16833898 ] ASF GitHub Bot commented on NUTCH-2585: --- sebastian-nagel commented on pull request

[jira] [Commented] (NUTCH-2716) protocol-http: Response headers are not stored for a compressed response

2019-05-06 Thread Yossi Tamari (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833888#comment-16833888 ] Yossi Tamari commented on NUTCH-2716: - Actually, I meant to remove those two headers

[jira] [Resolved] (NUTCH-2688) Unify the licence headers

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2688. Resolution: Fixed Resolving - it's committed since more than one month. Please open a new i

[jira] [Resolved] (NUTCH-2694) HostDB to aggregate by long instead of integer

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2694. Resolution: Fixed Resolving as it's already committed. Thanks, [~markus17]! > HostDB to ag

[jira] [Updated] (NUTCH-2716) Response headers are not stored for a compressed response

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2716: --- Fix Version/s: 1.16 > Response headers are not stored for a compressed response > ---

[jira] [Commented] (NUTCH-2716) Response headers are not stored for a compressed response

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833874#comment-16833874 ] Sebastian Nagel commented on NUTCH-2716: Agreed: although this reverts NUTCH-2213

[jira] [Assigned] (NUTCH-2716) protocol-http: Response headers are not stored for a compressed response

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2716: -- Assignee: Sebastian Nagel > protocol-http: Response headers are not stored for a compr

[jira] [Updated] (NUTCH-2716) protocol-http: Response headers are not stored for a compressed response

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2716: --- Summary: protocol-http: Response headers are not stored for a compressed response (was: Resp

[jira] [Commented] (NUTCH-2715) WARCExporter fails on large records

2019-05-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833858#comment-16833858 ] Sebastian Nagel commented on NUTCH-2715: Hi [~yossi], thanks! Unfortunately both

[jira] [Created] (NUTCH-2716) Response headers are not stored for a compressed response

2019-05-06 Thread Yossi Tamari (JIRA)
Yossi Tamari created NUTCH-2716: --- Summary: Response headers are not stored for a compressed response Key: NUTCH-2716 URL: https://issues.apache.org/jira/browse/NUTCH-2716 Project: Nutch Issue T

[jira] [Created] (NUTCH-2715) WARCExporter fails on large records

2019-05-06 Thread Yossi Tamari (JIRA)
Yossi Tamari created NUTCH-2715: --- Summary: WARCExporter fails on large records Key: NUTCH-2715 URL: https://issues.apache.org/jira/browse/NUTCH-2715 Project: Nutch Issue Type: Bug Affects V

[jira] [Commented] (NUTCH-2585) NPE in TrieStringMatcher

2019-05-06 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16833675#comment-16833675 ] Markus Jelsma commented on NUTCH-2585: -- That seems fine enough! +1 > NPE in TrieStr