[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16386469#comment-16386469 ] Lewis John McGibbney commented on NUTCH-2517: - Thank you [~mebbinghaus] for reporting. This

[jira] [Updated] (NUTCH-2517) mergesegs corrupts segment data

2018-03-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2517: Priority: Blocker (was: Major) > mergesegs corrupts segment data >

[jira] [Commented] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16386339#comment-16386339 ] ASF GitHub Bot commented on NUTCH-2519: --- lewismc commented on issue #287: NUTCH-2519 Log mapreduce

[jira] [Commented] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16386336#comment-16386336 ] ASF GitHub Bot commented on NUTCH-2520: --- lewismc commented on issue #288: NUTCH-2520 Use default

[jira] [Commented] (NUTCH-2521) SitemapProcessor to use property sitemap.redir.max

2018-03-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16386334#comment-16386334 ] ASF GitHub Bot commented on NUTCH-2521: --- lewismc commented on issue #289: NUTCH-2521

[jira] [Updated] (NUTCH-2523) UpdateHostDB blocks plugins unintenionally

2018-03-05 Thread Yossi Tamari (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yossi Tamari updated NUTCH-2523: Attachment: NUTCH-2523.tamari.180305.patch.txt > UpdateHostDB blocks plugins unintenionally >

[jira] [Created] (NUTCH-2523) UpdateHostDB blocks plugins unintenionally

2018-03-05 Thread Yossi Tamari (JIRA)
Yossi Tamari created NUTCH-2523: --- Summary: UpdateHostDB blocks plugins unintenionally Key: NUTCH-2523 URL: https://issues.apache.org/jira/browse/NUTCH-2523 Project: Nutch Issue Type: Bug

[jira] [Created] (NUTCH-2522) Bidirectional URL exemption filter

2018-03-05 Thread Semyon Semyonov (JIRA)
Semyon Semyonov created NUTCH-2522: -- Summary: Bidirectional URL exemption filter Key: NUTCH-2522 URL: https://issues.apache.org/jira/browse/NUTCH-2522 Project: Nutch Issue Type:

[no subject]

2018-03-05 Thread Muhammet Dinç

[jira] [Created] (NUTCH-2521) SitemapProcessor to use property sitemap.redir.max

2018-03-05 Thread Sebastian Nagel (JIRA)
Sebastian Nagel created NUTCH-2521: -- Summary: SitemapProcessor to use property sitemap.redir.max Key: NUTCH-2521 URL: https://issues.apache.org/jira/browse/NUTCH-2521 Project: Nutch Issue

[jira] [Commented] (NUTCH-2521) SitemapProcessor to use property sitemap.redir.max

2018-03-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16386021#comment-16386021 ] ASF GitHub Bot commented on NUTCH-2521: --- sebastian-nagel opened a new pull request #289: NUTCH-2521

[jira] [Commented] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16386018#comment-16386018 ] ASF GitHub Bot commented on NUTCH-2520: --- sebastian-nagel opened a new pull request #288: NUTCH-2520

[jira] [Created] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-05 Thread Sebastian Nagel (JIRA)
Sebastian Nagel created NUTCH-2520: -- Summary: Wrong Accept-Charset sent when http.accept.charset is not defined Key: NUTCH-2520 URL: https://issues.apache.org/jira/browse/NUTCH-2520 Project: Nutch

[jira] [Commented] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16386009#comment-16386009 ] ASF GitHub Bot commented on NUTCH-2519: --- sebastian-nagel opened a new pull request #287: NUTCH-2519

[jira] [Created] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-05 Thread Sebastian Nagel (JIRA)
Sebastian Nagel created NUTCH-2519: -- Summary: Log mapreduce job counters in local mode Key: NUTCH-2519 URL: https://issues.apache.org/jira/browse/NUTCH-2519 Project: Nutch Issue Type:

[jira] [Commented] (NUTCH-2518) Must check return value of job.waitForCompletion()

2018-03-05 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16385963#comment-16385963 ] Sebastian Nagel commented on NUTCH-2518: It seems to affect all 25 occurrences of {code:java} int

[jira] [Commented] (NUTCH-2518) Must check return value of job.waitForCompletion()

2018-03-05 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16385960#comment-16385960 ] Sebastian Nagel commented on NUTCH-2518: [~kamaci]: wasn't this part of your PR for NUTCH-2375

[jira] [Created] (NUTCH-2518) Must check return value of job.waitForCompletion()

2018-03-05 Thread Sebastian Nagel (JIRA)
Sebastian Nagel created NUTCH-2518: -- Summary: Must check return value of job.waitForCompletion() Key: NUTCH-2518 URL: https://issues.apache.org/jira/browse/NUTCH-2518 Project: Nutch Issue

[jira] [Updated] (NUTCH-2510) Crawl script modification. HostDb : generate, optional usage and descirption

2018-03-05 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2510: --- Fix Version/s: (was: 1.14) 1.15 > Crawl script modification. HostDb :

[jira] [Commented] (NUTCH-2310) Protocol-Selenium does not support HTTPS protocol

2018-03-05 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16385853#comment-16385853 ] Sebastian Nagel commented on NUTCH-2310: The