[jira] [Updated] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-23 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2667: Description: Tika and Commons Collections 4 need to be updated. This issue needs

[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661740#comment-16661740 ] ASF GitHub Bot commented on NUTCH-2667: --- lewismc opened a new pull request #403: NUTCH-2667 Update

[jira] [Created] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-23 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2667: --- Summary: Update Tika and Commons Collections 4 Key: NUTCH-2667 URL: https://issues.apache.org/jira/browse/NUTCH-2667 Project: Nutch Issue

[jira] [Commented] (NUTCH-2630) Fetcher to log skipped records by robots.txt

2018-10-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661726#comment-16661726 ] ASF GitHub Bot commented on NUTCH-2630: --- lewismc commented on issue #387: NUTCH-2630 Fetcher to log

[jira] [Commented] (NUTCH-2655) Update Solr schema.xml for Solr 7.x

2018-10-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661725#comment-16661725 ] ASF GitHub Bot commented on NUTCH-2655: --- lewismc commented on issue #395: NUTCH-2655 Update Solr

[jira] [Commented] (NUTCH-2658) Add README file to all plugins in src/plugin

2018-10-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661724#comment-16661724 ] ASF GitHub Bot commented on NUTCH-2658: --- lewismc commented on issue #398: NUTCH-2658 Add README for

[jira] [Comment Edited] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Akshar Dave (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661680#comment-16661680 ] Akshar Dave edited comment on NUTCH-2665 at 10/24/18 4:08 AM: -- were you able

[jira] [Commented] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Akshar Dave (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661680#comment-16661680 ] Akshar Dave commented on NUTCH-2665: were you able to commit this change and successfully build? I am

Jenkins build is back to normal : Nutch-trunk #3578

2018-10-23 Thread Apache Jenkins Server
See

[jira] [Commented] (NUTCH-2659) Add missing Apache license headers

2018-10-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661343#comment-16661343 ] Hudson commented on NUTCH-2659: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3577 (See

Build failed in Jenkins: Nutch-trunk #3577

2018-10-23 Thread Apache Jenkins Server
See Changes: [snagel] NUTCH-2659 Add missing Apache license headers -- [...truncated 349.50 KB...] [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ]

[jira] [Commented] (NUTCH-2655) Update Solr schema.xml for Solr 7.x

2018-10-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661285#comment-16661285 ] ASF GitHub Bot commented on NUTCH-2655: --- jorgelbg commented on issue #395: NUTCH-2655 Update Solr

[jira] [Commented] (NUTCH-2661) Move TestOutlinks to the proper path

2018-10-23 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661261#comment-16661261 ] Hudson commented on NUTCH-2661: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3576 (See

Build failed in Jenkins: Nutch-trunk #3576

2018-10-23 Thread Apache Jenkins Server
See Changes: [jorge-luis.betancourt] NUTCH-2661 Move the TestOutlinks class into the o.a.n.parse path -- [...truncated 349.12 KB...] [ivy:resolve] .. (0kB) [ivy:resolve]

[jira] [Commented] (NUTCH-2659) Add missing Apache license headers

2018-10-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661253#comment-16661253 ] ASF GitHub Bot commented on NUTCH-2659: --- jorgelbg commented on issue #396: NUTCH-2659 Add missing

[jira] [Commented] (NUTCH-2659) Add missing Apache license headers

2018-10-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661254#comment-16661254 ] ASF GitHub Bot commented on NUTCH-2659: --- jorgelbg closed pull request #396: NUTCH-2659 Add missing

[jira] [Commented] (NUTCH-2661) Move TestOutlinks to the proper path

2018-10-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661199#comment-16661199 ] ASF GitHub Bot commented on NUTCH-2661: --- jorgelbg closed pull request #399: NUTCH-2661 Move the

[jira] [Updated] (NUTCH-2666) increase default value for http.content.limit

2018-10-23 Thread Marco Ebbinghaus (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Ebbinghaus updated NUTCH-2666: Description: The default value for http.content.limit in nutch-default.xml (The length

[jira] [Updated] (NUTCH-2666) increase default value for http.content.limit

2018-10-23 Thread Marco Ebbinghaus (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Ebbinghaus updated NUTCH-2666: Description: The default value for http.content.limit in nutch-default.xml (The length

[jira] [Created] (NUTCH-2666) increase default value for http.content.limit

2018-10-23 Thread Marco Ebbinghaus (JIRA)
Marco Ebbinghaus created NUTCH-2666: --- Summary: increase default value for http.content.limit Key: NUTCH-2666 URL: https://issues.apache.org/jira/browse/NUTCH-2666 Project: Nutch Issue

[jira] [Commented] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16660625#comment-16660625 ] Markus Jelsma commented on NUTCH-2665: -- I'll commit this one later today, if i don't forget, unless

[jira] [Updated] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2665: - Attachment: NUTCH-2665.patch > Upgrade to Apache Tika 1.19.1 > - > >

[jira] [Commented] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16660525#comment-16660525 ] Markus Jelsma commented on NUTCH-2665: -- Updated patch defining the property in ivysettings.xml. >

[jira] [Commented] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16660518#comment-16660518 ] Sebastian Nagel commented on NUTCH-2665: +1 Thanks, [~markus17]! For 1.x I needed several trials

[jira] [Commented] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16660455#comment-16660455 ] Markus Jelsma commented on NUTCH-2665: -- Patch for 2.x! > Upgrade to Apache Tika 1.19.1 >

[jira] [Updated] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2665: - Attachment: NUTCH-2665.patch > Upgrade to Apache Tika 1.19.1 > - > >

[jira] [Created] (NUTCH-2665) Upgrade to Apache Tika 1.19.1

2018-10-23 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-2665: Summary: Upgrade to Apache Tika 1.19.1 Key: NUTCH-2665 URL: https://issues.apache.org/jira/browse/NUTCH-2665 Project: Nutch Issue Type: Task