Messages by Thread
-
[jira] [Resolved] (NUTCH-2998) Remove the Any23 plugin
Tim Allison (Jira)
-
[jira] [Resolved] (NUTCH-3000) protocol-selenium returns only the body,strips off the <head/> element
Tim Allison (Jira)
-
[jira] [Resolved] (NUTCH-3001) protocol-selenium requires Content-Type header
Tim Allison (Jira)
-
[GitHub] [nutch] tballison opened a new pull request, #775: Remove Any23 from Nutch
via GitHub
-
[DISCUSS] Removing Any23 from Nutch?
Tim Allison
-
[GitHub] [nutch] tballison opened a new pull request, #774: NUTCH-3001 - fix logic for grabbing bytes if there's no content type …
via GitHub
-
[GitHub] [nutch] tballison opened a new pull request, #773: NUTCH-3000 - the selenium protocol should return the full html, not just the inner body
via GitHub
-
[jira] [Commented] (NUTCH-3001) protocol-selenium requires Content-Type header
Tim Allison (Jira)
-
[jira] [Updated] (NUTCH-3001) protocol-selenium requires Content-Type header
Tim Allison (Jira)
-
[jira] [Created] (NUTCH-3001) protocol-selenium requires Content-Type header
Tim Allison (Jira)
-
[jira] [Commented] (NUTCH-3000) protocol-selenium returns only the body,strips off the <head/> element
Sebastian Nagel (Jira)
-
[jira] [Created] (NUTCH-3000) protocol-selenium returns only the body,strips off the <head/> element
Tim Allison (Jira)
-
[jira] [Comment Edited] (NUTCH-2998) Remove the Any23 plugin
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2998) Remove the Any23 plugin
Tim Allison (Jira)
-
[GitHub] [nutch] tballison opened a new pull request, #772: NUTCH-2978 -- upgrade to log4j2 throughout
via GitHub
-
Build failed in Jenkins: Nutch » Nutch-trunk #108
Apache Jenkins Server
-
[GitHub] [nutch] tballison opened a new pull request, #771: NUTCH-2999 fix for initial PR
via GitHub
-
[jira] [Reopened] (NUTCH-2999) Update Lucene version to latest 8.x
Tim Allison (Jira)
-
[jira] [Resolved] (NUTCH-2961) Upgrade dependencies of parsefilter-naivebayes
Tim Allison (Jira)
-
[jira] [Resolved] (NUTCH-2999) Update Lucene version to latest 8.x
Tim Allison (Jira)
-
[GitHub] [nutch] tballison opened a new pull request, #770: NUTCH-2999 Upgrade Lucene to latest 8.x version throughout
via GitHub
-
[jira] [Commented] (NUTCH-2999) Update Lucene version to latest 8.x
Tim Allison (Jira)
-
[jira] [Created] (NUTCH-2999) Update Lucene version to latest 8.x
Tim Allison (Jira)
-
[jira] [Commented] (NUTCH-2961) Upgrade dependencies of parsefilter-naivebayes
Tim Allison (Jira)
-
[GitHub] [nutch] tballison opened a new pull request, #769: NUTCH-2978 -- move to log4j2 logging throughout
via GitHub
-
[jira] [Created] (NUTCH-2998) Remove the Any23 plugin
Tim Allison (Jira)
-
[jira] [Resolved] (NUTCH-2989) Can't have username/pw AND https on elastic-indexer?!
Tim Allison (Jira)
-
[jira] [Commented] (NUTCH-2989) Can't have username/pw AND https on elastic-indexer?!
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] tballison opened a new pull request, #768: NUTCH-2989 -- enable auth in ElasticIndexWriter for https
via GitHub
-
[jira] [Assigned] (NUTCH-2989) Can't have username/pw AND https on elastic-indexer?!
Tim Allison (Jira)
-
Build failed in Jenkins: Nutch » Nutch-trunk #103
Apache Jenkins Server
-
[jira] [Resolved] (NUTCH-2997) Add Override annotations where applicable
Sebastian Nagel (Jira)
-
[jira] [Assigned] (NUTCH-2997) Add Override annotations where applicable
Sebastian Nagel (Jira)
-
[jira] [Resolved] (NUTCH-2996) Use new SimpleRobotRulesParser API entry point (crawler-commons 1.4)
Sebastian Nagel (Jira)
-
[jira] [Resolved] (NUTCH-2995) Upgrade to crawler-commons 1.4
Sebastian Nagel (Jira)
-
[jira] [Resolved] (NUTCH-2993) ScoringDepth plugin to skip depth check based on URL Pattern
Sebastian Nagel (Jira)
-
[GitHub] [nutch] sebastian-nagel merged pull request #764: NUTCH-2993 ScoringDepth plugin to skip depth check based on URL Pattern
via GitHub
-
Mailing list threading improvements
Christofer Dutz
-
[jira] [Commented] (NUTCH-2997) Add Override annotations where applicable
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #767: NUTCH-2997 Add Override annotations
via GitHub
-
[jira] [Created] (NUTCH-2997) Add Override annotations where applicable
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2996) Use new SimpleRobotRulesParser API entry point (crawler-commons 1.4)
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #766: NUTCH-2996 Use new SimpleRobotRulesParser API entry point crawler-commons 1.4
via GitHub
-
[jira] [Commented] (NUTCH-2995) Upgrade to crawler-commons 1.4
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #765: NUTCH-2995 Upgrade to crawler-commons 1.4
via GitHub
-
[jira] [Created] (NUTCH-2996) Use new SimpleRobotRulesParser API entry point (crawler-commons 1.4)
Sebastian Nagel (Jira)
-
[jira] [Assigned] (NUTCH-2996) Use new SimpleRobotRulesParser API entry point (crawler-commons 1.4)
Sebastian Nagel (Jira)
-
[jira] [Assigned] (NUTCH-2995) Upgrade to crawler-commons 1.4
Sebastian Nagel (Jira)
-
[jira] [Created] (NUTCH-2995) Upgrade to crawler-commons 1.4
Sebastian Nagel (Jira)
-
[ANNOUNCE] New Nutch committer and PMC - Tim Allison
Sebastian Nagel
-
[jira] [Updated] (NUTCH-2989) Can't have username/pw AND https on elastic-indexer?!
Sebastian Nagel (Jira)
-
[GitHub] [nutch-webapp] dependabot[bot] opened a new pull request, #9: Bump h2 from 1.4.197 to 2.2.220
via GitHub
-
Final Reminder: Community Over Code call for presentations closing soon
Rich Bowen
-
[jira] [Commented] (NUTCH-2993) ScoringDepth plugin to skip depth check based on URL Pattern
Sebastian Nagel (Jira)
-
TAC Applications for Community Over Code North America and Asia now open
Gavin McDonald
-
[GitHub] [nutch-webapp] dependabot[bot] opened a new pull request, #8: Bump guava from 30.1.1-jre to 32.0.0-jre
via GitHub
-
[jira] [Updated] (NUTCH-2994) Implement an indexer for OpenSearch 2.x
Tim Allison (Jira)
-
[jira] [Created] (NUTCH-2994) Implement an indexer for OpenSearch 2.x
Tim Allison (Jira)
-
[jira] [Updated] (NUTCH-2993) ScoringDepth plugin to skip depth check based on URL Pattern
Markus Jelsma (Jira)
-
[jira] [Commented] (NUTCH-2993) ScoringDepth plugin to override maxDepth based on URL Pattern
Markus Jelsma (Jira)
-
[jira] [Updated] (NUTCH-2993) ScoringDepth plugin to override maxDepth based on URL Pattern
Markus Jelsma (Jira)
-
[jira] [Created] (NUTCH-2993) ScoringDepth plugin to override maxDepth based on URL Pattern
Markus Jelsma (Jira)
-
Build failed in Jenkins: Nutch » Nutch-trunk #100
Apache Jenkins Server
-
[jira] [Resolved] (NUTCH-2991) Support HTTP/S Header Authorization for Solr connections
Sebastian Nagel (Jira)
-
Call for Presentations, Community Over Code Asia 2023
Rich Bowen
-
[GitHub] [nutch] lewismc closed pull request #97: NUTCH-2202 Integration of Anthelion (Focused Crawling Module) into Nutch
via GitHub
-
[GitHub] [nutch] lewismc commented on pull request #97: NUTCH-2202 Integration of Anthelion (Focused Crawling Module) into Nutch
via GitHub
-
[GitHub] [nutch] lewismc closed pull request #725: NUTCH-2938 Use Any23's RepositoryWriter to write structured data to Rdf4j repository
via GitHub
-
[GitHub] [nutch] lewismc commented on pull request #725: NUTCH-2938 Use Any23's RepositoryWriter to write structured data to Rdf4j repository
via GitHub
-
Build failed in Jenkins: Nutch » Nutch-trunk #98
Apache Jenkins Server
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #763: NUTCH-2991 Support HTTP/S Header Authorization for Solr connections
via GitHub
-
[jira] [Resolved] (NUTCH-2992) Fetcher: always block fetch queues when exceptions threshold is reached
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2992) Fetcher: always block fetch queues when exceptions threshold is reached
ASF GitHub Bot (Jira)
-
[jira] [Assigned] (NUTCH-2990) HttpRobotRulesParser to follow 5 redirects as specified by RFC 9309
Sebastian Nagel (Jira)
-
[jira] [Assigned] (NUTCH-2992) Fetcher: always block fetch queues when exceptions threshold is reached
Sebastian Nagel (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #762: NUTCH-2992 Fetcher: always block fetch queues when exceptions threshold is reached
via GitHub
-
[jira] [Created] (NUTCH-2992) Fetcher: always block fetch queues when exceptions threshold is reached
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2991) Support HTTP/S Header Authorization for Solr connections
Sebastian Nagel (Jira)
-
[jira] [Updated] (NUTCH-2991) Support HTTP/S Header Authorization for Solr connections
Sebastian Nagel (Jira)
-
[jira] [Created] (NUTCH-2991) Support HTTP/S Header Authorization for Solr connections
Marcos Gomez (Jira)
-
Call for Presentations, Community Over Code 2023
Rich Bowen
-
[GitHub] [nutch-webapp] dependabot[bot] opened a new pull request, #7: Bump spring-core from 4.0.9.RELEASE to 5.2.24.RELEASE
via GitHub
-
A Message from the Board to PMC members
Rich Bowen
-
[GitHub] [nutch-webapp] dependabot[bot] opened a new pull request, #6: Bump spring-core from 4.0.9.RELEASE to 5.2.23.RELEASE
via GitHub
-
[jira] [Resolved] (NUTCH-2596) Upgrade from org.mortbay.jetty to org.eclipse.jetty
Sebastian Nagel (Jira)
-
[jira] [Created] (NUTCH-2990) HttpRobotRulesParser to follow 5 redirects as specified by RFC 9309
Sebastian Nagel (Jira)
-
[jira] [Resolved] (NUTCH-2984) Drop test proxy server and benchmark tool
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2984) Drop test proxy server and benchmark tool
ASF GitHub Bot (Jira)