Messages by Thread
-
-
[jira] [Resolved] (NUTCH-2831) Elastic indexer does not support SSL
Sebastian Nagel (Jira)
-
[jira] [Closed] (NUTCH-2073) Unable to create index on elasticsearch through nutch
Sebastian Nagel (Jira)
-
[jira] [Resolved] (NUTCH-2806) Nutch can't parse links
Sebastian Nagel (Jira)
-
[jira] [Closed] (NUTCH-2806) Nutch can't parse links
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-1611) Elastic Search Indexer Creates field in elastic search "boost" as a string value, so cannot be used in custom boost queries
Sebastian Nagel (Jira)
-
[jira] [Updated] (NUTCH-1611) Elastic Search Indexer Creates field in elastic search "boost" as a string value, so cannot be used in custom boost queries
Sebastian Nagel (Jira)
-
[jira] [Resolved] (NUTCH-2951) Crawl datum with metadata WRITABLE_GENERATE_TIME_KEY awaits fetching forever
Sebastian Nagel (Jira)
-
[jira] [Updated] (NUTCH-2953) Indexer Elastic to ignore SSL issues
Markus Jelsma (Jira)
-
[jira] [Created] (NUTCH-2953) Indexer Elastic to ignore SSL issues
Markus Jelsma (Jira)
-
[jira] [Commented] (NUTCH-2940) Develop Gradle Core Build for Apache Nutch
Lewis John McGibbney (Jira)
-
[jira] [Commented] (NUTCH-2490) Sitemap processing: Sitemap index files not working
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] lewismc opened a new pull request, #735: Nutch 2940
GitBox
-
[jira] [Assigned] (NUTCH-2940) Develop Gradle Core Build for Apache Nutch
Lewis John McGibbney (Jira)
-
[jira] [Commented] (NUTCH-2952) Upgrade core dependencies (Hadoop 3.3.3, log4j 2.17.2)
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #734: NUTCH-2952 Upgrade core dependencies
GitBox
-
[jira] [Assigned] (NUTCH-2952) Upgrade core dependencies (Hadoop 3.3.3, log4j 2.17.2)
Sebastian Nagel (Jira)
-
[jira] [Created] (NUTCH-2952) Upgrade core dependencies (Hadoop 3.3.3, log4j 2.17.2)
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2949) Tasks of a multi-threaded map runner may fail because of slow creation of URL stream handlers
Sebastian Nagel (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #733: NUTCH-2936 / NUTCH-2949 URLStreamHandler may fail jobs in distributed mode
GitBox
-
[jira] [Commented] (NUTCH-2936) Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode if protocol-okhttp is used
Sebastian Nagel (Jira)
-
[jira] [Updated] (NUTCH-2936) Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode if protocol-okhttp is used
Sebastian Nagel (Jira)
-
[GitHub] [nutch] sebastian-nagel commented on pull request #697: NUTCH-2896 Protocol-okhttp: make connection pool configurable
GitBox
-
[jira] [Assigned] (NUTCH-2951) Crawl datum with metadata WRITABLE_GENERATE_TIME_KEY awaits fetching forever
Sebastian Nagel (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #732: NUTCH-2951 Crawl datum with metadata WRITABLE_GENERATE_TIME_KEY awaits fetching forever
GitBox
-
[jira] [Commented] (NUTCH-2951) Crawl datum with metadata WRITABLE_GENERATE_TIME_KEY awaits fetching forever
Sebastian Nagel (Jira)
-
[jira] [Updated] (NUTCH-2951) Crawl datum with metadata WRITABLE_GENERATE_TIME_KEY awaits fetching forever
Sebastian Nagel (Jira)
-
[jira] [Created] (NUTCH-2951) Crawl datum with metadata WRITABLE_GENERATE_TIME_KEY awaits fetching forever
Lapadula Alessandro (Jira)
-
[GitHub] [nutch-webapp] dependabot[bot] opened a new pull request, #3: Bump spring-core from 4.0.9.RELEASE to 5.2.22.RELEASE
GitBox
-
[jira] [Resolved] (NUTCH-2950) UpdateHostDb: performance improvements
Sebastian Nagel (Jira)
-
[GitHub] [nutch] lewismc commented on pull request #726: NUTCH-2936 Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode
GitBox
-
[GitHub] [nutch] lewismc merged pull request #726: NUTCH-2936 Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode
GitBox
-
[jira] [Commented] (NUTCH-2950) UpdateHostDb: performance improvements
Sebastian Nagel (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #731: NUTCH-2950 UpdateHostDb: performance improvements
GitBox
-
[jira] [Created] (NUTCH-2950) UpdateHostDb: performance improvements
Sebastian Nagel (Jira)
-
[jira] [Resolved] (NUTCH-2946) Fetcher: optionally slow down fetching from hosts with repeated exceptions
Sebastian Nagel (Jira)
-
Final reminder: ApacheCon North America call for presentations closing soon
Rich Bowen
-
[GitHub] [nutch] sebastian-nagel commented on pull request #726: NUTCH-2936 Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode
GitBox
-
[jira] [Created] (NUTCH-2949) Tasks of a multi-threaded map runner may fail because of slow creation of URL stream handlers
Sebastian Nagel (Jira)
-
Build failed in Jenkins: Nutch » Nutch-trunk #73
Apache Jenkins Server
-
[jira] [Resolved] (NUTCH-2948) Upgrade dependencies to Any23 2.7 and Tika 2.3.0
Sebastian Nagel (Jira)
-
[jira] [Assigned] (NUTCH-2948) Upgrade dependencies to Any23 2.7 and Tika 2.3.0
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2948) Upgrade dependencies to Any23 2.7 and Tika 2.3.0
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] sebastian-nagel merged pull request #730: NUTCH-2948 Upgrade dependencies to Any23 2.7 and Tika 2.3.0
GitBox
-
[jira] [Created] (NUTCH-2948) Upgrade dependencies to Any23 2.7 and Tika 2.3.0
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2947) Fetcher: keep state of empty fetch queues unless queue feeder is finished
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #729: NUTCH-2947 Fetcher: keep state of empty fetch queues unless queue feeder is finished
GitBox
-
[GitHub] [nutch] sebastian-nagel opened a new pull request, #728: NUTCH-2946 Fetcher: optionally slow down fetching from hosts with repeated exceptions
GitBox
-
[jira] [Created] (NUTCH-2947) Fetcher: keep state of empty fetch queues unless queue feeder is finished
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2946) Fetcher: optionally slow down fetching from hosts with repeated exceptions
Markus Jelsma (Jira)
-
[jira] [Created] (NUTCH-2946) Fetcher: optionally slow down fetching from hosts with repeated exceptions
Sebastian Nagel (Jira)
-
REMINDER - Travel Assistance available for ApacheCon NA New Orleans 2022
Gavin McDonald
-
[jira] [Commented] (NUTCH-2831) Elastic indexer does not support SSL
Sebastian Nagel (Jira)
-
[jira] [Updated] (NUTCH-2945) Solr Index Writer pluging schema.xml missing a copyToField
Danielle Fisla (Jira)
-
[jira] [Commented] (NUTCH-2945) Solr Index Writer pluging schema.xml missing a copyToField
Danielle Fisla (Jira)
-
[jira] [Created] (NUTCH-2945) Solr Index Writer pluging schema.xml missing a copyToField
Danielle Fisla (Jira)
-
[GitHub] [nutch-webapp] dependabot[bot] opened a new pull request, #2: Bump spring-core from 4.0.9.RELEASE to 5.3.19
GitBox
-
[jira] [Created] (NUTCH-2944) Create Gradle Javadoc task
Lewis John McGibbney (Jira)
-
[jira] [Work started] (NUTCH-2944) Create Gradle Javadoc task
Lewis John McGibbney (Jira)
-
[jira] [Resolved] (NUTCH-2943) Implement core dependencies in build.gradle.kts
Lewis John McGibbney (Jira)
-
[jira] [Commented] (NUTCH-2943) Implement core dependencies in build.gradle.kts
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] lewismc opened a new pull request, #727: NUTCH-2943 Implement core dependencies in build.gradle.kts
GitBox
-
[jira] [Updated] (NUTCH-2943) Implement core dependencies in build.gradle.kts
Lewis John McGibbney (Jira)
-
[jira] [Work started] (NUTCH-2943) Implement core dependencies in build.gradle.kts
Lewis John McGibbney (Jira)
-
[jira] [Assigned] (NUTCH-2943) Implement core dependencies in build.gradle.kts
Lewis John McGibbney (Jira)
-
[jira] [Commented] (NUTCH-2939) Create Initial Jenkinsfile for Nutch Gradle Build
Lewis John McGibbney (Jira)
-
[GitHub] [nutch-webapp] dependabot[bot] opened a new pull request, #1: Bump spring-core from 4.0.9.RELEASE to 5.3.18
GitBox
-
[jira] [Deleted] (NUTCH-2942) it is best form of software
Sebastian Nagel (Jira)
-
[jira] [Created] (NUTCH-2943) Management of dependencies in Build.kts file
Iman Arfa-Zanganeh (Jira)
-
[jira] [Updated] (NUTCH-2942) it is best form of software
jimmy jones (Jira)
-
[jira] [Created] (NUTCH-2942) it is best form of software
jimmy jones (Jira)
-
[jira] [Commented] (NUTCH-2900) Integrate Nutch with Kerberized Solr Cloud
Joe Gilvary (Jira)
-
Call for Presentations now open, ApacheCon North America 2022
Rich Bowen
-
[jira] [Created] (NUTCH-2941) Migrate plugins over to Gradle build system
Ryan Li (Jira)
-
CVE-2022-25312: An XML external entity (XXE) injection vulnerability exists in the Apache Any23 RDFa XSLTStylesheet extractor
lewis john mcgibbney
-
[ANNOUNCE] Apache Any23 2.7
lewis john mcgibbney
-
[jira] [Assigned] (NUTCH-2939) Create Jenkinsfile for Nutch Gradle Build
Lewis John McGibbney (Jira)
-
[jira] [Updated] (NUTCH-2939) Create Initial Jenkinsfile for Nutch Gradle Build
Ryan Li (Jira)
-
[jira] [Created] (NUTCH-2940) Develop Gradle Core Build for Apache Nutch
James Simmons (Jira)
-
[jira] [Created] (NUTCH-2939) Create Jenkinsfile for Nutch Gradle Build
Lewis John McGibbney (Jira)
-
[jira] [Updated] (NUTCH-2934) Replace Apache Ant build system with Gradle
Lewis John McGibbney (Jira)
-
[jira] (NUTCH-122) block numbers need a better random number generator
Chris Lambertus (Jira)
-
[jira] [Resolved] (NUTCH-2923) Add Job Id in Job Failure messages
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-122) block numbers need a better random number generator
pankaj kumar singh (Jira)
-
[jira] [Work started] (NUTCH-2925) Secure the Nutch REST API using Apache Shiro
Lewis John McGibbney (Jira)
-
[jira] [Resolved] (NUTCH-2573) Suspend crawling if robots.txt fails to fetch with 5xx status
Sebastian Nagel (Jira)
-
[jira] [Resolved] (NUTCH-2935) DeduplicationJob: failure on URLs with invalid percent encoding
Sebastian Nagel (Jira)
-
[GitHub] [nutch] lewismc opened a new pull request #726: NUTCH-2936 Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode
GitBox
-
[jira] [Work started] (NUTCH-2936) Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode
Lewis John McGibbney (Jira)
-
[jira] [Assigned] (NUTCH-2936) Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode
Lewis John McGibbney (Jira)
-
[jira] [Commented] (NUTCH-2938) Use Any23's RepositoryWriter to write structured data to Rdf4j repository
ASF GitHub Bot (Jira)
-
[GitHub] [nutch] lewismc opened a new pull request #725: NUTCH-2938 Use Any23's RepositoryWriter to write structured data to Rdf4j repository
GitBox
-
[jira] [Created] (NUTCH-2938) Use Any23's RepositoryWriter to write structured data to Rdf4j repository
Lewis John McGibbney (Jira)
-
[jira] [Resolved] (NUTCH-2919) NUTCH-2919 Upgrade to Tika 2.2.1 and Any23 2.6
Lewis John McGibbney (Jira)
-
[jira] [Commented] (NUTCH-2919) NUTCH-2919 Upgrade to Tika 2.2.1 and Any23 2.6
ASF GitHub Bot (Jira)
-
[jira] [Updated] (NUTCH-2919) NUTCH-2919 Upgrade to Tika 2.2.1 and Any23 2.6
Lewis John McGibbney (Jira)
-
[jira] [Updated] (NUTCH-2936) Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode
Sebastian Nagel (Jira)
-
[jira] [Commented] (NUTCH-2936) Early registration of URL stream handlers provided by plugins may fail Hadoop jobs running in distributed mode
Sebastian Nagel (Jira)