dev
Messages by Thread
[jira] [Commented] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files)
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files)
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files)
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files)
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files)
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files)
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files)
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1446) Port NUTCH-1444 to trunk (Indexing should not create temporary files)
ASF GitHub Bot (Jira)
[PR] NUTCH-1446 Port NUTCH-1444 to trunk (Indexing should not create tempo… [nutch]
via GitHub
Re: [PR] NUTCH-1446 Port NUTCH-1444 to trunk (Indexing should not create tempo… [nutch]
via GitHub
Re: [PR] NUTCH-1446 Port NUTCH-1444 to trunk (Indexing should not create tempo… [nutch]
via GitHub
Re: [PR] NUTCH-1446 Port NUTCH-1444 to trunk (Indexing should not create tempo… [nutch]
via GitHub
Re: [PR] NUTCH-1446 Port NUTCH-1444 to trunk (Indexing should not create tempo… [nutch]
via GitHub
Re: [PR] NUTCH-1446 Port NUTCH-1444 to trunk (Indexing should not create tempo… [nutch]
via GitHub
Re: [PR] NUTCH-1446 Port NUTCH-1444 to trunk (Indexing should not create tempo… [nutch]
via GitHub
[jira] [Assigned] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3151) Dynamic Counter Management
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3155) Add ErrorTracker to remaining MapReduce jobs missing error metrics
Lewis John McGibbney (Jira)
[jira] [Comment Edited] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Lewis John McGibbney (Jira)
[jira] [Comment Edited] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3162) Latency metrics to properly merge data from all threads and tasks
Sebastian Nagel (Jira)
Integrating Nutch into the Bigtop Ecosystem
lewis john mcgibbney
Re: Integrating Nutch into the Bigtop Ecosystem
Sebastian Nagel
Re: Integrating Nutch into the Bigtop Ecosystem
lewis john mcgibbney
[jira] [Resolved] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3161) Address Sonarcloud High and Medium Security Hotspots
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3161) Address Sonarcloud High and Medium Security Hotspots
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3161) Address Sonarcloud High Security Hotspots
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3161) Address Sonarcloud High Security Hotspots
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Luca Foppiano (Jira)
[jira] [Commented] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Hudson (Jira)
[jira] [Updated] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Luca Foppiano (Jira)
[jira] [Updated] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3160) Leftover System.exit(..) in CommonCrawlDataDumper
Luca Foppiano (Jira)
[jira] [Created] (NUTCH-3159) Review and Address Sonarcloud Security Hotspots
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
Lewis John McGibbney (Jira)
Adding Committers & PMC to Sonarcloud Analysis Admin Roster
lewis john mcgibbney
[jira] [Created] (NUTCH-3158) Add GitHub Actions workflow to trigger Jenkins smoke test via PR comment
Lewis John McGibbney (Jira)
[PR] NUTCH-3085 [nutch]
via GitHub
Re: [PR] NUTCH-3085 [nutch]
via GitHub
[PR] Bump SonarSource/sonarqube-scan-action from 5 to 6 in /.github/workflows [nutch]
via GitHub
Re: [PR] Bump SonarSource/sonarqube-scan-action from 5 to 6 in /.github/workflows [nutch]
via GitHub
[PR] NUTCH-3085 Augment CI by adding code coverage and code quality reporting [nutch]
via GitHub
Re: [PR] NUTCH-3085 Augment CI by adding code coverage and code quality reporting [nutch]
via GitHub
[PR] NUTCH-3085 Augment CI by adding code coverage and code quality reporting [nutch]
via GitHub
Re: [PR] NUTCH-3085 Augment CI by adding code coverage and code quality reporting [nutch]
via GitHub
[PR] NUTCH-3085 Augment CI by adding code coverage and code quality reporting [nutch]
via GitHub
Re: [PR] NUTCH-3085 Augment CI by adding code coverage and code quality reporting [nutch]
via GitHub
[PR] NUTCH-3085 Augment CI by adding code coverage and code quality reporting [nutch]
via GitHub
Re: [PR] NUTCH-3085 Augment CI by adding code coverage and code quality reporting [nutch]
via GitHub
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
Hudson (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
Hudson (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3085) Augment CI by adding code coverage and code quality reporting
Hudson (Jira)
[jira] [Created] (NUTCH-3157) Add indexer plugin integration testing guidance to the Nutch wiki
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3156) Deprecate end of life or old Nutch indexer plugins
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3053) Upgrade build and CI to JDK17
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-2987) Upgrade to Java 17
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3145) Upgrade to JUnit 6
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3155) Add ErrorTracker to remaining MapReduce jobs missing error metrics
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3155) Add ErrorTracker to remaining MapReduce jobs missing error metrics
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3155) Missing ErrorTracker in CrawlDbFilter, DeduplicationJob, WebGraph and inconsistent initialization in FetcherThread
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3155) Missing ErrorTracker in CrawlDbFilter, DeduplicationJob, WebGraph and inconsistent initialization in FetcherThread
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Sebastian Nagel (Jira)
[ANNOUNCE] Apache Nutch 1.22 Release
Sebastian Nagel
[RESULT] was [VOTE] Release Apache Nutch 1.22 RC#1
Sebastian Nagel
[jira] [Commented] (NUTCH-2931) Improvements to 1.x REST API
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-2931) Improvements to 1.x REST API
ASF GitHub Bot (Jira)
[PR] NUTCH-2931 Create OpenAPI specification for Nutch 1.x REST API [nutch]
via GitHub
Re: [PR] NUTCH-2931 Create OpenAPI specification for Nutch 1.x REST API [nutch]
via GitHub
[jira] [Assigned] (NUTCH-2932) Create OpenAPI specification for Nutch 1.x REST API
Lewis John McGibbney (Jira)
[PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
Re: [PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
Re: [PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
Re: [PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
Re: [PR] NUTCH-3154 Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers [nutch]
via GitHub
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Hudson (Jira)
[jira] [Work started] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[DISCUSS] Future of the Nutch REST API
lewis john mcgibbney
Re: [DISCUSS] Future of the Nutch REST API
Lewis John McGibbney
Re: [DISCUSS] Future of the Nutch REST API
Sebastian Nagel
Re: [DISCUSS] Future of the Nutch REST API
Isabelle Giguere
Re: [DISCUSS] Future of the Nutch REST API
Lewis John McGibbney
Re: [DISCUSS] Future of the Nutch REST API
Lewis John McGibbney
Re: [DISCUSS] Future of the Nutch REST API
BlackIce
Re: [DISCUSS] Future of the Nutch REST API
Joe Gilvary
Re: [DISCUSS] Future of the Nutch REST API
Isabelle Giguere
[VOTE] Release Apache Nutch 1.22 RC#1
Sebastian Nagel
[jira] [Resolved] (NUTCH-3153) Update of license and notice files
Sebastian Nagel (Jira)
[jira] [Updated] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3154) Implement integration testing framework for Nutch IndexWriter plugins using Testcontainers
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-3003) Consider integration testing in a Dockerized mini-hadoop cluster via testcontainers?
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3153) Update of license and notice files
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3153) Update of license and notice files
ASF GitHub Bot (Jira)
[PR] NUTCH-3153 Update of license and notice files [nutch]
via GitHub
Re: [PR] NUTCH-3153 Update of license and notice files [nutch]
via GitHub
[jira] [Created] (NUTCH-3153) Update of license and notice files
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3120) Automatically increase crawl-delay on HTTP 429
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3127) Deprecate or remove DmozParser
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3152) Job counters getGroup to use metrics constants
Sebastian Nagel (Jira)
Build failed in Jenkins: Nutch » Nutch-trunk #219
Apache Jenkins Server
Jenkins build is back to normal : Nutch » Nutch-trunk #220
Apache Jenkins Server
[jira] [Resolved] (NUTCH-2793) CSV indexer does not work in distributed mode
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3151) Dynamic Counter Management
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3152) Job counters getGroup to use metrics constants
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3152) Job counters getGroup to use metrics constants
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3152) Job counters getGroup to use metrics constants
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3152) Job counters getGroup to use metrics constants
Hudson (Jira)
[PR] NUTCH-3152 Job counters getGroup to use metrics constants [nutch]
via GitHub
Re: [PR] NUTCH-3152 Job counters getGroup to use metrics constants [nutch]
via GitHub
Re: [PR] NUTCH-3152 Job counters getGroup to use metrics constants [nutch]
via GitHub
[jira] [Created] (NUTCH-3152) Job counters getGroup to use metrics constants
Sebastian Nagel (Jira)
Re: [PR] NUTCH-2793 indexer-csv: make it work in distributed mode [nutch]
via GitHub
Re: [PR] NUTCH-2793 indexer-csv: make it work in distributed mode [nutch]
via GitHub
Re: [PR] NUTCH-2793 indexer-csv: make it work in distributed mode [nutch]
via GitHub
Re: [PR] fix for NUTCH-2455 more efficient usage of hostdb in generate [nutch]
via GitHub
Re: [PR] fix for NUTCH-2455 more efficient usage of hostdb in generate [nutch]
via GitHub
[jira] [Created] (NUTCH-3151) Dynamic Counter Management
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
Hudson (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[PR] NUTCH-3150 Expand Caching Hadoop Counter References [nutch]
via GitHub
Re: [PR] NUTCH-3150 Expand Caching Hadoop Counter References [nutch]
via GitHub
Re: [PR] NUTCH-3150 Expand Caching Hadoop Counter References [nutch]
via GitHub
[jira] [Updated] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3146) Add Resource Utilization Metrics for Fetcher
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
Re: [PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
Re: [PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
Re: [PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
Re: [PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
[jira] [Work started] (NUTCH-3146) Add Resource Utilization Metrics for Fetcher
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
[DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Isabelle Giguere
Re: [DISCUSS] Migrate to Java 17
BlackIce
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Edward Capriolo
Re: [DISCUSS] Migrate to Java 17
Joe Gilvary
Re: [DISCUSS] Migrate to Java 17
Lewis John McGibbney
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Lewis John McGibbney
Re: [DISCUSS] Migrate to Java 17
Joe Gilvary
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
[jira] [Resolved] (NUTCH-3110) Upgrade to Tika 3.2.3
Sebastian Nagel (Jira)
[jira] [Assigned] (NUTCH-3110) Upgrade to Tika 3.2.3
Sebastian Nagel (Jira)
[discuss] rolling nutch 1.22
lewis john mcgibbney
Re: [discuss] rolling nutch 1.22
BlackIce
Re: [discuss] rolling nutch 1.22
Joe Gilvary
Re: [discuss] rolling nutch 1.22
Sebastian Nagel
Re: [discuss] rolling nutch 1.22
Lewis John McGibbney
Re: [discuss] rolling nutch 1.22
Sebastian Nagel
Re: (nutch) branch master updated: NUTCH-3143 GitHub workflow does not run all unit tests (#889)
Doug Baber via dev
[jira] [Resolved] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-3034) Evolve the legacy Nutch plugin framework to use PF4J
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3034) Evolve the legacy Nutch plugin framework to use PF4J
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3149) Investigate Remote Shuffle Service Integration
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3149) Investigate Remote Shuffle Service Integration (Apache Uniffle / Celeborn) for Shuffle-Intensive Nutch Jobs
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)