dev
Thread
Date
Earlier messages
Messages by Thread
[jira] [Updated] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3150) Expand Caching Hadoop Counter References
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3146) Add Resource Utilization Metrics for Fetcher
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-1732) IndexerMapReduce to delete explicitly not indexable documents
ASF GitHub Bot (Jira)
[PR] NUTCH-1732: allow deleting non-parsable documents [nutch]
via GitHub
[jira] [Work started] (NUTCH-3146) Add Resource Utilization Metrics for Fetcher
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
[DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Isabelle Giguere
Re: [DISCUSS] Migrate to Java 17
BlackIce
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Edward Capriolo
Re: [DISCUSS] Migrate to Java 17
Joe Gilvary
Re: [DISCUSS] Migrate to Java 17
Lewis John McGibbney
Re: [DISCUSS] Migrate to Java 17
Sebastian Nagel
Re: [DISCUSS] Migrate to Java 17
Lewis John McGibbney
[jira] [Resolved] (NUTCH-3110) Upgrade to Tika 3.2.3
Sebastian Nagel (Jira)
[jira] [Assigned] (NUTCH-3110) Upgrade to Tika 3.2.3
Sebastian Nagel (Jira)
[discuss] rolling nutch 1.22
lewis john mcgibbney
Re: [discuss] rolling nutch 1.22
BlackIce
Re: [discuss] rolling nutch 1.22
Joe Gilvary
Re: [discuss] rolling nutch 1.22
Sebastian Nagel
Re: [discuss] rolling nutch 1.22
Lewis John McGibbney
Re: (nutch) branch master updated: NUTCH-3143 GitHub workflow does not run all unit tests (#889)
Doug Baber via dev
[jira] [Resolved] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-3034) Evolve the legacy Nutch plugin framework to use PF4J
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3034) Evolve the legacy Nutch plugin framework to use PF4J
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3149) Investigate Remote Shuffle Service Integration
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3149) Investigate Remote Shuffle Service Integration (Apache Uniffle / Celeborn) for Shuffle-Intensive Nutch Jobs
Lewis John McGibbney (Jira)
[jira] [Assigned] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)
[PR] NUTCH-2455 Use secondary sorting for memory-efficient HostDb integration in Generator [nutch]
via GitHub
[jira] [Comment Edited] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
ASF GitHub Bot (Jira)
[jira] [Updated] (NUTCH-2455) Use secondary sorting for memory-efficient HostDb integration in Generator
Lewis John McGibbney (Jira)
Build failed in Jenkins: Nutch » Nutch-trunk #214
Apache Jenkins Server
Jenkins build is back to normal : Nutch » Nutch-trunk #215
Apache Jenkins Server
[jira] [Resolved] (NUTCH-3042) Use GitHub cache action to improve CI execution time
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
Hudson (Jira)
[jira] [Commented] (NUTCH-3110) Upgrade to Tika 3.2.3
Tim Allison (Jira)
[PR] NUTCH-3110 Upgrade to Tika 3.2.3 [nutch]
via GitHub
Re: [PR] NUTCH-3110 Upgrade to Tika 3.2.3 [nutch]
via GitHub
[jira] [Updated] (NUTCH-3110) Upgrade to Tika 3.2.3
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3144) URLUtil unit tests fail after upgrade to crawler-commons 1.6
Sebastian Nagel (Jira)
[jira] [Work started] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
Hudson (Jira)
[PR] NUTCH-3148 Cache Ivy dependencies in GitHub CI builds [nutch]
via GitHub
Re: [PR] NUTCH-3148 Cache Ivy dependencies in GitHub CI builds [nutch]
via GitHub
[jira] [Created] (NUTCH-3148) Cache Ivy dependencies in GitHub CI builds
Lewis John McGibbney (Jira)
[PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
[PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
[PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
[PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
Re: [PR] NUTCH-3143 GitHub workflow does not run all unit tests [nutch]
via GitHub
[jira] [Assigned] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3147) Nutch JMX Metrics Evolution with OpenTelemetry
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3145) Upgrade to JUnit 6
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3145) Upgrade to JUnit 6
Sebastian Nagel (Jira)
[PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
Re: [PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
Re: [PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
Re: [PR] NUTCH-3145 Upgrade to JUnit 6 [nutch]
via GitHub
[jira] [Assigned] (NUTCH-3145) Upgrade to JUnit 6
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3146) Add Resource Utilization Metrics for Fetcher
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Sebastian Nagel (Jira)
[jira] [Work stopped] (NUTCH-3064) Upgrade com.maxmind.geoip2:geoip2 dependency in geoip-index to v4.2.0
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3064) Upgrade index-geoip to GeoIP2 5.0.2
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3142) Add Error Context to Metrics
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3142) Add Error Context to Metrics
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3142) Add Error Context to Metrics
Hudson (Jira)
[PR] NUTCH-3142 Add Error Context to Metrics [nutch]
via GitHub
Re: [PR] NUTCH-3142 Add Error Context to Metrics [nutch]
via GitHub
[jira] [Work started] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3145) Upgrade to JUnit 6
Lewis John McGibbney (Jira)
[jira] [Updated] (NUTCH-3145) Upgrade to JUnit 6
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3145) Upgrade to JUnit 6
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3141) Cache Hadoop Counter References in Hot Paths
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
Hudson (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3143) GitHub workflow does not run all unit tests
Hudson (Jira)
[jira] [Commented] (NUTCH-3144) URLUtil unit tests fail after upgrade to crawler-commons 1.6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3144) URLUtil unit tests fail after upgrade to crawler-commons 1.6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3144) URLUtil unit tests fail after upgrade to crawler-commons 1.6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3144) URLUtil unit tests fail after upgrade to crawler-commons 1.6
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3144) URLUtil unit tests fail after upgrade to crawler-commons 1.6
Hudson (Jira)
[PR] NUTCH-3144 URLUtil unit tests fail after upgrade to crawler-commons 1.6 [nutch]
via GitHub
Re: [PR] NUTCH-3144 URLUtil unit tests fail after upgrade to crawler-commons 1.6 [nutch]
via GitHub
Re: [PR] NUTCH-3144 URLUtil unit tests fail after upgrade to crawler-commons 1.6 [nutch]
via GitHub
Re: [PR] NUTCH-3144 URLUtil unit tests fail after upgrade to crawler-commons 1.6 [nutch]
via GitHub
[jira] [Created] (NUTCH-3144) URLUtil unit tests fail after upgrade to crawler-commons 1.6
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3143) GitHub workflow does not run all unit tests
Sebastian Nagel (Jira)
[jira] [Assigned] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
Lewis John McGibbney (Jira)
[PR] [NUTCH-1564] AdaptiveFetchSchedule sync_delta forces refetch of unmodified pages [nutch]
via GitHub
Re: [PR] [NUTCH-1564] AdaptiveFetchSchedule sync_delta forces refetch of unmodified pages [nutch]
via GitHub
Re: [PR] [NUTCH-1564] AdaptiveFetchSchedule sync_delta forces refetch of unmodified pages [nutch]
via GitHub
Re: [PR] [NUTCH-1564] AdaptiveFetchSchedule sync_delta forces refetch of unmodified pages [nutch]
via GitHub
Re: [PR] [NUTCH-1564] AdaptiveFetchSchedule sync_delta forces refetch of unmodified pages [nutch]
via GitHub
Re: [PR] [NUTCH-1564] AdaptiveFetchSchedule sync_delta forces refetch of unmodified pages [nutch]
via GitHub
Re: [PR] [NUTCH-1564] AdaptiveFetchSchedule sync_delta forces refetch of unmodified pages [nutch]
via GitHub
Re: [PR] [NUTCH-1564] AdaptiveFetchSchedule sync_delta forces refetch of unmodified pages [nutch]
via GitHub
Working on NUTCH-1564
Isabelle Giguere
Re: Working on NUTCH-1564
Isabelle Giguere
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
Isabelle Giguere (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified
Hudson (Jira)
[PR] NUTCH-2934 Replace Apache Ant build system with Gradle [nutch]
via GitHub
[jira] [Work stopped] (NUTCH-2944) Create Gradle Javadoc task
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-2939) Create Initial Jenkinsfile for Nutch Gradle Build
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3142) Add Error Context to Metrics
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3141) Cache Hadoop Counter References in Hot Paths
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3141) Cache Hadoop Counter References in Hot Paths
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3141) Cache Hadoop Counter References in Hot Paths
Hudson (Jira)
[PR] NUTCH-3141 Cache Hadoop Counter References in Hot Paths [nutch]
via GitHub
Re: [PR] NUTCH-3141 Cache Hadoop Counter References in Hot Paths [nutch]
via GitHub
[jira] [Created] (NUTCH-3141) Cache Hadoop Counter References in Hot Paths
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3141) Cache Hadoop Counter References in Hot Paths
Lewis John McGibbney (Jira)
[jira] [Work started] (NUTCH-3131) Nutch Metrics Refactoring & Enhancements
Lewis John McGibbney (Jira)
Observability Dashboards for Nutch
lewis john mcgibbney
[jira] [Updated] (NUTCH-3140) Create example Observability Dashboards for Nutch Metrics
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3140) Create example Observability Dashboards for Nutch Metrics
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3140) Create example Observability Dashboards for Nutch Metrics
Lewis John McGibbney (Jira)
[jira] [Created] (NUTCH-3140) Create example Observability Dashboards for Nutch Metrics
Lewis John McGibbney (Jira)
[jira] [Comment Edited] (NUTCH-3140) Create example Observability Dashboards for Nutch Metrics
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3139) protocol-okhttp: add support for zstd content-encoding
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3137) Upgrade Nutch core dependencies
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3136) Upgrade crawler-commons dependency
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3133) Upgrade GitHub workflows to JDK 17
Sebastian Nagel (Jira)
[jira] [Resolved] (NUTCH-3135) Cache downloaded ant-eclipse.jar
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3131) Nutch Metrics Refactoring & Enhancements
Lewis John McGibbney (Jira)
[jira] [Resolved] (NUTCH-3134) Add latency metrics with percentile support to Fetcher, Parser, and Indexer
Lewis John McGibbney (Jira)
[jira] [Work stopped] (NUTCH-3134) Add latency metrics with percentile support to Fetcher, Parser, and Indexer
Lewis John McGibbney (Jira)
[jira] [Commented] (NUTCH-3139) protocol-okhttp: add support for zstd content-encoding
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3139) protocol-okhttp: add support for zstd content-encoding
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3139) protocol-okhttp: add support for zstd content-encoding
Hudson (Jira)
[PR] NUTCH-3139 protocol-okhttp: add support for zstd content-encoding [nutch]
via GitHub
Re: [PR] NUTCH-3139 protocol-okhttp: add support for zstd content-encoding [nutch]
via GitHub
[jira] [Created] (NUTCH-3139) protocol-okhttp: add support for zstd content-encoding
Sebastian Nagel (Jira)
[jira] [Created] (NUTCH-3138) Upgrade to Creadur RAT 0.17
Sebastian Nagel (Jira)
[jira] [Commented] (NUTCH-3134) Add latency metrics with percentile support to Fetcher, Parser, and Indexer
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3134) Add latency metrics with percentile support to Fetcher, Parser, and Indexer
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3134) Add latency metrics with percentile support to Fetcher, Parser, and Indexer
Hudson (Jira)
[PR] NUTCH-3134 Add latency metrics with percentile support to Fetcher, Parser, and Indexer [nutch]
via GitHub
Re: [PR] NUTCH-3134 Add latency metrics with percentile support to Fetcher, Parser, and Indexer [nutch]
via GitHub
[jira] [Commented] (NUTCH-3137) Upgrade Nutch core dependencies
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3137) Upgrade Nutch core dependencies
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3137) Upgrade Nutch core dependencies
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3137) Upgrade Nutch core dependencies
ASF GitHub Bot (Jira)
[jira] [Commented] (NUTCH-3137) Upgrade Nutch core dependencies
Hudson (Jira)
[PR] NUTCH-3137 Upgrade Nutch core dependencies [nutch]
via GitHub
Re: [PR] NUTCH-3137 Upgrade Nutch core dependencies [nutch]
via GitHub
Re: [PR] NUTCH-3137 Upgrade Nutch core dependencies [nutch]
via GitHub
Re: [PR] NUTCH-3137 Upgrade Nutch core dependencies [nutch]
via GitHub
[jira] [Created] (NUTCH-3137) Upgrade Nutch core dependencies
Sebastian Nagel (Jira)
[PR] NUTCH-3136 Upgrade crawler-commons dependency [nutch]
via GitHub
Earlier messages