Lewis John McGibbney created NUTCH-3147:
-------------------------------------------

             Summary: Nutch JMX Metrics Evolution with OpenTelemetry
                 Key: NUTCH-3147
                 URL: https://issues.apache.org/jira/browse/NUTCH-3147
             Project: Nutch
          Issue Type: Sub-task
          Components: metrics
            Reporter: Lewis John McGibbney
             Fix For: 1.22


This task will evolve the Nutch metrics system to provide real-time JMX 
observability using OpenTelemetry, supporting both NutchServer mode and 
MapReduce job execution while maintaining backward compatibility with existing 
Hadoop counters. This would dramatically improve the observability experience 
for Nutch administrators. 

Currently, Nutch metrics are only accessible via Hadoop MapReduce counters 
after job completion. The JMX enhancement will add real-time observability 
through:
 * JMX MBeans for live monitoring
 * Prometheus endpoint for scraping (if/when running NutchServer)
 * OTLP export for OpenTelemetry collectors

I propose we instrument Nutch using the OpenTelemetry Java SDK. This will be a 
significant undertaking and should likely be implemented after everything else 
in NUTCH-3131.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to