o-nikolas commented on code in PR #61079:
URL: https://github.com/apache/airflow/pull/61079#discussion_r2742928431


##########
airflow-core/src/airflow/dag_processing/manager.py:
##########
@@ -1195,10 +1195,18 @@ def process_parse_results(
             run_count=run_count + 1,
         )
 
-    # TODO: AIP-66 emit metrics
-    # file_name = Path(dag_file.path).stem
-    # Stats.timing(f"dag_processing.last_duration.{file_name}", 
stat.last_duration)
-    # Stats.timing("dag_processing.last_duration", stat.last_duration, 
tags={"file_name": file_name})
+    # Note: relative_fileloc has a None default. In practice it is always 
provided but code defensively here in case
+    if relative_fileloc is not None and stat.last_duration is not None:
+        # Normalize names to ensure they only contain valid characters for 
stats (alphanumeric, underscore, dot, dash)
+        file_name = normalize_name_for_stats(Path(relative_fileloc).stem)
+        # bundle_name is included to distinguish files with the same name 
across different bundles
+        normalized_bundle = normalize_name_for_stats(bundle_name)
+        
Stats.timing(f"dag_processing.last_duration.{normalized_bundle}.{file_name}", 
stat.last_duration)
+        Stats.timing(
+            "dag_processing.last_duration",
+            stat.last_duration,
+            tags={"file_name": file_name, "bundle_name": normalized_bundle},

Review Comment:
   Hmm, I'm not sure either. It seemed the most consistent to provide the same 
sanitized filename to each stat backend. I think statsd is a subset of what 
otel provides, so it should be safe and then the experience will be the same 
for users. Is there are downside I'm not seeing that's on your radar here?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to