kacpermuda commented on PR #44477:
URL: https://github.com/apache/airflow/pull/44477#issuecomment-2517427061

   Hey @ahidalgob, just to confirm I understand you correctly: are you asking 
if we plan to emit the lineage from the child job (in this case, Spark) 
directly from Airflow? As of now, there aren’t any plans for that that I'm 
aware of. In my opinion, it’s a bit more complex to implement compared to a 
SQL-based approach, where we can parse the SQL on the Airflow side and 
occasionally patch it with API calls to BigQuery or similar solutions. 
Extracting lineage from a Spark jar, which can do virtually anything, is more 
challenging. For now, I’m focusing on making it easier for users to configure 
Spark integration, without changing the entity responsible for emitting the 
events.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to