rahul-madaan commented on code in PR #40290:
URL: https://github.com/apache/airflow/pull/40290#discussion_r1645020677


##########
airflow/providers/google/cloud/transfers/azure_blob_to_gcs.py:
##########
@@ -122,3 +122,21 @@ def execute(self, context: Context) -> str:
                 self.bucket_name,
             )
         return f"gs://{self.bucket_name}/{self.object_name}"
+
+    def get_openlineage_facets_on_start(self):
+        from openlineage.client.run import Dataset
+        from airflow.providers.openlineage.extractors import OperatorLineage
+        wasb_hook = WasbHook(wasb_conn_id=self.wasb_conn_id)
+        connection = wasb_hook.get_connection(wasb_hook.conn_id)

Review Comment:
   1. You are correct: `get_conn` is a better way to get the account name 
than reading it from the config. I have updated the code and pushed the changes. 
Thanks.
   2. Based on the naming convention 
[doc](https://github.com/OpenLineage/OpenLineage/blob/main/spec/Naming.md#wasbs-azure-blob-storage),
 the part after the `@` is `service_name`, which here means the storage `account_name`.
   Account names are globally unique in Azure 
([ref](https://learn.microsoft.com/en-us/azure/storage/common/storage-account-overview#storage-account-name)).
   
   > Is account enough to uniquely identify this container?
   
   Yes. Account names are globally unique, so the account name together with the container name identifies the container.
   
   tagging @Adaverse (Author of AzureBlobStorageToGCSOperator) for context
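   
   For illustration, here is a minimal sketch of how the dataset identifier could be assembled from the account name and container. The helper name `wasbs_dataset_identifier` is hypothetical and not part of the PR, and whether a host suffix follows the account name should be checked against the naming doc linked above. In the operator, the account name would presumably be read from the client returned by `wasb_hook.get_conn()`; that is an assumption here, so the sketch takes it as a plain argument.
   
   ```python
   def wasbs_dataset_identifier(
       account_name: str, container_name: str, blob_name: str
   ) -> tuple[str, str]:
       """Return a (namespace, name) pair for an Azure blob.

       Hypothetical helper: the container comes before the `@` and the
       storage account (the `service_name`) comes after it.
       """
       # Assumption: no host suffix is appended after the account name.
       namespace = f"wasbs://{container_name}@{account_name}"
       # Drop a leading slash so the name is relative to the container.
       name = blob_name.lstrip("/")
       return namespace, name


   namespace, name = wasbs_dataset_identifier("myaccount", "mycontainer", "/dir/file.csv")
   print(namespace, name)
   ```
   
   Because the account name is globally unique, two containers with the same name in different accounts still get different namespaces.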
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.