[PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-02 Thread via GitHub
he-gong-sardine opened a new pull request, #36545: URL: https://github.com/apache/airflow/pull/36545 closes: #34995 --- ### Overview: The current implementation in Airflow allows file transfer from SFTP to GCS by first downloading the file to the worker's local storage, the

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-02 Thread via GitHub
boring-cyborg[bot] commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1874725064 Congratulations on your first Pull Request and welcome to the Apache Airflow community! If you have any issues or are unsure about any anything please check our Contribution

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-03 Thread via GitHub
Lee-W commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1441262932 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -85,13 +106,18 @@ def __init__( source_path: str, destination_bucket: str,

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-03 Thread via GitHub
dirrao commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1876651595 > * High Memory (Over 100MB): It is recommended to use Google's upload_from_file method, which is more robust and includes a retry feature. This method is preferable when sufficient memo

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-07 Thread via GitHub
elaye-canopy commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1443975480 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %s",

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-07 Thread via GitHub
elaye-canopy commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1443975333 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %s",

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-07 Thread via GitHub
elaye-canopy commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1443975333 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %s",

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-09 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1445988080 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -85,13 +106,18 @@ def __init__( source_path: str, destination_bucket: str,

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-09 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1445989162 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-09 Thread via GitHub
elaye-canopy commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1446051467 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %s",

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-23 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1500592065 ## tests/providers/google/cloud/transfers/test_sftp_to_gcs.py: ## @@ -252,3 +254,60 @@ def test_execute_more_than_one_wildcard_exception(self, sftp_hook, gcs

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-23 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1500610358 ## licenses/LICENSES-ui.txt: ## Review Comment: thanks. I don't know how this file was generated. removed. -- This is an automated message from the

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-23 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1500610842 ## docs/apache-airflow-providers-google/operators/transfer/sftp_to_gcs.rst: ## @@ -93,6 +93,33 @@ and ``tests_sftp_hook_dir/subdir/parent-2.bin`` is copied to

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-23 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1500613391 ## docs/apache-airflow-providers-google/operators/transfer/sftp_to_gcs.rst: ## @@ -93,6 +93,33 @@ and ``tests_sftp_hook_dir/subdir/parent-2.bin`` is copied to

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-23 Thread via GitHub
he-gong-sardine commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1961757630 Thank you guys. updated. @potiuk @Lee-W @elaye-canopy -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use t

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-23 Thread via GitHub
eladkal commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1961856408 doc build is failing on spell check -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the s

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-20 Thread via GitHub
eladkal commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1901997539 > Documentation updates are pending. I am currently seeking guidance on locating the source files for documentation to ensure all necessary updates are made. Assuming you want to

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-20 Thread via GitHub
elaye-canopy commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1460688387 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -160,10 +160,8 @@ def execute(self, context: Context): for file in files:

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-28 Thread via GitHub
he-gong-sardine commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1913579619 > > Documentation updates are pending. I am currently seeking guidance on locating the source files for documentation to ensure all necessary updates are made. > > Assumi

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-28 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1468859918 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-28 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1468859918 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-28 Thread via GitHub
he-gong-sardine commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1913606077 > > * High Memory (Over 100MB): It is recommended to use Google's upload_from_file method, which is more robust and includes a retry feature. This method is preferable when suff

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-28 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1468866487 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -160,10 +160,8 @@ def execute(self, context: Context): for file in files:

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-28 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1468869304 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-01-28 Thread via GitHub
he-gong-sardine commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1468869381 ## airflow/providers/google/cloud/transfers/sftp_to_gcs.py: ## @@ -168,6 +204,68 @@ def _copy_single_object( self.log.info("Executing delete of %

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-02 Thread via GitHub
potiuk commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1923769973 You will need to rebase to resolve conflicts. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-05 Thread via GitHub
he-gong-sardine commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1928712406 > You will need to rebase to resolve conflicts. Thanks! Resolved conflicts. -- This is an automated message from the Apache Git Service. To respond to the message, plea

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-06 Thread via GitHub
potiuk commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-1929268620 and docs/static checks now. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-08 Thread via GitHub
Lee-W commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1483118035 ## airflow/providers/sftp/provider.yaml: ## @@ -60,7 +60,7 @@ versions: dependencies: - apache-airflow>=2.6.0 - apache-airflow-providers-ssh>=2.1.0 - - paramiko

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-02-15 Thread via GitHub
elaye-canopy commented on code in PR #36545: URL: https://github.com/apache/airflow/pull/36545#discussion_r1490760502 ## airflow/providers/sftp/provider.yaml: ## @@ -60,7 +60,7 @@ versions: dependencies: - apache-airflow>=2.6.0 - apache-airflow-providers-ssh>=2.1.0 - - p

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-05-22 Thread via GitHub
github-actions[bot] commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-2126272202 This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 5 days if no further activity occurs. Thank you for you

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-05-28 Thread via GitHub
elaye-canopy commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-2135038607 @Lee-W What is still missing? I am also using this feature and have contributed some of the code to @he-gong-work Maybe I could take over and finish it if they are busy. -- T

Re: [PR] feat: Stream Transfer Enhancement for Airflow SFTP to GCS Operator [airflow]

2024-05-28 Thread via GitHub
Lee-W commented on PR #36545: URL: https://github.com/apache/airflow/pull/36545#issuecomment-2135411519 @elaye-canopy Yep, that would be awesome! I think the major issue we have now is spelling checks and conflicts. If my memory serves me right, other parts seem to be good. But I can take a