Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-24 Thread via GitHub


gopidesupavan commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2058202359


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   Sure Noted.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-24 Thread via GitHub


potiuk commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2058204641


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   We are not THERE yet - but we definitely plan to make step-by-step docs on 
how to publish - we also have few more improvements to our doc publishing 
process that should make it even easier and faster.
   
   This is part of https://github.com/apache/airflow/issues/44373 and this is 
but a part of https://github.com/apache/airflow/issues/45641 :)



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-24 Thread via GitHub


kaxil commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2058004068


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   Cool, thanks both



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-24 Thread via GitHub


kaxil commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2058004934


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   Worth adding some docs on https://github.com/apache/airflow-site when we get 
to the .htaccess part



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-24 Thread via GitHub


potiuk merged PR #47814:
URL: https://github.com/apache/airflow/pull/47814


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


potiuk commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2825464767

   Some conflicts to resolve.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


potiuk commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2056817728


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   Yes. The idea here is that:
   
   * we build provider (and airflow docs) locally
   * we push  them to s3 (only latest versions - all the rest is already in s3) 
   * the change we push to "airflow-site" are just rewrite rules for the new 
versions in s3 + code to populate versions in the dropdwn



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


gopidesupavan commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2056788969


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   >Or is this for the docs on main?
   
   This docs publishing for released artifacts. it requires input of tag or 
branch then it generates docs and publishes to s3.
   
   >or is this for archival of the old docs?
   
   Archival we can add separate process inside aws, once we start serving docs 
from s3 and we can archive.
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


gopidesupavan commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2056790231


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   Hope that make sense or am i making too confusing, @potiuk  can pitch in :) 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


gopidesupavan commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2056788969


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   >Or is this for the docs on main?
   
   This docs publishing for released artifacts.
   
   >or is this for archival of the old docs?
   
   Archival we can add separate process inside aws, once we start serving docs 
from s3 and we can archive.
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


gopidesupavan commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r205606


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   Yeah it will be served directly from s3 all the docs. we plan to use the 
.htaccess file rewrite rules to redirect. 
   so provider and airflow docs are served from the s3 and rest are from the 
gh-pages branches.
   
   An example where i have tested with 
https://github.com/apache/airflow-site/blob/airflow-test/.htaccess and staging 
website https://airflow-test.staged.apache.org/ which serves docs from s3. 
   
   Some discussions with @ashb and @potiuk  are here 
https://apache-airflow.slack.com/archives/CJ1LVREHX/p1737287674851989?thread_ts=1737113729.883969&cid=CJ1LVREHX
   
   
   We could also add another steps later for archive where we can remove 
versions from serving bucket. 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


gopidesupavan commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r205606


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   Yeah it will be served directly from s3 all the docs. we plan to use the 
.htaccess(thanks to ash and jarek) file rewrite rules to redirect. 
   so provider and airflow docs are served from the s3 and rest are from the 
gh-pages branches.
   
   An example where i have tested with 
https://github.com/apache/airflow-site/blob/airflow-test/.htaccess and staging 
website https://airflow-test.staged.apache.org/ which serves docs from s3. 
   
   Some discussions with @ashb and @potiuk  are here 
https://apache-airflow.slack.com/archives/CJ1LVREHX/p1737287674851989?thread_ts=1737113729.883969&cid=CJ1LVREHX
   
   
   We could also add another steps later for archive where we can remove 
versions from serving bucket. 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


kaxil commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2056759785


##
dev/breeze/doc/09_release_management_tasks.rst:
##
@@ -713,3 +713,57 @@ default is to build ``both`` type of distributions 
``sdist`` and ``wheel``.
   :target: 
https://raw.githubusercontent.com/apache/airflow/main/dev/breeze/doc/images/output_release-management_prepare-airflow-ctl-distributions.svg
   :width: 100%
   :alt: Breeze release-management prepare-airflow-ctl-distributions
+
+Publishing the documentation to S3
+""
+
+To publish the documentation generated by ``build-docs`` in Breeze to ``S3``,
+use the ``release-management publish-docs-to-s3`` command:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs-to-s3
+
+The documentation publish to S3 should be done after the ``breeze 
release-management publish-docs`` command.
+Once documentation is available in ``docs-archive`` directory of airflow-site, 
it can be published to S3.
+
+The publishing documentation to S3 consists of the following steps:
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path <> 
--destination-location <>
+
+Where ``--source-dir-path`` is a doc-archive location path and 
``--destination-location`` is the S3 bucket path.
+
+.. code-block:: bash
+
+ breeze release-management publish-docs --source-dir-path 
/User/pavan/airflow-site/docs-archive
+ --destination-location s3://airflow-docs/docs
+
+To exclude any documentation from publishing to S3, you can use the 
``--exclude`` flag.
+
+.. code-block:: bash

Review Comment:
   To clarify, the docs will be stored in S3, but served to the users how?
   
   Or is this for the docs on main?
   
   or is this for archival of the old docs?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


potiuk commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2825297734

   This looks really good. We should likely merge it and prepare a draft 
documentation for the next steps of doing it by @eladkal with the next 
provider's wave.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-23 Thread via GitHub


gopidesupavan commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2824901149

   Just updated few updates related to AIRFLOW_ROOT_PATH and some ordering of 
steps.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-22 Thread via GitHub


gopidesupavan commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2822098623

   > Happy to review / get back on this after 3.0.0 -- just too fried right 
now, sry!
   
   Yeah fine :)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-19 Thread via GitHub


kaxil commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2816638308

   Happy to review / get back on this after 3.0.0 -- just too fried right now, 
sry!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-19 Thread via GitHub


bugraoz93 commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2816592138

   > > > Can i have some more review on this please?
   > > 
   > > Looks great! Small question, are we planning to integrate this workflow 
in `ci.yml`?
   > 
   > Not configured for CI, these new workflow will be triggered manually by 
release managers to publish docs to s3 when any release happens. :)
   > 
   
   Thanks a lot for the details! Makes sense :)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-18 Thread via GitHub


gopidesupavan commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2816274785

   > > Can i have some more review on this please?
   > 
   > Looks great! Small question, are we planning to integrate this workflow in 
`ci.yml`?
   
   Not configured for CI, these new workflow will be triggered manually by 
release managers to publish docs to s3 when any release happens. :)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-18 Thread via GitHub


bugraoz93 commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2816231554

   > Can i have some more review on this please?
   
   Looks great! Small question, are we planning to integrate this workflow in 
`ci.yml`?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-18 Thread via GitHub


gopidesupavan commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2816020213

   Can i have some more review on this please?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-16 Thread via GitHub


gopidesupavan commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2810707714

   > I am not super familiar with the process of doc publishing so dont take my 
word for it but LGTM :)
   
   Thanks vincent, no problem, this can wait, i will wait for others too 
review. :)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-08 Thread via GitHub


potiuk commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2787935170

   > Will rebase this and update after this merge #48760
   
   Feel free!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-06 Thread via GitHub


gopidesupavan commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2781471573

   Will rebase this and update after this merge #48760


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-04-05 Thread via GitHub


gopidesupavan commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2742492885

   > I will take a look shortly - after finishing airflow-core one :)
   
   Sure, probably I will wait and refactor after big fish ( airflow-core) :) 
merges. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-03-28 Thread via GitHub


vincbeck commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r2018808826


##
dev/breeze/src/airflow_breeze/utils/publish_docs_to_s3.py:
##
@@ -0,0 +1,198 @@
+# Licensed to the Apache Software Foundation (ASF) under one
+# or more contributor license agreements.  See the NOTICE file
+# distributed with this work for additional information
+# regarding copyright ownership.  The ASF licenses this file
+# to you under the Apache License, Version 2.0 (the
+# "License"); you may not use this file except in compliance
+# with the License.  You may obtain a copy of the License at
+#
+#   http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing,
+# software distributed under the License is distributed on an
+# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+# KIND, either express or implied.  See the License for the
+# specific language governing permissions and limitations
+# under the License.
+from __future__ import annotations
+
+import os
+import subprocess
+import sys
+from functools import cached_property
+
+import boto3
+
+from airflow_breeze.utils.console import get_console
+from airflow_breeze.utils.parallel import check_async_run_results, 
run_with_pool
+
+PROVIDER_NAME_FORMAT = "apache-airflow-providers-{}"
+
+NON_SHORT_NAME_PACKAGES = ["docker-stack", "helm-chart", "apache-airflow"]
+
+s3_client = boto3.client("s3")
+
+
+class S3DocsPublish:
+def __init__(
+self,
+source_dir_path: str,
+destination_location: str,
+exclude_docs: str,
+dry_run: bool = False,
+overwrite: bool = False,
+parallelism: int = 1,
+):
+self.source_dir_path = source_dir_path
+self.destination_location = destination_location
+self.exclude_docs = exclude_docs
+self.dry_run = dry_run
+self.overwrite = overwrite
+self.parallelism = parallelism
+self.source_dest_mapping: list[tuple[str, str]] = []
+
+@cached_property
+def get_all_docs(self):
+get_console().print(f"[info]Getting all docs from 
{self.source_dir_path}\n")
+try:
+all_docs = os.listdir(self.source_dir_path)
+except FileNotFoundError:
+get_console().print(f"[error]No docs found in 
{self.source_dir_path}\n")
+sys.exit(1)
+return all_docs
+
+@cached_property
+def get_all_excluded_docs(self):
+if not self.exclude_docs:
+return []
+return self.exclude_docs.split(",")
+
+@cached_property
+def get_all_eligible_docs(self):
+"""
+It excludes the docs that are in the exclude list
+"""
+non_eligible_docs = []
+
+for excluded_doc in self.get_all_excluded_docs:
+if excluded_doc in NON_SHORT_NAME_PACKAGES:
+non_eligible_docs.append(excluded_doc)
+continue
+
+for doc in self.get_all_docs:
+excluded_provider_name = 
PROVIDER_NAME_FORMAT.format(excluded_doc.replace(".", "-"))
+if doc == excluded_provider_name:
+non_eligible_docs.append(doc)
+continue
+
+docs_to_process = list(set(self.get_all_docs) - set(non_eligible_docs))
+if not docs_to_process:
+get_console().print("[error]No eligible docs found, all docs are 
excluded\n")
+sys.exit(1)
+
+return docs_to_process
+
+def is_doc_exists(self, s3_bucket_doc_location: str) -> bool:

Review Comment:
   ```suggestion
   def doc_exists(self, s3_bucket_doc_location: str) -> bool:
   ```



##
dev/breeze/src/airflow_breeze/utils/publish_docs_to_s3.py:
##
@@ -0,0 +1,198 @@
+# Licensed to the Apache Software Foundation (ASF) under one
+# or more contributor license agreements.  See the NOTICE file
+# distributed with this work for additional information
+# regarding copyright ownership.  The ASF licenses this file
+# to you under the Apache License, Version 2.0 (the
+# "License"); you may not use this file except in compliance
+# with the License.  You may obtain a copy of the License at
+#
+#   http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing,
+# software distributed under the License is distributed on an
+# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+# KIND, either express or implied.  See the License for the
+# specific language governing permissions and limitations
+# under the License.
+from __future__ import annotations
+
+import os
+import subprocess
+import sys
+from functools import cached_property
+
+import boto3
+
+from airflow_breeze.utils.console import get_console
+from airflow_breeze.utils.parallel import check_async_run_results, 
run_with_pool
+
+PROVIDER_NAME_FORMAT = "apache-airflow-providers-{}"
+
+NON_SHORT_NAME_PACKAGES = ["docker-stack", "helm-chart", "apache-airflow"]
+

Re: [PR] Add S3 docs publishing scripts [airflow]

2025-03-22 Thread via GitHub


potiuk commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2745379646

   Big fish done. Rebase and I will take a closer look :)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add S3 docs publishing scripts [airflow]

2025-03-19 Thread via GitHub


potiuk commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2735734363

   I will take a look shortly - after finishing airflow-core one :)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add s3 docs publishing scripts [airflow]

2025-03-15 Thread via GitHub


gopidesupavan commented on PR #47814:
URL: https://github.com/apache/airflow/pull/47814#issuecomment-2726783362

   Workflow testing require, will be able to do it only after merge.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] Add s3 docs publishing scripts [airflow]

2025-03-15 Thread via GitHub


gopidesupavan commented on code in PR #47814:
URL: https://github.com/apache/airflow/pull/47814#discussion_r1997061672


##
.github/workflows/publish-docs-to-s3.yml:
##
@@ -0,0 +1,204 @@
+# Licensed to the Apache Software Foundation (ASF) under one
+# or more contributor license agreements.  See the NOTICE file
+# distributed with this work for additional information
+# regarding copyright ownership.  The ASF licenses this file
+# to you under the Apache License, Version 2.0 (the
+# "License"); you may not use this file except in compliance
+# with the License.  You may obtain a copy of the License at
+#
+#   http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing,
+# software distributed under the License is distributed on an
+# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+# KIND, either express or implied.  See the License for the
+# specific language governing permissions and limitations
+# under the License.
+#
+---
+name: Publish Docs to S3
+on:  # yamllint disable-line rule:truthy
+  workflow_dispatch:
+inputs:
+  ref:
+description: "The branch or tag to checkout for the docs publishing"
+required: true
+type: string
+  exclude-docs:
+description: "Comma separated list of docs to exclude"
+required: false
+default: ""
+type: string
+  destination-location:
+description: "The destination location in S3"
+required: false
+default: "s3://staging-docs-airflow-apache-org/docs"

Review Comment:
   It requires for testing after that we can configure default live bucket.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org