[jira] [Commented] (AIRFLOW-4030) Add Singularity Container Operator

2020-02-23 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-4030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17042866#comment-17042866
 ] 

ASF subversion and git services commented on AIRFLOW-4030:
--

Commit 0bb687990b94da7445f4ba081592de8cea73119e in airflow's branch 
refs/heads/master from Vanessasaurus
[ https://gitbox.apache.org/repos/asf?p=airflow.git;h=0bb6879 ]

[AIRFLOW-4030] second attempt to add singularity to airflow (#7191)

* adding singularity operator and tests

Signed-off-by: Vanessa Sochat 

* removing encoding pragmas and fixing up dockerfile to pass linting

Signed-off-by: Vanessa Sochat 

* make workdir in /tmp because AIRFLOW_SOURCES not defined yet

Signed-off-by: Vanessa Sochat 

* curl needs to follow redirects with -L

Signed-off-by: Vanessa Sochat 

* moving files to where they are supposed to be, more changes to mock, no clue

Signed-off-by: vsoch 

* removing trailing whitespace, moving example_dag for singularity, adding 
licenses to empty init files

Signed-off-by: vsoch 

* ran isort on example dags file

Signed-off-by: vsoch 

* adding missing init in example_dags folder for singularity

Signed-off-by: vsoch 

* removing code from __init__.py files for singularity operator to fix 
documentation generation

Signed-off-by: vsoch 

* forgot to update link to singularity in operators and hooks ref

Signed-off-by: vsoch 

* command must have been provided on init of singularity operator instance

Signed-off-by: vsoch 

* I guess I'm required to have a task_id?

Signed-off-by: vsoch 

* try adding working_dir to singularity operator type definitions

Signed-off-by: vsoch 

* disable too many arguments for pylint of singularity operator init

Signed-off-by: vsoch 

* move pylint disable up to line 64 - doesnt catch at end of statement like 
other examples

Signed-off-by: vsoch 

* two spaces before inline comment

Signed-off-by: vsoch 

* I dont see task_id as a param for other providers, removing for singularity 
operator

Signed-off-by: vsoch 

* adding debug print

Signed-off-by: vsoch 

* allow for return of just image and/or lines

Signed-off-by: vsoch 

* dont understand how mock works, but the image should exist after its 
pulled

Signed-off-by: vsoch 

* try removing shutil, the client should handle pull folder instead

Signed-off-by: vsoch 

* try changing pull-file to same uri that is expected to be pulled

Signed-off-by: vsoch 

* import of AirflowException moved to exceptions

Signed-off-by: vsoch 

* DAG module was moved to airflow.models

Signed-off-by: vsoch 

* ensure pull is called with pull_folder

Signed-off-by: vsoch 
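
Several of the commits above concern mocking the container client in tests ("the image should exist after its pulled", "ensure pull is called with pull_folder"). The sketch below illustrates that testing pattern with the standard library's `unittest.mock` only. `PullingOperator`, its `client` argument, the `docker://busybox` URI and the `/tmp` paths are hypothetical stand-ins made up for illustration; this is not the operator code from the commit.

```python
from unittest import mock


class PullingOperator:
    """Hypothetical stand-in for an operator that pulls an image before running it."""

    def __init__(self, image, command, pull_folder=None, client=None):
        self.image = image
        self.command = command
        self.pull_folder = pull_folder
        self.client = client  # assumed: any object exposing a pull() method

    def execute(self):
        # Ask the client to pull the image into the requested folder and
        # return whatever local path the client reports.
        return self.client.pull(self.image, pull_folder=self.pull_folder)


def check_pull_uses_folder():
    client = mock.Mock()
    # The mocked pull returns a path, so the image appears to exist after the pull.
    client.pull.return_value = "/tmp/busybox_latest.sif"
    op = PullingOperator(
        "docker://busybox", "echo hello", pull_folder="/tmp", client=client
    )
    image_path = op.execute()
    # Raises AssertionError unless pull was called exactly once with these arguments.
    client.pull.assert_called_once_with("docker://busybox", pull_folder="/tmp")
    return image_path
```

Injecting the client keeps the test independent of a Singularity installation; a real test suite might instead patch the client where the operator imports it.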


> Add Singularity Container Operator
> --
>
> Key: AIRFLOW-4030
> URL: https://issues.apache.org/jira/browse/AIRFLOW-4030
> Project: Apache Airflow
> Issue Type: Improvement
> Components: operators
> Reporter: Vanessa
> Assignee: Vanessa
> Priority: Minor
> Fix For: 2.0.0
>
>
> Singularity containers are comparable to Docker in the level of operation: 
> they provide an encapsulated environment with an OS, libraries, and custom 
> software for the user to run. The key difference is that Docker is not 
> optimized for scientific compute because it could never be installed on a 
> shared research cluster. Singularity, on the other hand, does not have these 
> issues and is installed across HPC centers internationally.
> This issue is to add Singularity containers as an operator to Apache Airflow, 
> so that we can start to explore using Airflow in an HPC environment. I work 
> with Encode DCC at Stanford, and am hopeful to explore Airflow as an 
> alternative to the workflow manager(s) we are using. I am one of the 
> original Singularity developers (see 
> https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0177459) 
> and I manage the Singularity Python client (spython), Singularity Hub and 
> Singularity Registry Server. I have started working on this issue here: 
> https://github.com/apache/airflow/pull/4846. Looking forward to working 
> with you!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (AIRFLOW-4030) Add Singularity Container Operator

2020-02-23 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-4030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17042865#comment-17042865
 ] 

ASF GitHub Bot commented on AIRFLOW-4030:
-

potiuk commented on pull request #7191: [AIRFLOW-4030] second attempt to add 
singularity to airflow
URL: https://github.com/apache/airflow/pull/7191
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Commented] (AIRFLOW-4030) Add Singularity Container Operator

2020-01-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-4030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17017309#comment-17017309
 ] 

ASF GitHub Bot commented on AIRFLOW-4030:
-

vsoch commented on pull request #7191: [AIRFLOW-4030] second attempt to add 
singularity to airflow
URL: https://github.com/apache/airflow/pull/7191
 
 
   This is a second attempt to add Singularity Container support to Apache 
Airflow by way of Singularity Python. I am using the previously created JIRA 
ticket 4030 (created in March 2019) as it is still relevant. I am a new 
contributor and largely not familiar with the community here (and yes I've read 
the guidelines) so I would appreciate support and kindness from the individuals 
that act as maintainers here, and any additional support from other folks that 
are also interested in this integration. Thank you!
   
   Signed-off-by: Vanessa Sochat 
   
   ---
   Issue link: WILL BE INSERTED BY 
[boring-cyborg](https://github.com/kaxil/boring-cyborg)
   
   Make sure to mark the boxes below before creating PR: [x]
   
   - [x] Description above provides context of the change
   - [x] Commit message/PR title starts with `[AIRFLOW-]`. AIRFLOW- = 
JIRA ID*
   - [ ] Unit tests coverage for changes (not needed for documentation changes)
   - [x] Commits follow "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)"
   - [ ] Relevant documentation is updated including usage instructions.
   - [ ] I will engage committers as explained in [Contribution Workflow 
Example](https://github.com/apache/airflow/blob/master/CONTRIBUTING.rst#contribution-workflow-example).
   
   * For document-only changes commit message can start with 
`[AIRFLOW-]`.
   
   ---
   In case of fundamental code change, Airflow Improvement Proposal 
([AIP](https://cwiki.apache.org/confluence/display/AIRFLOW/Airflow+Improvements+Proposals))
 is needed.
   In case of a new dependency, check compliance with the [ASF 3rd Party 
License Policy](https://www.apache.org/legal/resolved.html#category-x).
   In case of backwards incompatible changes please leave a note in 
[UPDATING.md](https://github.com/apache/airflow/blob/master/UPDATING.md).
   Read the [Pull Request 
Guidelines](https://github.com/apache/airflow/blob/master/CONTRIBUTING.rst#pull-request-guidelines)
 for more information.
   
 



[jira] [Commented] (AIRFLOW-4030) Add Singularity Container Operator

2020-01-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-4030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17017291#comment-17017291
 ] 

ASF GitHub Bot commented on AIRFLOW-4030:
-

vsoch commented on pull request #4846: [AIRFLOW-4030] adding start to 
singularity for airflow
URL: https://github.com/apache/airflow/pull/4846
 
 
   
 



[jira] [Commented] (AIRFLOW-4030) Add Singularity Container Operator

2019-10-10 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-4030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16948742#comment-16948742
 ] 

ASF GitHub Bot commented on AIRFLOW-4030:
-

vsoch commented on pull request #4846: [AIRFLOW-4030] adding start to 
singularity for airflow
URL: https://github.com/apache/airflow/pull/4846
 
 
   Hey airflow maintainers! I'm working on an operator for [Singularity 
containers](https://sylabs.io/guides/3.0/user-guide/), and I have a very basic 
[example 
working](https://github.com/researchapps/airflow-example/tree/master/singularity).
 I'd like to do some more substantial tests beyond sleep (and I haven't written 
proper tests yet) but wanted to open the pull request to start discussion about 
tests and examples needed, etc. I'm new to airflow as of this afternoon, so 
thank you (in the future) for teaching me all the things!
   
   I haven't checked all the boxes below. I'm mostly afraid of Jira, so maybe 
if someone wants to create an issue for this, would be greatly appreciated!
   
   ### Jira
   
   - [x] My PR addresses the following [Airflow JIRA 
issue](https://issues.apache.org/jira/browse/AIRFLOW-4030)
   
   ### Tests
   
   - [x] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
   
   ### Commits
   
   - [ ] My commits all reference Jira issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
 1. Subject is separated from body by a blank line
 1. Subject is limited to 50 characters (not including Jira issue reference)
 1. Subject does not end with a period
 1. Subject uses the imperative mood ("add", not "adding")
 1. Body wraps at 72 characters
 1. Body explains "what" and "why", not "how"
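
The mechanical rules in the list above (blank line after the subject, 50-character subject excluding the Jira reference, no trailing period, 72-character body lines) can be checked automatically. The following is a minimal sketch; the function name `check_commit_message` and the `[AIRFLOW-<number>]` prefix pattern are assumptions made for illustration, and the rules about imperative mood and explaining "what" and "why" are not checked because they need human judgment.

```python
import re

# Assumed prefix format for the Jira reference, e.g. "[AIRFLOW-4030] ".
JIRA_PREFIX = re.compile(r"^\[AIRFLOW-\d+\]\s*")


def check_commit_message(message):
    """Return a list of rule violations found in a commit message."""
    problems = []
    lines = message.splitlines()
    subject = lines[0] if lines else ""
    # The 50-character limit does not count the Jira issue reference.
    bare_subject = JIRA_PREFIX.sub("", subject)
    if len(bare_subject) > 50:
        problems.append("subject longer than 50 characters")
    if subject.endswith("."):
        problems.append("subject ends with a period")
    # The second line must be blank when a body follows.
    if len(lines) > 1 and lines[1].strip():
        problems.append("no blank line between subject and body")
    # Report an over-long body line once, however many there are.
    for line in lines[2:]:
        if len(line) > 72:
            problems.append("body line longer than 72 characters")
            break
    return problems
```

An empty list means the message passes these four checks; a function like this could be called from a local `commit-msg` hook.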
   
   ### Documentation
   
   - [ ] In case of new functionality, my PR adds documentation that describes 
how to use it.
 - When adding new operators/hooks/sensors, the autoclass documentation 
generation needs to be added.
 - All the public functions and the classes in the PR contain docstrings 
that explain what they do
 - If you implement backwards incompatible changes, please leave a note in 
the [Updating.md](https://github.com/apache/airflow/blob/master/UPDATING.md) so 
we can assign it to an appropriate release
   
   ### Code Quality
   
   - [ ] Passes `flake8`
   
 



[jira] [Commented] (AIRFLOW-4030) Add Singularity Container Operator

2019-09-03 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-4030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16921609#comment-16921609
 ] 

ASF GitHub Bot commented on AIRFLOW-4030:
-

vsoch commented on pull request #4846: [AIRFLOW-4030] adding start to 
singularity for airflow
URL: https://github.com/apache/airflow/pull/4846
 
 
   
 
