Re: [VOTE] Accept Airflow into the Incubator

2016-03-24 Thread Chris Riccomini
+1 non-binding :)

On Thu, Mar 24, 2016 at 8:00 PM, Siddharth Anand 
wrote:

> Following the discussion earlier:
> https://s.apache.org/AirflowDiscussion
>
> I would like to call a VOTE for accepting Airflow as a new incubator
> project.
>
> The proposal is available at:
> https://wiki.apache.org/incubator/AirflowProposal
>
> The proposal is also included at the bottom of this email.
>
> Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> [ ] +1 accept Airflow into the Apache Incubator
> [ ] ±0
> [ ] -1 because...
>
> +1 (non-binding)
>
> Thanks,
> -s (Sid)
>
>
> == Abstract ==
>
> Airflow is a workflow automation and scheduling system that can be
> used to author and manage data pipelines.
>
> == Proposal ==
>
> Airflow provides a system for authoring and managing workflows a.k.a.
> data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> authors DAGs in Python using an Airflow-provided framework. He/She
> then executes the DAG using Airflow’s scheduler or registers the DAG
> for event-based execution. A web-based UI provides the developer with
> a range of options for managing and viewing his/her data pipelines.
> Background
>
> Airflow was developed at Airbnb to enable easier authorship and
> management of DAGs than were possible with existing solutions such as
> Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> more XML or property files to be bundled together to define a
> workflow. This separation of code and config can present a challenge
> to understanding the DAG - in Azkaban, a DAG’s structure is reflected
> by its file system tree and one can find himself/herself traversing
> the file system when inspecting or changing the structure of the DAG.
> Airflow workflows, on the other hand, are simply and elegantly defined
> in Python code, often a single file. Airflow merges the powerful
> Web-based management aspects of projects like Azkaban and Oozie with
> the simplicity and elegance of defining workflows in Python. Airflow,
> less than a year old in terms of its Open Source launch, is currently
> used in production environments in more than 30 companies and boasts
> an active contributor list of more than 100 developers, the vast
> majority of which (>95%) are outside of Airbnb.
>
> We would like to share it with the ASF and begin developing a
> community of developers and users within Apache.
>
> == Rationale ==
>
> Many organizations (>30) already benefit from running Airflow to
> manage data pipelines. Our 100+ contributors continue to provide
> integrations with 3rd party systems through the implementation of new
> hooks and operators, both of which are used in defining the tasks that
> compose workflows.
>
> == Current Status ==
>
> === Meritocracy ===
>
> Our intent with this incubator proposal is to start building a diverse
> developer community around Airflow following the Apache meritocracy
> model. Since Airflow was open-sourced in mid-2015, we have had fast
> adoption and contributions by multiple organizations the world over.
> We plan to continue to support new contributors and we will work to
> actively promote those who contribute significantly to the project to
> committers.
>
> === Community ===
>
> Airflow is currently being used in over 30 companies. We hope to
> extend our contributor base significantly and invite all those who are
> interested in building large-scale distributed systems to participate.
>
> === Core Developers ===
>
> Airflow is currently being developed by four engineers: Maxime
> Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> Chris is a member of the Apache Samza PMC and a contributor to various
> Apache projects, including Apache Kafka and Apache YARN. Maxime,
> Siddharth, and Bolke have contributed to Airflow.
>
> === Alignment ===
> The ASF is the natural choice to host the Airflow project as its goal
> of encouraging community-driven open-source projects fits with our
> vision for Airflow.
>
> == Known Risks ==
>
> === Orphaned Products ===
>
> The core developers plan to work part time on the project. There is
> very little risk of Airflow being abandoned as all of our companies
> rely on it.
>
> === Inexperience with Open Source ===
>
> All of the core developers have experience with open source
> development. Chris is a member of the Apache Samza PMC and a
> contributor to various Apache projects, including Apache Kafka and
> Apache YARN. Bolke is contributor on multiple open source projects and
> a few Apache projects as well, including Apache Hive, Apache Hadoop,
> and Apache Ranger.
>
> === Homogeneous Developers ===
>
> The current core developers are all from different companies. Our
> community of 100 contributors hail from over 30 different companies
> from across the world.
>
> === Reliance on Salaried Developers ===
>
> Currently, the only developer paid to work on this project is Maxime.
>
> === Relationships with Other Apache Products ===
>
> Airflow is deeply integrate

Re: [VOTE] Accept Airflow into the Incubator

2016-03-24 Thread Jakob Homan
+1 (binding)

-jakob 

> On Mar 24, 2016, at 8:20 PM, Chris Riccomini  wrote:
> 
> +1 non-binding :)
> 
> On Thu, Mar 24, 2016 at 8:00 PM, Siddharth Anand 
> wrote:
> 
>> Following the discussion earlier:
>>https://s.apache.org/AirflowDiscussion
>> 
>> I would like to call a VOTE for accepting Airflow as a new incubator
>> project.
>> 
>> The proposal is available at:
>> https://wiki.apache.org/incubator/AirflowProposal
>> 
>> The proposal is also included at the bottom of this email.
>> 
>> Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
>> [ ] +1 accept Airflow into the Apache Incubator
>> [ ] ±0
>> [ ] -1 because...
>> 
>> +1 (non-binding)
>> 
>> Thanks,
>> -s (Sid)
>> 
>> 
>> == Abstract ==
>> 
>> Airflow is a workflow automation and scheduling system that can be
>> used to author and manage data pipelines.
>> 
>> == Proposal ==
>> 
>> Airflow provides a system for authoring and managing workflows a.k.a.
>> data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
>> authors DAGs in Python using an Airflow-provided framework. He/She
>> then executes the DAG using Airflow’s scheduler or registers the DAG
>> for event-based execution. A web-based UI provides the developer with
>> a range of options for managing and viewing his/her data pipelines.
>> Background
>> 
>> Airflow was developed at Airbnb to enable easier authorship and
>> management of DAGs than were possible with existing solutions such as
>> Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
>> more XML or property files to be bundled together to define a
>> workflow. This separation of code and config can present a challenge
>> to understanding the DAG - in Azkaban, a DAG’s structure is reflected
>> by its file system tree and one can find himself/herself traversing
>> the file system when inspecting or changing the structure of the DAG.
>> Airflow workflows, on the other hand, are simply and elegantly defined
>> in Python code, often a single file. Airflow merges the powerful
>> Web-based management aspects of projects like Azkaban and Oozie with
>> the simplicity and elegance of defining workflows in Python. Airflow,
>> less than a year old in terms of its Open Source launch, is currently
>> used in production environments in more than 30 companies and boasts
>> an active contributor list of more than 100 developers, the vast
>> majority of which (>95%) are outside of Airbnb.
>> 
>> We would like to share it with the ASF and begin developing a
>> community of developers and users within Apache.
>> 
>> == Rationale ==
>> 
>> Many organizations (>30) already benefit from running Airflow to
>> manage data pipelines. Our 100+ contributors continue to provide
>> integrations with 3rd party systems through the implementation of new
>> hooks and operators, both of which are used in defining the tasks that
>> compose workflows.
>> 
>> == Current Status ==
>> 
>> === Meritocracy ===
>> 
>> Our intent with this incubator proposal is to start building a diverse
>> developer community around Airflow following the Apache meritocracy
>> model. Since Airflow was open-sourced in mid-2015, we have had fast
>> adoption and contributions by multiple organizations the world over.
>> We plan to continue to support new contributors and we will work to
>> actively promote those who contribute significantly to the project to
>> committers.
>> 
>> === Community ===
>> 
>> Airflow is currently being used in over 30 companies. We hope to
>> extend our contributor base significantly and invite all those who are
>> interested in building large-scale distributed systems to participate.
>> 
>> === Core Developers ===
>> 
>> Airflow is currently being developed by four engineers: Maxime
>> Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
>> Chris is a member of the Apache Samza PMC and a contributor to various
>> Apache projects, including Apache Kafka and Apache YARN. Maxime,
>> Siddharth, and Bolke have contributed to Airflow.
>> 
>> === Alignment ===
>> The ASF is the natural choice to host the Airflow project as its goal
>> of encouraging community-driven open-source projects fits with our
>> vision for Airflow.
>> 
>> == Known Risks ==
>> 
>> === Orphaned Products ===
>> 
>> The core developers plan to work part time on the project. There is
>> very little risk of Airflow being abandoned as all of our companies
>> rely on it.
>> 
>> === Inexperience with Open Source ===
>> 
>> All of the core developers have experience with open source
>> development. Chris is a member of the Apache Samza PMC and a
>> contributor to various Apache projects, including Apache Kafka and
>> Apache YARN. Bolke is contributor on multiple open source projects and
>> a few Apache projects as well, including Apache Hive, Apache Hadoop,
>> and Apache Ranger.
>> 
>> === Homogeneous Developers ===
>> 
>> The current core developers are all from different companies. Our
>> community of 100 contributors hail from over 30 different 

Re: [VOTE] Accept Airflow into the Incubator

2016-03-24 Thread Seetharam Venkatesh
+ 1 (binding)

All the best,
Venkatesh

On Thu, Mar 24, 2016 at 10:06 PM Jakob Homan  wrote:

> +1 (binding)
>
> -jakob
>
> > On Mar 24, 2016, at 8:20 PM, Chris Riccomini 
> wrote:
> >
> > +1 non-binding :)
> >
> > On Thu, Mar 24, 2016 at 8:00 PM, Siddharth Anand
> 
> > wrote:
> >
> >> Following the discussion earlier:
> >>https://s.apache.org/AirflowDiscussion
> >>
> >> I would like to call a VOTE for accepting Airflow as a new incubator
> >> project.
> >>
> >> The proposal is available at:
> >> https://wiki.apache.org/incubator/AirflowProposal
> >>
> >> The proposal is also included at the bottom of this email.
> >>
> >> Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> >> [ ] +1 accept Airflow into the Apache Incubator
> >> [ ] ±0
> >> [ ] -1 because...
> >>
> >> +1 (non-binding)
> >>
> >> Thanks,
> >> -s (Sid)
> >>
> >>
> >> == Abstract ==
> >>
> >> Airflow is a workflow automation and scheduling system that can be
> >> used to author and manage data pipelines.
> >>
> >> == Proposal ==
> >>
> >> Airflow provides a system for authoring and managing workflows a.k.a.
> >> data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> >> authors DAGs in Python using an Airflow-provided framework. He/She
> >> then executes the DAG using Airflow’s scheduler or registers the DAG
> >> for event-based execution. A web-based UI provides the developer with
> >> a range of options for managing and viewing his/her data pipelines.
> >> Background
> >>
> >> Airflow was developed at Airbnb to enable easier authorship and
> >> management of DAGs than were possible with existing solutions such as
> >> Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> >> more XML or property files to be bundled together to define a
> >> workflow. This separation of code and config can present a challenge
> >> to understanding the DAG - in Azkaban, a DAG’s structure is reflected
> >> by its file system tree and one can find himself/herself traversing
> >> the file system when inspecting or changing the structure of the DAG.
> >> Airflow workflows, on the other hand, are simply and elegantly defined
> >> in Python code, often a single file. Airflow merges the powerful
> >> Web-based management aspects of projects like Azkaban and Oozie with
> >> the simplicity and elegance of defining workflows in Python. Airflow,
> >> less than a year old in terms of its Open Source launch, is currently
> >> used in production environments in more than 30 companies and boasts
> >> an active contributor list of more than 100 developers, the vast
> >> majority of which (>95%) are outside of Airbnb.
> >>
> >> We would like to share it with the ASF and begin developing a
> >> community of developers and users within Apache.
> >>
> >> == Rationale ==
> >>
> >> Many organizations (>30) already benefit from running Airflow to
> >> manage data pipelines. Our 100+ contributors continue to provide
> >> integrations with 3rd party systems through the implementation of new
> >> hooks and operators, both of which are used in defining the tasks that
> >> compose workflows.
> >>
> >> == Current Status ==
> >>
> >> === Meritocracy ===
> >>
> >> Our intent with this incubator proposal is to start building a diverse
> >> developer community around Airflow following the Apache meritocracy
> >> model. Since Airflow was open-sourced in mid-2015, we have had fast
> >> adoption and contributions by multiple organizations the world over.
> >> We plan to continue to support new contributors and we will work to
> >> actively promote those who contribute significantly to the project to
> >> committers.
> >>
> >> === Community ===
> >>
> >> Airflow is currently being used in over 30 companies. We hope to
> >> extend our contributor base significantly and invite all those who are
> >> interested in building large-scale distributed systems to participate.
> >>
> >> === Core Developers ===
> >>
> >> Airflow is currently being developed by four engineers: Maxime
> >> Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> >> Chris is a member of the Apache Samza PMC and a contributor to various
> >> Apache projects, including Apache Kafka and Apache YARN. Maxime,
> >> Siddharth, and Bolke have contributed to Airflow.
> >>
> >> === Alignment ===
> >> The ASF is the natural choice to host the Airflow project as its goal
> >> of encouraging community-driven open-source projects fits with our
> >> vision for Airflow.
> >>
> >> == Known Risks ==
> >>
> >> === Orphaned Products ===
> >>
> >> The core developers plan to work part time on the project. There is
> >> very little risk of Airflow being abandoned as all of our companies
> >> rely on it.
> >>
> >> === Inexperience with Open Source ===
> >>
> >> All of the core developers have experience with open source
> >> development. Chris is a member of the Apache Samza PMC and a
> >> contributor to various Apache projects, including Apache Kafka and
> >> Apache YARN. Bolke is contribut

Re: [VOTE] Accept Airflow into the Incubator

2016-03-24 Thread Hitesh Shah
+1 (binding)

— HItesh

On Mar 24, 2016, at 8:00 PM, Siddharth Anand  wrote:

> Following the discussion earlier:
>https://s.apache.org/AirflowDiscussion
> 
> I would like to call a VOTE for accepting Airflow as a new incubator project.
> 
> The proposal is available at: 
> https://wiki.apache.org/incubator/AirflowProposal
> 
> The proposal is also included at the bottom of this email.
> 
> Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> [ ] +1 accept Airflow into the Apache Incubator
> [ ] ±0
> [ ] -1 because...
> 
> +1 (non-binding)
> 
> Thanks,
> -s (Sid)
> 
> 
> == Abstract ==
> 
> Airflow is a workflow automation and scheduling system that can be
> used to author and manage data pipelines.
> 
> == Proposal ==
> 
> Airflow provides a system for authoring and managing workflows a.k.a.
> data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> authors DAGs in Python using an Airflow-provided framework. He/She
> then executes the DAG using Airflow’s scheduler or registers the DAG
> for event-based execution. A web-based UI provides the developer with
> a range of options for managing and viewing his/her data pipelines.
> Background
> 
> Airflow was developed at Airbnb to enable easier authorship and
> management of DAGs than were possible with existing solutions such as
> Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> more XML or property files to be bundled together to define a
> workflow. This separation of code and config can present a challenge
> to understanding the DAG - in Azkaban, a DAG’s structure is reflected
> by its file system tree and one can find himself/herself traversing
> the file system when inspecting or changing the structure of the DAG.
> Airflow workflows, on the other hand, are simply and elegantly defined
> in Python code, often a single file. Airflow merges the powerful
> Web-based management aspects of projects like Azkaban and Oozie with
> the simplicity and elegance of defining workflows in Python. Airflow,
> less than a year old in terms of its Open Source launch, is currently
> used in production environments in more than 30 companies and boasts
> an active contributor list of more than 100 developers, the vast
> majority of which (>95%) are outside of Airbnb.
> 
> We would like to share it with the ASF and begin developing a
> community of developers and users within Apache.
> 
> == Rationale ==
> 
> Many organizations (>30) already benefit from running Airflow to
> manage data pipelines. Our 100+ contributors continue to provide
> integrations with 3rd party systems through the implementation of new
> hooks and operators, both of which are used in defining the tasks that
> compose workflows.
> 
> == Current Status ==
> 
> === Meritocracy ===
> 
> Our intent with this incubator proposal is to start building a diverse
> developer community around Airflow following the Apache meritocracy
> model. Since Airflow was open-sourced in mid-2015, we have had fast
> adoption and contributions by multiple organizations the world over.
> We plan to continue to support new contributors and we will work to
> actively promote those who contribute significantly to the project to
> committers.
> 
> === Community ===
> 
> Airflow is currently being used in over 30 companies. We hope to
> extend our contributor base significantly and invite all those who are
> interested in building large-scale distributed systems to participate.
> 
> === Core Developers ===
> 
> Airflow is currently being developed by four engineers: Maxime
> Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> Chris is a member of the Apache Samza PMC and a contributor to various
> Apache projects, including Apache Kafka and Apache YARN. Maxime,
> Siddharth, and Bolke have contributed to Airflow.
> 
> === Alignment ===
> The ASF is the natural choice to host the Airflow project as its goal
> of encouraging community-driven open-source projects fits with our
> vision for Airflow.
> 
> == Known Risks ==
> 
> === Orphaned Products ===
> 
> The core developers plan to work part time on the project. There is
> very little risk of Airflow being abandoned as all of our companies
> rely on it.
> 
> === Inexperience with Open Source ===
> 
> All of the core developers have experience with open source
> development. Chris is a member of the Apache Samza PMC and a
> contributor to various Apache projects, including Apache Kafka and
> Apache YARN. Bolke is contributor on multiple open source projects and
> a few Apache projects as well, including Apache Hive, Apache Hadoop,
> and Apache Ranger.
> 
> === Homogeneous Developers ===
> 
> The current core developers are all from different companies. Our
> community of 100 contributors hail from over 30 different companies
> from across the world.
> 
> === Reliance on Salaried Developers ===
> 
> Currently, the only developer paid to work on this project is Maxime.
> 
> === Relationships with Other Apache Products ===

Re: [VOTE] Accept Airflow into the Incubator

2016-03-24 Thread Maxime Beauchemin
+1 (non-binding)

Max

On Thu, Mar 24, 2016 at 11:07 PM, Hitesh Shah  wrote:

> +1 (binding)
>
> — HItesh
>
> On Mar 24, 2016, at 8:00 PM, Siddharth Anand 
> wrote:
>
> > Following the discussion earlier:
> >https://s.apache.org/AirflowDiscussion
> >
> > I would like to call a VOTE for accepting Airflow as a new incubator
> project.
> >
> > The proposal is available at:
> https://wiki.apache.org/incubator/AirflowProposal
> >
> > The proposal is also included at the bottom of this email.
> >
> > Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> > [ ] +1 accept Airflow into the Apache Incubator
> > [ ] ±0
> > [ ] -1 because...
> >
> > +1 (non-binding)
> >
> > Thanks,
> > -s (Sid)
> >
> >
> > == Abstract ==
> >
> > Airflow is a workflow automation and scheduling system that can be
> > used to author and manage data pipelines.
> >
> > == Proposal ==
> >
> > Airflow provides a system for authoring and managing workflows a.k.a.
> > data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> > authors DAGs in Python using an Airflow-provided framework. He/She
> > then executes the DAG using Airflow’s scheduler or registers the DAG
> > for event-based execution. A web-based UI provides the developer with
> > a range of options for managing and viewing his/her data pipelines.
> > Background
> >
> > Airflow was developed at Airbnb to enable easier authorship and
> > management of DAGs than were possible with existing solutions such as
> > Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> > more XML or property files to be bundled together to define a
> > workflow. This separation of code and config can present a challenge
> > to understanding the DAG - in Azkaban, a DAG’s structure is reflected
> > by its file system tree and one can find himself/herself traversing
> > the file system when inspecting or changing the structure of the DAG.
> > Airflow workflows, on the other hand, are simply and elegantly defined
> > in Python code, often a single file. Airflow merges the powerful
> > Web-based management aspects of projects like Azkaban and Oozie with
> > the simplicity and elegance of defining workflows in Python. Airflow,
> > less than a year old in terms of its Open Source launch, is currently
> > used in production environments in more than 30 companies and boasts
> > an active contributor list of more than 100 developers, the vast
> > majority of which (>95%) are outside of Airbnb.
> >
> > We would like to share it with the ASF and begin developing a
> > community of developers and users within Apache.
> >
> > == Rationale ==
> >
> > Many organizations (>30) already benefit from running Airflow to
> > manage data pipelines. Our 100+ contributors continue to provide
> > integrations with 3rd party systems through the implementation of new
> > hooks and operators, both of which are used in defining the tasks that
> > compose workflows.
> >
> > == Current Status ==
> >
> > === Meritocracy ===
> >
> > Our intent with this incubator proposal is to start building a diverse
> > developer community around Airflow following the Apache meritocracy
> > model. Since Airflow was open-sourced in mid-2015, we have had fast
> > adoption and contributions by multiple organizations the world over.
> > We plan to continue to support new contributors and we will work to
> > actively promote those who contribute significantly to the project to
> > committers.
> >
> > === Community ===
> >
> > Airflow is currently being used in over 30 companies. We hope to
> > extend our contributor base significantly and invite all those who are
> > interested in building large-scale distributed systems to participate.
> >
> > === Core Developers ===
> >
> > Airflow is currently being developed by four engineers: Maxime
> > Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> > Chris is a member of the Apache Samza PMC and a contributor to various
> > Apache projects, including Apache Kafka and Apache YARN. Maxime,
> > Siddharth, and Bolke have contributed to Airflow.
> >
> > === Alignment ===
> > The ASF is the natural choice to host the Airflow project as its goal
> > of encouraging community-driven open-source projects fits with our
> > vision for Airflow.
> >
> > == Known Risks ==
> >
> > === Orphaned Products ===
> >
> > The core developers plan to work part time on the project. There is
> > very little risk of Airflow being abandoned as all of our companies
> > rely on it.
> >
> > === Inexperience with Open Source ===
> >
> > All of the core developers have experience with open source
> > development. Chris is a member of the Apache Samza PMC and a
> > contributor to various Apache projects, including Apache Kafka and
> > Apache YARN. Bolke is contributor on multiple open source projects and
> > a few Apache projects as well, including Apache Hive, Apache Hadoop,
> > and Apache Ranger.
> >
> > === Homogeneous Developers ===
> >
> > The current core developers are all from diffe

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Sergio Fernández
+1 (binding)

On Fri, Mar 25, 2016 at 7:32 AM, Maxime Beauchemin <
maximebeauche...@gmail.com> wrote:

> +1 (non-binding)
>
> Max
>
> On Thu, Mar 24, 2016 at 11:07 PM, Hitesh Shah  wrote:
>
> > +1 (binding)
> >
> > — HItesh
> >
> > On Mar 24, 2016, at 8:00 PM, Siddharth Anand 
> > wrote:
> >
> > > Following the discussion earlier:
> > >https://s.apache.org/AirflowDiscussion
> > >
> > > I would like to call a VOTE for accepting Airflow as a new incubator
> > project.
> > >
> > > The proposal is available at:
> > https://wiki.apache.org/incubator/AirflowProposal
> > >
> > > The proposal is also included at the bottom of this email.
> > >
> > > Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> > > [ ] +1 accept Airflow into the Apache Incubator
> > > [ ] ±0
> > > [ ] -1 because...
> > >
> > > +1 (non-binding)
> > >
> > > Thanks,
> > > -s (Sid)
> > >
> > >
> > > == Abstract ==
> > >
> > > Airflow is a workflow automation and scheduling system that can be
> > > used to author and manage data pipelines.
> > >
> > > == Proposal ==
> > >
> > > Airflow provides a system for authoring and managing workflows a.k.a.
> > > data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> > > authors DAGs in Python using an Airflow-provided framework. He/She
> > > then executes the DAG using Airflow’s scheduler or registers the DAG
> > > for event-based execution. A web-based UI provides the developer with
> > > a range of options for managing and viewing his/her data pipelines.
> > > Background
> > >
> > > Airflow was developed at Airbnb to enable easier authorship and
> > > management of DAGs than were possible with existing solutions such as
> > > Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> > > more XML or property files to be bundled together to define a
> > > workflow. This separation of code and config can present a challenge
> > > to understanding the DAG - in Azkaban, a DAG’s structure is reflected
> > > by its file system tree and one can find himself/herself traversing
> > > the file system when inspecting or changing the structure of the DAG.
> > > Airflow workflows, on the other hand, are simply and elegantly defined
> > > in Python code, often a single file. Airflow merges the powerful
> > > Web-based management aspects of projects like Azkaban and Oozie with
> > > the simplicity and elegance of defining workflows in Python. Airflow,
> > > less than a year old in terms of its Open Source launch, is currently
> > > used in production environments in more than 30 companies and boasts
> > > an active contributor list of more than 100 developers, the vast
> > > majority of which (>95%) are outside of Airbnb.
> > >
> > > We would like to share it with the ASF and begin developing a
> > > community of developers and users within Apache.
> > >
> > > == Rationale ==
> > >
> > > Many organizations (>30) already benefit from running Airflow to
> > > manage data pipelines. Our 100+ contributors continue to provide
> > > integrations with 3rd party systems through the implementation of new
> > > hooks and operators, both of which are used in defining the tasks that
> > > compose workflows.
> > >
> > > == Current Status ==
> > >
> > > === Meritocracy ===
> > >
> > > Our intent with this incubator proposal is to start building a diverse
> > > developer community around Airflow following the Apache meritocracy
> > > model. Since Airflow was open-sourced in mid-2015, we have had fast
> > > adoption and contributions by multiple organizations the world over.
> > > We plan to continue to support new contributors and we will work to
> > > actively promote those who contribute significantly to the project to
> > > committers.
> > >
> > > === Community ===
> > >
> > > Airflow is currently being used in over 30 companies. We hope to
> > > extend our contributor base significantly and invite all those who are
> > > interested in building large-scale distributed systems to participate.
> > >
> > > === Core Developers ===
> > >
> > > Airflow is currently being developed by four engineers: Maxime
> > > Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> > > Chris is a member of the Apache Samza PMC and a contributor to various
> > > Apache projects, including Apache Kafka and Apache YARN. Maxime,
> > > Siddharth, and Bolke have contributed to Airflow.
> > >
> > > === Alignment ===
> > > The ASF is the natural choice to host the Airflow project as its goal
> > > of encouraging community-driven open-source projects fits with our
> > > vision for Airflow.
> > >
> > > == Known Risks ==
> > >
> > > === Orphaned Products ===
> > >
> > > The core developers plan to work part time on the project. There is
> > > very little risk of Airflow being abandoned as all of our companies
> > > rely on it.
> > >
> > > === Inexperience with Open Source ===
> > >
> > > All of the core developers have experience with open source
> > > development. Chris is a member of the Apache Samza PMC

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Suresh Marru
+ 1 (binding).

Suresh

> On Mar 24, 2016, at 11:00 PM, Siddharth Anand  
> wrote:
> 
> Following the discussion earlier:
>https://s.apache.org/AirflowDiscussion
> 
> I would like to call a VOTE for accepting Airflow as a new incubator project.
> 
> The proposal is available at: 
> https://wiki.apache.org/incubator/AirflowProposal
> 
> The proposal is also included at the bottom of this email.
> 
> Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> [ ] +1 accept Airflow into the Apache Incubator
> [ ] ±0
> [ ] -1 because...
> 
> +1 (non-binding)
> 
> Thanks,
> -s (Sid)
> 
> 
> == Abstract ==
> 
> Airflow is a workflow automation and scheduling system that can be
> used to author and manage data pipelines.
> 
> == Proposal ==
> 
> Airflow provides a system for authoring and managing workflows a.k.a.
> data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> authors DAGs in Python using an Airflow-provided framework. He/She
> then executes the DAG using Airflow’s scheduler or registers the DAG
> for event-based execution. A web-based UI provides the developer with
> a range of options for managing and viewing his/her data pipelines.
> Background
> 
> Airflow was developed at Airbnb to enable easier authorship and
> management of DAGs than were possible with existing solutions such as
> Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> more XML or property files to be bundled together to define a
> workflow. This separation of code and config can present a challenge
> to understanding the DAG - in Azkaban, a DAG’s structure is reflected
> by its file system tree and one can find himself/herself traversing
> the file system when inspecting or changing the structure of the DAG.
> Airflow workflows, on the other hand, are simply and elegantly defined
> in Python code, often a single file. Airflow merges the powerful
> Web-based management aspects of projects like Azkaban and Oozie with
> the simplicity and elegance of defining workflows in Python. Airflow,
> less than a year old in terms of its Open Source launch, is currently
> used in production environments in more than 30 companies and boasts
> an active contributor list of more than 100 developers, the vast
> majority of which (>95%) are outside of Airbnb.
> 
> We would like to share it with the ASF and begin developing a
> community of developers and users within Apache.
> 
> == Rationale ==
> 
> Many organizations (>30) already benefit from running Airflow to
> manage data pipelines. Our 100+ contributors continue to provide
> integrations with 3rd party systems through the implementation of new
> hooks and operators, both of which are used in defining the tasks that
> compose workflows.
> 
> == Current Status ==
> 
> === Meritocracy ===
> 
> Our intent with this incubator proposal is to start building a diverse
> developer community around Airflow following the Apache meritocracy
> model. Since Airflow was open-sourced in mid-2015, we have had fast
> adoption and contributions by multiple organizations the world over.
> We plan to continue to support new contributors and we will work to
> actively promote those who contribute significantly to the project to
> committers.
> 
> === Community ===
> 
> Airflow is currently being used in over 30 companies. We hope to
> extend our contributor base significantly and invite all those who are
> interested in building large-scale distributed systems to participate.
> 
> === Core Developers ===
> 
> Airflow is currently being developed by four engineers: Maxime
> Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> Chris is a member of the Apache Samza PMC and a contributor to various
> Apache projects, including Apache Kafka and Apache YARN. Maxime,
> Siddharth, and Bolke have contributed to Airflow.
> 
> === Alignment ===
> The ASF is the natural choice to host the Airflow project as its goal
> of encouraging community-driven open-source projects fits with our
> vision for Airflow.
> 
> == Known Risks ==
> 
> === Orphaned Products ===
> 
> The core developers plan to work part time on the project. There is
> very little risk of Airflow being abandoned as all of our companies
> rely on it.
> 
> === Inexperience with Open Source ===
> 
> All of the core developers have experience with open source
> development. Chris is a member of the Apache Samza PMC and a
> contributor to various Apache projects, including Apache Kafka and
> Apache YARN. Bolke is contributor on multiple open source projects and
> a few Apache projects as well, including Apache Hive, Apache Hadoop,
> and Apache Ranger.
> 
> === Homogeneous Developers ===
> 
> The current core developers are all from different companies. Our
> community of 100 contributors hail from over 30 different companies
> from across the world.
> 
> === Reliance on Salaried Developers ===
> 
> Currently, the only developer paid to work on this project is Maxime.
> 
> === Relationships with Other Apache Prod

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Jeremiah Lowin
+1 (non-binding)

Jeremiah


> On Mar 25, 2016, at 8:26 AM, Suresh Marru  wrote:
> 
> + 1 (binding).
> 
> Suresh
> 
>> On Mar 24, 2016, at 11:00 PM, Siddharth Anand  
>> wrote:
>> 
>> Following the discussion earlier:
>>   https://s.apache.org/AirflowDiscussion
>> 
>> I would like to call a VOTE for accepting Airflow as a new incubator project.
>> 
>> The proposal is available at: 
>> https://wiki.apache.org/incubator/AirflowProposal
>> 
>> The proposal is also included at the bottom of this email.
>> 
>> Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
>> [ ] +1 accept Airflow into the Apache Incubator
>> [ ] ±0
>> [ ] -1 because...
>> 
>> +1 (non-binding)
>> 
>> Thanks,
>> -s (Sid)
>> 
>> 
>> == Abstract ==
>> 
>> Airflow is a workflow automation and scheduling system that can be
>> used to author and manage data pipelines.
>> 
>> == Proposal ==
>> 
>> Airflow provides a system for authoring and managing workflows a.k.a.
>> data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
>> authors DAGs in Python using an Airflow-provided framework. He/She
>> then executes the DAG using Airflow’s scheduler or registers the DAG
>> for event-based execution. A web-based UI provides the developer with
>> a range of options for managing and viewing his/her data pipelines.
>> Background
>> 
>> Airflow was developed at Airbnb to enable easier authorship and
>> management of DAGs than were possible with existing solutions such as
>> Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
>> more XML or property files to be bundled together to define a
>> workflow. This separation of code and config can present a challenge
>> to understanding the DAG - in Azkaban, a DAG’s structure is reflected
>> by its file system tree and one can find himself/herself traversing
>> the file system when inspecting or changing the structure of the DAG.
>> Airflow workflows, on the other hand, are simply and elegantly defined
>> in Python code, often a single file. Airflow merges the powerful
>> Web-based management aspects of projects like Azkaban and Oozie with
>> the simplicity and elegance of defining workflows in Python. Airflow,
>> less than a year old in terms of its Open Source launch, is currently
>> used in production environments in more than 30 companies and boasts
>> an active contributor list of more than 100 developers, the vast
>> majority of which (>95%) are outside of Airbnb.
>> 
>> We would like to share it with the ASF and begin developing a
>> community of developers and users within Apache.
>> 
>> == Rationale ==
>> 
>> Many organizations (>30) already benefit from running Airflow to
>> manage data pipelines. Our 100+ contributors continue to provide
>> integrations with 3rd party systems through the implementation of new
>> hooks and operators, both of which are used in defining the tasks that
>> compose workflows.
>> 
>> == Current Status ==
>> 
>> === Meritocracy ===
>> 
>> Our intent with this incubator proposal is to start building a diverse
>> developer community around Airflow following the Apache meritocracy
>> model. Since Airflow was open-sourced in mid-2015, we have had fast
>> adoption and contributions by multiple organizations the world over.
>> We plan to continue to support new contributors and we will work to
>> actively promote those who contribute significantly to the project to
>> committers.
>> 
>> === Community ===
>> 
>> Airflow is currently being used in over 30 companies. We hope to
>> extend our contributor base significantly and invite all those who are
>> interested in building large-scale distributed systems to participate.
>> 
>> === Core Developers ===
>> 
>> Airflow is currently being developed by four engineers: Maxime
>> Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
>> Chris is a member of the Apache Samza PMC and a contributor to various
>> Apache projects, including Apache Kafka and Apache YARN. Maxime,
>> Siddharth, and Bolke have contributed to Airflow.
>> 
>> === Alignment ===
>> The ASF is the natural choice to host the Airflow project as its goal
>> of encouraging community-driven open-source projects fits with our
>> vision for Airflow.
>> 
>> == Known Risks ==
>> 
>> === Orphaned Products ===
>> 
>> The core developers plan to work part time on the project. There is
>> very little risk of Airflow being abandoned as all of our companies
>> rely on it.
>> 
>> === Inexperience with Open Source ===
>> 
>> All of the core developers have experience with open source
>> development. Chris is a member of the Apache Samza PMC and a
>> contributor to various Apache projects, including Apache Kafka and
>> Apache YARN. Bolke is contributor on multiple open source projects and
>> a few Apache projects as well, including Apache Hive, Apache Hadoop,
>> and Apache Ranger.
>> 
>> === Homogeneous Developers ===
>> 
>> The current core developers are all from different companies. Our
>> community of 100 contributors hail from over 30 

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Jacques Nadeau
+1 (binding)

On Fri, Mar 25, 2016 at 5:57 AM, Jeremiah Lowin  wrote:

> +1 (non-binding)
>
> Jeremiah
>
>
> > On Mar 25, 2016, at 8:26 AM, Suresh Marru  wrote:
> >
> > + 1 (binding).
> >
> > Suresh
> >
> >> On Mar 24, 2016, at 11:00 PM, Siddharth Anand 
> wrote:
> >>
> >> Following the discussion earlier:
> >>   https://s.apache.org/AirflowDiscussion
> >>
> >> I would like to call a VOTE for accepting Airflow as a new incubator
> project.
> >>
> >> The proposal is available at:
> https://wiki.apache.org/incubator/AirflowProposal
> >>
> >> The proposal is also included at the bottom of this email.
> >>
> >> Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> >> [ ] +1 accept Airflow into the Apache Incubator
> >> [ ] ±0
> >> [ ] -1 because...
> >>
> >> +1 (non-binding)
> >>
> >> Thanks,
> >> -s (Sid)
> >>
> >>
> >> == Abstract ==
> >>
> >> Airflow is a workflow automation and scheduling system that can be
> >> used to author and manage data pipelines.
> >>
> >> == Proposal ==
> >>
> >> Airflow provides a system for authoring and managing workflows a.k.a.
> >> data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> >> authors DAGs in Python using an Airflow-provided framework. He/She
> >> then executes the DAG using Airflow’s scheduler or registers the DAG
> >> for event-based execution. A web-based UI provides the developer with
> >> a range of options for managing and viewing his/her data pipelines.
> >> Background
> >>
> >> Airflow was developed at Airbnb to enable easier authorship and
> >> management of DAGs than were possible with existing solutions such as
> >> Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> >> more XML or property files to be bundled together to define a
> >> workflow. This separation of code and config can present a challenge
> >> to understanding the DAG - in Azkaban, a DAG’s structure is reflected
> >> by its file system tree and one can find himself/herself traversing
> >> the file system when inspecting or changing the structure of the DAG.
> >> Airflow workflows, on the other hand, are simply and elegantly defined
> >> in Python code, often a single file. Airflow merges the powerful
> >> Web-based management aspects of projects like Azkaban and Oozie with
> >> the simplicity and elegance of defining workflows in Python. Airflow,
> >> less than a year old in terms of its Open Source launch, is currently
> >> used in production environments in more than 30 companies and boasts
> >> an active contributor list of more than 100 developers, the vast
> >> majority of which (>95%) are outside of Airbnb.
> >>
> >> We would like to share it with the ASF and begin developing a
> >> community of developers and users within Apache.
> >>
> >> == Rationale ==
> >>
> >> Many organizations (>30) already benefit from running Airflow to
> >> manage data pipelines. Our 100+ contributors continue to provide
> >> integrations with 3rd party systems through the implementation of new
> >> hooks and operators, both of which are used in defining the tasks that
> >> compose workflows.
> >>
> >> == Current Status ==
> >>
> >> === Meritocracy ===
> >>
> >> Our intent with this incubator proposal is to start building a diverse
> >> developer community around Airflow following the Apache meritocracy
> >> model. Since Airflow was open-sourced in mid-2015, we have had fast
> >> adoption and contributions by multiple organizations the world over.
> >> We plan to continue to support new contributors and we will work to
> >> actively promote those who contribute significantly to the project to
> >> committers.
> >>
> >> === Community ===
> >>
> >> Airflow is currently being used in over 30 companies. We hope to
> >> extend our contributor base significantly and invite all those who are
> >> interested in building large-scale distributed systems to participate.
> >>
> >> === Core Developers ===
> >>
> >> Airflow is currently being developed by four engineers: Maxime
> >> Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> >> Chris is a member of the Apache Samza PMC and a contributor to various
> >> Apache projects, including Apache Kafka and Apache YARN. Maxime,
> >> Siddharth, and Bolke have contributed to Airflow.
> >>
> >> === Alignment ===
> >> The ASF is the natural choice to host the Airflow project as its goal
> >> of encouraging community-driven open-source projects fits with our
> >> vision for Airflow.
> >>
> >> == Known Risks ==
> >>
> >> === Orphaned Products ===
> >>
> >> The core developers plan to work part time on the project. There is
> >> very little risk of Airflow being abandoned as all of our companies
> >> rely on it.
> >>
> >> === Inexperience with Open Source ===
> >>
> >> All of the core developers have experience with open source
> >> development. Chris is a member of the Apache Samza PMC and a
> >> contributor to various Apache projects, including Apache Kafka and
> >> Apache YARN. Bolke is contributor on multiple open 

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Chris Nauroth
+1 (binding)

--Chris Nauroth




On 3/24/16, 8:00 PM, "Siddharth Anand"  wrote:

>Following the discussion earlier:
>https://s.apache.org/AirflowDiscussion
>
>I would like to call a VOTE for accepting Airflow as a new incubator
>project.
>
>The proposal is available at:
>https://wiki.apache.org/incubator/AirflowProposal
>
>The proposal is also included at the bottom of this email.
>
>Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
>[ ] +1 accept Airflow into the Apache Incubator
>[ ] ±0
>[ ] -1 because...
>
>+1 (non-binding)
>
>Thanks,
>-s (Sid)
>
>
>== Abstract ==
>
>Airflow is a workflow automation and scheduling system that can be
>used to author and manage data pipelines.
>
>== Proposal ==
>
>Airflow provides a system for authoring and managing workflows a.k.a.
>data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
>authors DAGs in Python using an Airflow-provided framework. He/She
>then executes the DAG using Airflow¹s scheduler or registers the DAG
>for event-based execution. A web-based UI provides the developer with
>a range of options for managing and viewing his/her data pipelines.
>Background
>
>Airflow was developed at Airbnb to enable easier authorship and
>management of DAGs than were possible with existing solutions such as
>Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
>more XML or property files to be bundled together to define a
>workflow. This separation of code and config can present a challenge
>to understanding the DAG - in Azkaban, a DAG¹s structure is reflected
>by its file system tree and one can find himself/herself traversing
>the file system when inspecting or changing the structure of the DAG.
>Airflow workflows, on the other hand, are simply and elegantly defined
>in Python code, often a single file. Airflow merges the powerful
>Web-based management aspects of projects like Azkaban and Oozie with
>the simplicity and elegance of defining workflows in Python. Airflow,
>less than a year old in terms of its Open Source launch, is currently
>used in production environments in more than 30 companies and boasts
>an active contributor list of more than 100 developers, the vast
>majority of which (>95%) are outside of Airbnb.
>
>We would like to share it with the ASF and begin developing a
>community of developers and users within Apache.
>
>== Rationale ==
>
>Many organizations (>30) already benefit from running Airflow to
>manage data pipelines. Our 100+ contributors continue to provide
>integrations with 3rd party systems through the implementation of new
>hooks and operators, both of which are used in defining the tasks that
>compose workflows.
>
>== Current Status ==
>
>=== Meritocracy ===
>
>Our intent with this incubator proposal is to start building a diverse
>developer community around Airflow following the Apache meritocracy
>model. Since Airflow was open-sourced in mid-2015, we have had fast
>adoption and contributions by multiple organizations the world over.
>We plan to continue to support new contributors and we will work to
>actively promote those who contribute significantly to the project to
>committers.
>
>=== Community ===
>
>Airflow is currently being used in over 30 companies. We hope to
>extend our contributor base significantly and invite all those who are
>interested in building large-scale distributed systems to participate.
>
>=== Core Developers ===
>
>Airflow is currently being developed by four engineers: Maxime
>Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
>Chris is a member of the Apache Samza PMC and a contributor to various
>Apache projects, including Apache Kafka and Apache YARN. Maxime,
>Siddharth, and Bolke have contributed to Airflow.
>
>=== Alignment ===
>The ASF is the natural choice to host the Airflow project as its goal
>of encouraging community-driven open-source projects fits with our
>vision for Airflow.
>
>== Known Risks ==
>
>=== Orphaned Products ===
>
>The core developers plan to work part time on the project. There is
>very little risk of Airflow being abandoned as all of our companies
>rely on it.
>
>=== Inexperience with Open Source ===
>
>All of the core developers have experience with open source
>development. Chris is a member of the Apache Samza PMC and a
>contributor to various Apache projects, including Apache Kafka and
>Apache YARN. Bolke is contributor on multiple open source projects and
>a few Apache projects as well, including Apache Hive, Apache Hadoop,
>and Apache Ranger.
>
>=== Homogeneous Developers ===
>
>The current core developers are all from different companies. Our
>community of 100 contributors hail from over 30 different companies
>from across the world.
>
>=== Reliance on Salaried Developers ===
>
>Currently, the only developer paid to work on this project is Maxime.
>
>=== Relationships with Other Apache Products ===
>
>Airflow is deeply integrated with Apache products. It currently
>provides hooks and operators to enable workflows 

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Arthur Wiedmer
+1 (non-binding)

Regards,
Arthur
On Mar 25, 2016 9:27 AM, "Chris Nauroth"  wrote:

> +1 (binding)
>
> --Chris Nauroth
>
>
>
>
> On 3/24/16, 8:00 PM, "Siddharth Anand"  wrote:
>
> >Following the discussion earlier:
> >https://s.apache.org/AirflowDiscussion
> >
> >I would like to call a VOTE for accepting Airflow as a new incubator
> >project.
> >
> >The proposal is available at:
> >https://wiki.apache.org/incubator/AirflowProposal
> >
> >The proposal is also included at the bottom of this email.
> >
> >Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> >[ ] +1 accept Airflow into the Apache Incubator
> >[ ] ±0
> >[ ] -1 because...
> >
> >+1 (non-binding)
> >
> >Thanks,
> >-s (Sid)
> >
> >
> >== Abstract ==
> >
> >Airflow is a workflow automation and scheduling system that can be
> >used to author and manage data pipelines.
> >
> >== Proposal ==
> >
> >Airflow provides a system for authoring and managing workflows a.k.a.
> >data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> >authors DAGs in Python using an Airflow-provided framework. He/She
> >then executes the DAG using Airflow¹s scheduler or registers the DAG
> >for event-based execution. A web-based UI provides the developer with
> >a range of options for managing and viewing his/her data pipelines.
> >Background
> >
> >Airflow was developed at Airbnb to enable easier authorship and
> >management of DAGs than were possible with existing solutions such as
> >Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> >more XML or property files to be bundled together to define a
> >workflow. This separation of code and config can present a challenge
> >to understanding the DAG - in Azkaban, a DAG¹s structure is reflected
> >by its file system tree and one can find himself/herself traversing
> >the file system when inspecting or changing the structure of the DAG.
> >Airflow workflows, on the other hand, are simply and elegantly defined
> >in Python code, often a single file. Airflow merges the powerful
> >Web-based management aspects of projects like Azkaban and Oozie with
> >the simplicity and elegance of defining workflows in Python. Airflow,
> >less than a year old in terms of its Open Source launch, is currently
> >used in production environments in more than 30 companies and boasts
> >an active contributor list of more than 100 developers, the vast
> >majority of which (>95%) are outside of Airbnb.
> >
> >We would like to share it with the ASF and begin developing a
> >community of developers and users within Apache.
> >
> >== Rationale ==
> >
> >Many organizations (>30) already benefit from running Airflow to
> >manage data pipelines. Our 100+ contributors continue to provide
> >integrations with 3rd party systems through the implementation of new
> >hooks and operators, both of which are used in defining the tasks that
> >compose workflows.
> >
> >== Current Status ==
> >
> >=== Meritocracy ===
> >
> >Our intent with this incubator proposal is to start building a diverse
> >developer community around Airflow following the Apache meritocracy
> >model. Since Airflow was open-sourced in mid-2015, we have had fast
> >adoption and contributions by multiple organizations the world over.
> >We plan to continue to support new contributors and we will work to
> >actively promote those who contribute significantly to the project to
> >committers.
> >
> >=== Community ===
> >
> >Airflow is currently being used in over 30 companies. We hope to
> >extend our contributor base significantly and invite all those who are
> >interested in building large-scale distributed systems to participate.
> >
> >=== Core Developers ===
> >
> >Airflow is currently being developed by four engineers: Maxime
> >Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> >Chris is a member of the Apache Samza PMC and a contributor to various
> >Apache projects, including Apache Kafka and Apache YARN. Maxime,
> >Siddharth, and Bolke have contributed to Airflow.
> >
> >=== Alignment ===
> >The ASF is the natural choice to host the Airflow project as its goal
> >of encouraging community-driven open-source projects fits with our
> >vision for Airflow.
> >
> >== Known Risks ==
> >
> >=== Orphaned Products ===
> >
> >The core developers plan to work part time on the project. There is
> >very little risk of Airflow being abandoned as all of our companies
> >rely on it.
> >
> >=== Inexperience with Open Source ===
> >
> >All of the core developers have experience with open source
> >development. Chris is a member of the Apache Samza PMC and a
> >contributor to various Apache projects, including Apache Kafka and
> >Apache YARN. Bolke is contributor on multiple open source projects and
> >a few Apache projects as well, including Apache Hive, Apache Hadoop,
> >and Apache Ranger.
> >
> >=== Homogeneous Developers ===
> >
> >The current core developers are all from different companies. Our
> >community of 100 contributors hail from over 30

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Timothy Chen
+1 (non-binding)

Tim

On Fri, Mar 25, 2016 at 9:37 AM, Arthur Wiedmer
 wrote:
> +1 (non-binding)
>
> Regards,
> Arthur
> On Mar 25, 2016 9:27 AM, "Chris Nauroth"  wrote:
>
>> +1 (binding)
>>
>> --Chris Nauroth
>>
>>
>>
>>
>> On 3/24/16, 8:00 PM, "Siddharth Anand"  wrote:
>>
>> >Following the discussion earlier:
>> >https://s.apache.org/AirflowDiscussion
>> >
>> >I would like to call a VOTE for accepting Airflow as a new incubator
>> >project.
>> >
>> >The proposal is available at:
>> >https://wiki.apache.org/incubator/AirflowProposal
>> >
>> >The proposal is also included at the bottom of this email.
>> >
>> >Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
>> >[ ] +1 accept Airflow into the Apache Incubator
>> >[ ] ±0
>> >[ ] -1 because...
>> >
>> >+1 (non-binding)
>> >
>> >Thanks,
>> >-s (Sid)
>> >
>> >
>> >== Abstract ==
>> >
>> >Airflow is a workflow automation and scheduling system that can be
>> >used to author and manage data pipelines.
>> >
>> >== Proposal ==
>> >
>> >Airflow provides a system for authoring and managing workflows a.k.a.
>> >data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
>> >authors DAGs in Python using an Airflow-provided framework. He/She
>> >then executes the DAG using Airflow¹s scheduler or registers the DAG
>> >for event-based execution. A web-based UI provides the developer with
>> >a range of options for managing and viewing his/her data pipelines.
>> >Background
>> >
>> >Airflow was developed at Airbnb to enable easier authorship and
>> >management of DAGs than were possible with existing solutions such as
>> >Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
>> >more XML or property files to be bundled together to define a
>> >workflow. This separation of code and config can present a challenge
>> >to understanding the DAG - in Azkaban, a DAG¹s structure is reflected
>> >by its file system tree and one can find himself/herself traversing
>> >the file system when inspecting or changing the structure of the DAG.
>> >Airflow workflows, on the other hand, are simply and elegantly defined
>> >in Python code, often a single file. Airflow merges the powerful
>> >Web-based management aspects of projects like Azkaban and Oozie with
>> >the simplicity and elegance of defining workflows in Python. Airflow,
>> >less than a year old in terms of its Open Source launch, is currently
>> >used in production environments in more than 30 companies and boasts
>> >an active contributor list of more than 100 developers, the vast
>> >majority of which (>95%) are outside of Airbnb.
>> >
>> >We would like to share it with the ASF and begin developing a
>> >community of developers and users within Apache.
>> >
>> >== Rationale ==
>> >
>> >Many organizations (>30) already benefit from running Airflow to
>> >manage data pipelines. Our 100+ contributors continue to provide
>> >integrations with 3rd party systems through the implementation of new
>> >hooks and operators, both of which are used in defining the tasks that
>> >compose workflows.
>> >
>> >== Current Status ==
>> >
>> >=== Meritocracy ===
>> >
>> >Our intent with this incubator proposal is to start building a diverse
>> >developer community around Airflow following the Apache meritocracy
>> >model. Since Airflow was open-sourced in mid-2015, we have had fast
>> >adoption and contributions by multiple organizations the world over.
>> >We plan to continue to support new contributors and we will work to
>> >actively promote those who contribute significantly to the project to
>> >committers.
>> >
>> >=== Community ===
>> >
>> >Airflow is currently being used in over 30 companies. We hope to
>> >extend our contributor base significantly and invite all those who are
>> >interested in building large-scale distributed systems to participate.
>> >
>> >=== Core Developers ===
>> >
>> >Airflow is currently being developed by four engineers: Maxime
>> >Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
>> >Chris is a member of the Apache Samza PMC and a contributor to various
>> >Apache projects, including Apache Kafka and Apache YARN. Maxime,
>> >Siddharth, and Bolke have contributed to Airflow.
>> >
>> >=== Alignment ===
>> >The ASF is the natural choice to host the Airflow project as its goal
>> >of encouraging community-driven open-source projects fits with our
>> >vision for Airflow.
>> >
>> >== Known Risks ==
>> >
>> >=== Orphaned Products ===
>> >
>> >The core developers plan to work part time on the project. There is
>> >very little risk of Airflow being abandoned as all of our companies
>> >rely on it.
>> >
>> >=== Inexperience with Open Source ===
>> >
>> >All of the core developers have experience with open source
>> >development. Chris is a member of the Apache Samza PMC and a
>> >contributor to various Apache projects, including Apache Kafka and
>> >Apache YARN. Bolke is contributor on multiple open source projects and
>> >a few Apache projects as well, including Ap

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread moon soo Lee
+1 (non-binding)

On Fri, Mar 25, 2016 at 1:05 PM Timothy Chen  wrote:

> +1 (non-binding)
>
> Tim
>
> On Fri, Mar 25, 2016 at 9:37 AM, Arthur Wiedmer
>  wrote:
> > +1 (non-binding)
> >
> > Regards,
> > Arthur
> > On Mar 25, 2016 9:27 AM, "Chris Nauroth" 
> wrote:
> >
> >> +1 (binding)
> >>
> >> --Chris Nauroth
> >>
> >>
> >>
> >>
> >> On 3/24/16, 8:00 PM, "Siddharth Anand" 
> wrote:
> >>
> >> >Following the discussion earlier:
> >> >https://s.apache.org/AirflowDiscussion
> >> >
> >> >I would like to call a VOTE for accepting Airflow as a new incubator
> >> >project.
> >> >
> >> >The proposal is available at:
> >> >https://wiki.apache.org/incubator/AirflowProposal
> >> >
> >> >The proposal is also included at the bottom of this email.
> >> >
> >> >Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> >> >[ ] +1 accept Airflow into the Apache Incubator
> >> >[ ] ±0
> >> >[ ] -1 because...
> >> >
> >> >+1 (non-binding)
> >> >
> >> >Thanks,
> >> >-s (Sid)
> >> >
> >> >
> >> >== Abstract ==
> >> >
> >> >Airflow is a workflow automation and scheduling system that can be
> >> >used to author and manage data pipelines.
> >> >
> >> >== Proposal ==
> >> >
> >> >Airflow provides a system for authoring and managing workflows a.k.a.
> >> >data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> >> >authors DAGs in Python using an Airflow-provided framework. He/She
> >> >then executes the DAG using Airflow¹s scheduler or registers the DAG
> >> >for event-based execution. A web-based UI provides the developer with
> >> >a range of options for managing and viewing his/her data pipelines.
> >> >Background
> >> >
> >> >Airflow was developed at Airbnb to enable easier authorship and
> >> >management of DAGs than were possible with existing solutions such as
> >> >Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> >> >more XML or property files to be bundled together to define a
> >> >workflow. This separation of code and config can present a challenge
> >> >to understanding the DAG - in Azkaban, a DAG¹s structure is reflected
> >> >by its file system tree and one can find himself/herself traversing
> >> >the file system when inspecting or changing the structure of the DAG.
> >> >Airflow workflows, on the other hand, are simply and elegantly defined
> >> >in Python code, often a single file. Airflow merges the powerful
> >> >Web-based management aspects of projects like Azkaban and Oozie with
> >> >the simplicity and elegance of defining workflows in Python. Airflow,
> >> >less than a year old in terms of its Open Source launch, is currently
> >> >used in production environments in more than 30 companies and boasts
> >> >an active contributor list of more than 100 developers, the vast
> >> >majority of which (>95%) are outside of Airbnb.
> >> >
> >> >We would like to share it with the ASF and begin developing a
> >> >community of developers and users within Apache.
> >> >
> >> >== Rationale ==
> >> >
> >> >Many organizations (>30) already benefit from running Airflow to
> >> >manage data pipelines. Our 100+ contributors continue to provide
> >> >integrations with 3rd party systems through the implementation of new
> >> >hooks and operators, both of which are used in defining the tasks that
> >> >compose workflows.
> >> >
> >> >== Current Status ==
> >> >
> >> >=== Meritocracy ===
> >> >
> >> >Our intent with this incubator proposal is to start building a diverse
> >> >developer community around Airflow following the Apache meritocracy
> >> >model. Since Airflow was open-sourced in mid-2015, we have had fast
> >> >adoption and contributions by multiple organizations the world over.
> >> >We plan to continue to support new contributors and we will work to
> >> >actively promote those who contribute significantly to the project to
> >> >committers.
> >> >
> >> >=== Community ===
> >> >
> >> >Airflow is currently being used in over 30 companies. We hope to
> >> >extend our contributor base significantly and invite all those who are
> >> >interested in building large-scale distributed systems to participate.
> >> >
> >> >=== Core Developers ===
> >> >
> >> >Airflow is currently being developed by four engineers: Maxime
> >> >Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> >> >Chris is a member of the Apache Samza PMC and a contributor to various
> >> >Apache projects, including Apache Kafka and Apache YARN. Maxime,
> >> >Siddharth, and Bolke have contributed to Airflow.
> >> >
> >> >=== Alignment ===
> >> >The ASF is the natural choice to host the Airflow project as its goal
> >> >of encouraging community-driven open-source projects fits with our
> >> >vision for Airflow.
> >> >
> >> >== Known Risks ==
> >> >
> >> >=== Orphaned Products ===
> >> >
> >> >The core developers plan to work part time on the project. There is
> >> >very little risk of Airflow being abandoned as all of our companies
> >> >rely on it.
> >> >
> >> >=== Inexperience with Open Source ===
>

Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Felix Cheung
+1 (non-binding)

On Fri, Mar 25, 2016 at 1:41 PM moon soo Lee  wrote:

> +1 (non-binding)
>
> On Fri, Mar 25, 2016 at 1:05 PM Timothy Chen  wrote:
>
> > +1 (non-binding)
> >
> > Tim
> >
> > On Fri, Mar 25, 2016 at 9:37 AM, Arthur Wiedmer
> >  wrote:
> > > +1 (non-binding)
> > >
> > > Regards,
> > > Arthur
> > > On Mar 25, 2016 9:27 AM, "Chris Nauroth" 
> > wrote:
> > >
> > >> +1 (binding)
> > >>
> > >> --Chris Nauroth
> > >>
> > >>
> > >>
> > >>
> > >> On 3/24/16, 8:00 PM, "Siddharth Anand" 
> > wrote:
> > >>
> > >> >Following the discussion earlier:
> > >> >https://s.apache.org/AirflowDiscussion
> > >> >
> > >> >I would like to call a VOTE for accepting Airflow as a new incubator
> > >> >project.
> > >> >
> > >> >The proposal is available at:
> > >> >https://wiki.apache.org/incubator/AirflowProposal
> > >> >
> > >> >The proposal is also included at the bottom of this email.
> > >> >
> > >> >Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> > >> >[ ] +1 accept Airflow into the Apache Incubator
> > >> >[ ] ±0
> > >> >[ ] -1 because...
> > >> >
> > >> >+1 (non-binding)
> > >> >
> > >> >Thanks,
> > >> >-s (Sid)
> > >> >
> > >> >
> > >> >== Abstract ==
> > >> >
> > >> >Airflow is a workflow automation and scheduling system that can be
> > >> >used to author and manage data pipelines.
> > >> >
> > >> >== Proposal ==
> > >> >
> > >> >Airflow provides a system for authoring and managing workflows a.k.a.
> > >> >data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> > >> >authors DAGs in Python using an Airflow-provided framework. He/She
> > >> >then executes the DAG using Airflow¹s scheduler or registers the DAG
> > >> >for event-based execution. A web-based UI provides the developer with
> > >> >a range of options for managing and viewing his/her data pipelines.
> > >> >Background
> > >> >
> > >> >Airflow was developed at Airbnb to enable easier authorship and
> > >> >management of DAGs than were possible with existing solutions such as
> > >> >Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one
> or
> > >> >more XML or property files to be bundled together to define a
> > >> >workflow. This separation of code and config can present a challenge
> > >> >to understanding the DAG - in Azkaban, a DAG¹s structure is reflected
> > >> >by its file system tree and one can find himself/herself traversing
> > >> >the file system when inspecting or changing the structure of the DAG.
> > >> >Airflow workflows, on the other hand, are simply and elegantly
> defined
> > >> >in Python code, often a single file. Airflow merges the powerful
> > >> >Web-based management aspects of projects like Azkaban and Oozie with
> > >> >the simplicity and elegance of defining workflows in Python. Airflow,
> > >> >less than a year old in terms of its Open Source launch, is currently
> > >> >used in production environments in more than 30 companies and boasts
> > >> >an active contributor list of more than 100 developers, the vast
> > >> >majority of which (>95%) are outside of Airbnb.
> > >> >
> > >> >We would like to share it with the ASF and begin developing a
> > >> >community of developers and users within Apache.
> > >> >
> > >> >== Rationale ==
> > >> >
> > >> >Many organizations (>30) already benefit from running Airflow to
> > >> >manage data pipelines. Our 100+ contributors continue to provide
> > >> >integrations with 3rd party systems through the implementation of new
> > >> >hooks and operators, both of which are used in defining the tasks
> that
> > >> >compose workflows.
> > >> >
> > >> >== Current Status ==
> > >> >
> > >> >=== Meritocracy ===
> > >> >
> > >> >Our intent with this incubator proposal is to start building a
> diverse
> > >> >developer community around Airflow following the Apache meritocracy
> > >> >model. Since Airflow was open-sourced in mid-2015, we have had fast
> > >> >adoption and contributions by multiple organizations the world over.
> > >> >We plan to continue to support new contributors and we will work to
> > >> >actively promote those who contribute significantly to the project to
> > >> >committers.
> > >> >
> > >> >=== Community ===
> > >> >
> > >> >Airflow is currently being used in over 30 companies. We hope to
> > >> >extend our contributor base significantly and invite all those who
> are
> > >> >interested in building large-scale distributed systems to
> participate.
> > >> >
> > >> >=== Core Developers ===
> > >> >
> > >> >Airflow is currently being developed by four engineers: Maxime
> > >> >Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> > >> >Chris is a member of the Apache Samza PMC and a contributor to
> various
> > >> >Apache projects, including Apache Kafka and Apache YARN. Maxime,
> > >> >Siddharth, and Bolke have contributed to Airflow.
> > >> >
> > >> >=== Alignment ===
> > >> >The ASF is the natural choice to host the Airflow project as its goal
> > >> >of encouraging community-driven open-source projects fits with

Re: [VOTE] Accept Airflow into the Incubator

2016-03-26 Thread Stian Soiland-Reyes
+1 (non-binding)
On 25 Mar 2016 03:00, "Siddharth Anand"  wrote:

> Following the discussion earlier:
> https://s.apache.org/AirflowDiscussion
>
> I would like to call a VOTE for accepting Airflow as a new incubator
> project.
>
> The proposal is available at:
> https://wiki.apache.org/incubator/AirflowProposal
>
> The proposal is also included at the bottom of this email.
>
> Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> [ ] +1 accept Airflow into the Apache Incubator
> [ ] ±0
> [ ] -1 because...
>
> +1 (non-binding)
>
> Thanks,
> -s (Sid)
>
>
> == Abstract ==
>
> Airflow is a workflow automation and scheduling system that can be
> used to author and manage data pipelines.
>
> == Proposal ==
>
> Airflow provides a system for authoring and managing workflows a.k.a.
> data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> authors DAGs in Python using an Airflow-provided framework. He/She
> then executes the DAG using Airflow’s scheduler or registers the DAG
> for event-based execution. A web-based UI provides the developer with
> a range of options for managing and viewing his/her data pipelines.
> Background
>
> Airflow was developed at Airbnb to enable easier authorship and
> management of DAGs than were possible with existing solutions such as
> Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> more XML or property files to be bundled together to define a
> workflow. This separation of code and config can present a challenge
> to understanding the DAG - in Azkaban, a DAG’s structure is reflected
> by its file system tree and one can find himself/herself traversing
> the file system when inspecting or changing the structure of the DAG.
> Airflow workflows, on the other hand, are simply and elegantly defined
> in Python code, often a single file. Airflow merges the powerful
> Web-based management aspects of projects like Azkaban and Oozie with
> the simplicity and elegance of defining workflows in Python. Airflow,
> less than a year old in terms of its Open Source launch, is currently
> used in production environments in more than 30 companies and boasts
> an active contributor list of more than 100 developers, the vast
> majority of which (>95%) are outside of Airbnb.
>
> We would like to share it with the ASF and begin developing a
> community of developers and users within Apache.
>
> == Rationale ==
>
> Many organizations (>30) already benefit from running Airflow to
> manage data pipelines. Our 100+ contributors continue to provide
> integrations with 3rd party systems through the implementation of new
> hooks and operators, both of which are used in defining the tasks that
> compose workflows.
>
> == Current Status ==
>
> === Meritocracy ===
>
> Our intent with this incubator proposal is to start building a diverse
> developer community around Airflow following the Apache meritocracy
> model. Since Airflow was open-sourced in mid-2015, we have had fast
> adoption and contributions by multiple organizations the world over.
> We plan to continue to support new contributors and we will work to
> actively promote those who contribute significantly to the project to
> committers.
>
> === Community ===
>
> Airflow is currently being used in over 30 companies. We hope to
> extend our contributor base significantly and invite all those who are
> interested in building large-scale distributed systems to participate.
>
> === Core Developers ===
>
> Airflow is currently being developed by four engineers: Maxime
> Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> Chris is a member of the Apache Samza PMC and a contributor to various
> Apache projects, including Apache Kafka and Apache YARN. Maxime,
> Siddharth, and Bolke have contributed to Airflow.
>
> === Alignment ===
> The ASF is the natural choice to host the Airflow project as its goal
> of encouraging community-driven open-source projects fits with our
> vision for Airflow.
>
> == Known Risks ==
>
> === Orphaned Products ===
>
> The core developers plan to work part time on the project. There is
> very little risk of Airflow being abandoned as all of our companies
> rely on it.
>
> === Inexperience with Open Source ===
>
> All of the core developers have experience with open source
> development. Chris is a member of the Apache Samza PMC and a
> contributor to various Apache projects, including Apache Kafka and
> Apache YARN. Bolke is contributor on multiple open source projects and
> a few Apache projects as well, including Apache Hive, Apache Hadoop,
> and Apache Ranger.
>
> === Homogeneous Developers ===
>
> The current core developers are all from different companies. Our
> community of 100 contributors hail from over 30 different companies
> from across the world.
>
> === Reliance on Salaried Developers ===
>
> Currently, the only developer paid to work on this project is Maxime.
>
> === Relationships with Other Apache Products ===
>
> Airflow is deeply integrated with Apac

AirPal to Apache? Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Adunuthula, Seshu
Any plans from AirBnB to bring AirPal to Apache?



On 3/24/16, 8:00 PM, "Siddharth Anand"  wrote:

>Following the discussion earlier:
>https://s.apache.org/AirflowDiscussion
>
>I would like to call a VOTE for accepting Airflow as a new incubator
>project.
>
>The proposal is available at:
>https://wiki.apache.org/incubator/AirflowProposal
>
>The proposal is also included at the bottom of this email.
>
>Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
>[ ] +1 accept Airflow into the Apache Incubator
>[ ] ±0
>[ ] -1 because...
>
>+1 (non-binding)
>
>Thanks,
>-s (Sid)
>
>
>== Abstract ==
>
>Airflow is a workflow automation and scheduling system that can be
>used to author and manage data pipelines.
>
>== Proposal ==
>
>Airflow provides a system for authoring and managing workflows a.k.a.
>data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
>authors DAGs in Python using an Airflow-provided framework. He/She
>then executes the DAG using Airflow¹s scheduler or registers the DAG
>for event-based execution. A web-based UI provides the developer with
>a range of options for managing and viewing his/her data pipelines.
>Background
>
>Airflow was developed at Airbnb to enable easier authorship and
>management of DAGs than were possible with existing solutions such as
>Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
>more XML or property files to be bundled together to define a
>workflow. This separation of code and config can present a challenge
>to understanding the DAG - in Azkaban, a DAG¹s structure is reflected
>by its file system tree and one can find himself/herself traversing
>the file system when inspecting or changing the structure of the DAG.
>Airflow workflows, on the other hand, are simply and elegantly defined
>in Python code, often a single file. Airflow merges the powerful
>Web-based management aspects of projects like Azkaban and Oozie with
>the simplicity and elegance of defining workflows in Python. Airflow,
>less than a year old in terms of its Open Source launch, is currently
>used in production environments in more than 30 companies and boasts
>an active contributor list of more than 100 developers, the vast
>majority of which (>95%) are outside of Airbnb.
>
>We would like to share it with the ASF and begin developing a
>community of developers and users within Apache.
>
>== Rationale ==
>
>Many organizations (>30) already benefit from running Airflow to
>manage data pipelines. Our 100+ contributors continue to provide
>integrations with 3rd party systems through the implementation of new
>hooks and operators, both of which are used in defining the tasks that
>compose workflows.
>
>== Current Status ==
>
>=== Meritocracy ===
>
>Our intent with this incubator proposal is to start building a diverse
>developer community around Airflow following the Apache meritocracy
>model. Since Airflow was open-sourced in mid-2015, we have had fast
>adoption and contributions by multiple organizations the world over.
>We plan to continue to support new contributors and we will work to
>actively promote those who contribute significantly to the project to
>committers.
>
>=== Community ===
>
>Airflow is currently being used in over 30 companies. We hope to
>extend our contributor base significantly and invite all those who are
>interested in building large-scale distributed systems to participate.
>
>=== Core Developers ===
>
>Airflow is currently being developed by four engineers: Maxime
>Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
>Chris is a member of the Apache Samza PMC and a contributor to various
>Apache projects, including Apache Kafka and Apache YARN. Maxime,
>Siddharth, and Bolke have contributed to Airflow.
>
>=== Alignment ===
>The ASF is the natural choice to host the Airflow project as its goal
>of encouraging community-driven open-source projects fits with our
>vision for Airflow.
>
>== Known Risks ==
>
>=== Orphaned Products ===
>
>The core developers plan to work part time on the project. There is
>very little risk of Airflow being abandoned as all of our companies
>rely on it.
>
>=== Inexperience with Open Source ===
>
>All of the core developers have experience with open source
>development. Chris is a member of the Apache Samza PMC and a
>contributor to various Apache projects, including Apache Kafka and
>Apache YARN. Bolke is contributor on multiple open source projects and
>a few Apache projects as well, including Apache Hive, Apache Hadoop,
>and Apache Ranger.
>
>=== Homogeneous Developers ===
>
>The current core developers are all from different companies. Our
>community of 100 contributors hail from over 30 different companies
>from across the world.
>
>=== Reliance on Salaried Developers ===
>
>Currently, the only developer paid to work on this project is Maxime.
>
>=== Relationships with Other Apache Products ===
>
>Airflow is deeply integrated with Apache products. It currently
>provides hooks and operators to

Re: AirPal to Apache? Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread John D. Ament
You would need to ask AirBnB.

John

On Fri, Mar 25, 2016 at 9:20 AM Adunuthula, Seshu 
wrote:

> Any plans from AirBnB to bring AirPal to Apache?
>
>
>
> On 3/24/16, 8:00 PM, "Siddharth Anand"  wrote:
>
> >Following the discussion earlier:
> >https://s.apache.org/AirflowDiscussion
> >
> >I would like to call a VOTE for accepting Airflow as a new incubator
> >project.
> >
> >The proposal is available at:
> >https://wiki.apache.org/incubator/AirflowProposal
> >
> >The proposal is also included at the bottom of this email.
> >
> >Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
> >[ ] +1 accept Airflow into the Apache Incubator
> >[ ] ±0
> >[ ] -1 because...
> >
> >+1 (non-binding)
> >
> >Thanks,
> >-s (Sid)
> >
> >
> >== Abstract ==
> >
> >Airflow is a workflow automation and scheduling system that can be
> >used to author and manage data pipelines.
> >
> >== Proposal ==
> >
> >Airflow provides a system for authoring and managing workflows a.k.a.
> >data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
> >authors DAGs in Python using an Airflow-provided framework. He/She
> >then executes the DAG using Airflow¹s scheduler or registers the DAG
> >for event-based execution. A web-based UI provides the developer with
> >a range of options for managing and viewing his/her data pipelines.
> >Background
> >
> >Airflow was developed at Airbnb to enable easier authorship and
> >management of DAGs than were possible with existing solutions such as
> >Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
> >more XML or property files to be bundled together to define a
> >workflow. This separation of code and config can present a challenge
> >to understanding the DAG - in Azkaban, a DAG¹s structure is reflected
> >by its file system tree and one can find himself/herself traversing
> >the file system when inspecting or changing the structure of the DAG.
> >Airflow workflows, on the other hand, are simply and elegantly defined
> >in Python code, often a single file. Airflow merges the powerful
> >Web-based management aspects of projects like Azkaban and Oozie with
> >the simplicity and elegance of defining workflows in Python. Airflow,
> >less than a year old in terms of its Open Source launch, is currently
> >used in production environments in more than 30 companies and boasts
> >an active contributor list of more than 100 developers, the vast
> >majority of which (>95%) are outside of Airbnb.
> >
> >We would like to share it with the ASF and begin developing a
> >community of developers and users within Apache.
> >
> >== Rationale ==
> >
> >Many organizations (>30) already benefit from running Airflow to
> >manage data pipelines. Our 100+ contributors continue to provide
> >integrations with 3rd party systems through the implementation of new
> >hooks and operators, both of which are used in defining the tasks that
> >compose workflows.
> >
> >== Current Status ==
> >
> >=== Meritocracy ===
> >
> >Our intent with this incubator proposal is to start building a diverse
> >developer community around Airflow following the Apache meritocracy
> >model. Since Airflow was open-sourced in mid-2015, we have had fast
> >adoption and contributions by multiple organizations the world over.
> >We plan to continue to support new contributors and we will work to
> >actively promote those who contribute significantly to the project to
> >committers.
> >
> >=== Community ===
> >
> >Airflow is currently being used in over 30 companies. We hope to
> >extend our contributor base significantly and invite all those who are
> >interested in building large-scale distributed systems to participate.
> >
> >=== Core Developers ===
> >
> >Airflow is currently being developed by four engineers: Maxime
> >Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
> >Chris is a member of the Apache Samza PMC and a contributor to various
> >Apache projects, including Apache Kafka and Apache YARN. Maxime,
> >Siddharth, and Bolke have contributed to Airflow.
> >
> >=== Alignment ===
> >The ASF is the natural choice to host the Airflow project as its goal
> >of encouraging community-driven open-source projects fits with our
> >vision for Airflow.
> >
> >== Known Risks ==
> >
> >=== Orphaned Products ===
> >
> >The core developers plan to work part time on the project. There is
> >very little risk of Airflow being abandoned as all of our companies
> >rely on it.
> >
> >=== Inexperience with Open Source ===
> >
> >All of the core developers have experience with open source
> >development. Chris is a member of the Apache Samza PMC and a
> >contributor to various Apache projects, including Apache Kafka and
> >Apache YARN. Bolke is contributor on multiple open source projects and
> >a few Apache projects as well, including Apache Hive, Apache Hadoop,
> >and Apache Ranger.
> >
> >=== Homogeneous Developers ===
> >
> >The current core developers are all from different companies. Our
> >community of 100 cont

Re: AirPal to Apache? Re: [VOTE] Accept Airflow into the Incubator

2016-03-25 Thread Adunuthula, Seshu
I am hoping there are a few AirBnB guys on this DL, though its dependency
on Presto could prohibit it becoming an Apache project…


On 3/25/16, 7:38 AM, "John D. Ament"  wrote:

>You would need to ask AirBnB.
>
>John
>
>On Fri, Mar 25, 2016 at 9:20 AM Adunuthula, Seshu 
>wrote:
>
>> Any plans from AirBnB to bring AirPal to Apache?
>>
>>
>>
>> On 3/24/16, 8:00 PM, "Siddharth Anand"  wrote:
>>
>> >Following the discussion earlier:
>> >https://s.apache.org/AirflowDiscussion
>> >
>> >I would like to call a VOTE for accepting Airflow as a new incubator
>> >project.
>> >
>> >The proposal is available at:
>> >https://wiki.apache.org/incubator/AirflowProposal
>> >
>> >The proposal is also included at the bottom of this email.
>> >
>> >Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT
>> >[ ] +1 accept Airflow into the Apache Incubator
>> >[ ] ±0
>> >[ ] -1 because...
>> >
>> >+1 (non-binding)
>> >
>> >Thanks,
>> >-s (Sid)
>> >
>> >
>> >== Abstract ==
>> >
>> >Airflow is a workflow automation and scheduling system that can be
>> >used to author and manage data pipelines.
>> >
>> >== Proposal ==
>> >
>> >Airflow provides a system for authoring and managing workflows a.k.a.
>> >data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer
>> >authors DAGs in Python using an Airflow-provided framework. He/She
>> >then executes the DAG using Airflow¹s scheduler or registers the DAG
>> >for event-based execution. A web-based UI provides the developer with
>> >a range of options for managing and viewing his/her data pipelines.
>> >Background
>> >
>> >Airflow was developed at Airbnb to enable easier authorship and
>> >management of DAGs than were possible with existing solutions such as
>> >Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or
>> >more XML or property files to be bundled together to define a
>> >workflow. This separation of code and config can present a challenge
>> >to understanding the DAG - in Azkaban, a DAG¹s structure is reflected
>> >by its file system tree and one can find himself/herself traversing
>> >the file system when inspecting or changing the structure of the DAG.
>> >Airflow workflows, on the other hand, are simply and elegantly defined
>> >in Python code, often a single file. Airflow merges the powerful
>> >Web-based management aspects of projects like Azkaban and Oozie with
>> >the simplicity and elegance of defining workflows in Python. Airflow,
>> >less than a year old in terms of its Open Source launch, is currently
>> >used in production environments in more than 30 companies and boasts
>> >an active contributor list of more than 100 developers, the vast
>> >majority of which (>95%) are outside of Airbnb.
>> >
>> >We would like to share it with the ASF and begin developing a
>> >community of developers and users within Apache.
>> >
>> >== Rationale ==
>> >
>> >Many organizations (>30) already benefit from running Airflow to
>> >manage data pipelines. Our 100+ contributors continue to provide
>> >integrations with 3rd party systems through the implementation of new
>> >hooks and operators, both of which are used in defining the tasks that
>> >compose workflows.
>> >
>> >== Current Status ==
>> >
>> >=== Meritocracy ===
>> >
>> >Our intent with this incubator proposal is to start building a diverse
>> >developer community around Airflow following the Apache meritocracy
>> >model. Since Airflow was open-sourced in mid-2015, we have had fast
>> >adoption and contributions by multiple organizations the world over.
>> >We plan to continue to support new contributors and we will work to
>> >actively promote those who contribute significantly to the project to
>> >committers.
>> >
>> >=== Community ===
>> >
>> >Airflow is currently being used in over 30 companies. We hope to
>> >extend our contributor base significantly and invite all those who are
>> >interested in building large-scale distributed systems to participate.
>> >
>> >=== Core Developers ===
>> >
>> >Airflow is currently being developed by four engineers: Maxime
>> >Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini.
>> >Chris is a member of the Apache Samza PMC and a contributor to various
>> >Apache projects, including Apache Kafka and Apache YARN. Maxime,
>> >Siddharth, and Bolke have contributed to Airflow.
>> >
>> >=== Alignment ===
>> >The ASF is the natural choice to host the Airflow project as its goal
>> >of encouraging community-driven open-source projects fits with our
>> >vision for Airflow.
>> >
>> >== Known Risks ==
>> >
>> >=== Orphaned Products ===
>> >
>> >The core developers plan to work part time on the project. There is
>> >very little risk of Airflow being abandoned as all of our companies
>> >rely on it.
>> >
>> >=== Inexperience with Open Source ===
>> >
>> >All of the core developers have experience with open source
>> >development. Chris is a member of the Apache Samza PMC and a
>> >contributor to various Apache projects, including Apache Kafka and
>>