Re: Generic Transfer Operator

2020-09-11 Thread Austin Bennett
ely use-cases for both. I’m totally > fine > > > > >> >with saying “for small/medium use-cases, we come with an in-house > > > > >> >system. However for larger cases, you’ll require spark/Flink/S3. > > > > >That’s > > > > >

Re: Generic Transfer Operator

2020-09-11 Thread Kamil Olszewski
ystem. However for larger cases, you’ll require spark/Flink/S3. > > > >That’s > > > >> >totally in line with PLENTY of use-cases. This would be especially > > > >cool > > > >> >when matched with fast-follow as we could EVEN potentially

Re: Generic Transfer Operator

2020-09-07 Thread Daniel Imberman
t;> > > >> >> via Newton Mail > > >> > > >>[ > > https://cloudmagic.com/k/d/mailapp?ct=dx&cv=10.0.50&pv=10.15.6&source=email_footer_2 > > ] > > >> >> On Sat, Sep 5, 2020 at 5:11 PM, Austin Ben

Re: Generic Transfer Operator

2020-09-06 Thread Tomasz Urbaszek
gt;> via Newton Mail > > >> > > >>[ > > https://cloudmagic.com/k/d/mailapp?ct=dx&cv=10.0.50&pv=10.15.6&source=email_footer_2 > > ] > > >> >> On Sat, Sep 5, 2020 at 5:11 PM, Austin Bennett > > >> > wrote: > > >&g

Re: Generic Transfer Operator

2020-09-06 Thread Daniel Imberman
t; >> >> On Sat, Sep 5, 2020 at 5:11 PM, Austin Bennett > >> > wrote: > >> >> I believe - for not large data - the direct runner is wholly > >doable, > >> >which > >> >> seems in line with airflow patterns.

Re: Generic Transfer Operator

2020-09-06 Thread Ash Berlin-Taylor
nner is wholly >doable, >> >which >> >> seems in line with airflow patterns. I have, and have spoken with >> >several >> >> others that have, been productive with that runner. >> >> >> >> For much larger transfers, the generic

Re: Generic Transfer Operator

2020-09-06 Thread Jarek Potiuk
s that have, been productive with that runner. > > >> > > >> For much larger transfers, the generic operator could accept > > >parameters for > > >> submitting the compute to an actual runner. Though, imagining that > > >> (needing a run

Re: Generic Transfer Operator

2020-09-06 Thread Tomasz Urbaszek
ductive with that runner. > >> > >> For much larger transfers, the generic operator could accept > >parameters for > >> submitting the compute to an actual runner. Though, imagining that > >> (needing a runner) would not be the primary use case for such an

Re: Generic Transfer Operator

2020-09-06 Thread Ash Berlin-Taylor
;operator. >> >> >> On Tue, Sep 1, 2020, 11:52 PM Tomasz Urbaszek >wrote: >> >> > Austin, you are right, Beam covers all (and more) important IOs. >> > However, using Apache Beam to design a generic transfer operator >> > requires Airflow users to h

Re: Generic Transfer Operator

2020-09-06 Thread Tomasz Urbaszek
uld accept parameters for > submitting the compute to an actual runner. Though, imagining that > (needing a runner) would not be the primary use case for such an operator. > > > On Tue, Sep 1, 2020, 11:52 PM Tomasz Urbaszek wrote: > > > Austin, you are right, Beam covers all

Re: Generic Transfer Operator

2020-09-05 Thread Daniel Imberman
would not be the primary use case for such an operator. On Tue, Sep 1, 2020, 11:52 PM Tomasz Urbaszek wrote: > Austin, you are right, Beam covers all (and more) important IOs. > However, using Apache Beam to design a generic transfer operator > requires Airflow users to have additiona

Re: Generic Transfer Operator

2020-09-05 Thread Austin Bennett
t; > Best, > Tomek > > > On Wed, Sep 2, 2020 at 12:37 AM Austin Bennett > wrote: > > > > Are there IOs that would be desired for a generic transfer operator that > > don't exist in: https://beam.apache.org/documentation/io/built-in/ <- > > th

Re: Generic Transfer Operator

2020-09-01 Thread Tomasz Urbaszek
Austin, you are right, Beam covers all (and more) important IOs. However, using Apache Beam to design a generic transfer operator requires Airflow users to have additional resources that will be used as a runner (Spark, Flink, etc.). Unless you suggest using DirectRunner? Can you please tell us

Re: Generic Transfer Operator

2020-09-01 Thread Austin Bennett
Are there IOs that would be desired for a generic transfer operator that don't exist in: https://beam.apache.org/documentation/io/built-in/ <- there is pretty solid coverage? Beam is getting to the point where even python beam can leverage the java IOs, which increases the range of

Re: Generic Transfer Operator

2020-09-01 Thread Jarek Potiuk
mal" solution but once you >> admit >> > you >> > > > > are >> > > > > > > not >> > > > > > > > > going to focus for optimisation, you come with simpler and >> > easier >> > > &g

Re: Generic Transfer Operator

2020-09-01 Thread Jarek Potiuk
ow operator. > > > > > > > > Gerard Casas Saez > > > > Twitter | Cortex | @casassaez <http://twitter.com/casassaez> > > > > > > > > > > > > On Tue, Sep 1, 2020 at 12:44 PM Austin Bennett < > > > > whatwo

Re: Generic Transfer Operator

2020-09-01 Thread Gerard Casas Saez
> > > > > > > > > On Tue, Sep 1, 2020 at 12:44 PM Austin Bennett < > > > whatwouldausti...@gmail.com> > > > wrote: > > > > > > > Are you guys familiar with Beam <https://beam.apache.org>? Esp. if > not > > &

Re: Generic Transfer Operator

2020-09-01 Thread Tomasz Urbaszek
ter | Cortex | @casassaez <http://twitter.com/casassaez> > > > > > > On Tue, Sep 1, 2020 at 12:44 PM Austin Bennett < > > whatwouldausti...@gmail.com> > > wrote: > > > > > Are you guys familiar with Beam <https://beam.apache.org>? Esp. if

Re: Generic Transfer Operator

2020-09-01 Thread Kaxil Naik
might rather straightforward to rely on the > ecosystem > > of connectors in that Apache Project to use as the foundations for a > > generic transfer operator. > > > > On Tue, Sep 1, 2020 at 11:05 AM Jarek Potiuk > > wrote: > > > > > +1 > > > > &

Re: Generic Transfer Operator

2020-09-01 Thread Daniel Imberman
e ecosystem > of connectors in that Apache Project to use as the foundations for a > generic transfer operator. > > On Tue, Sep 1, 2020 at 11:05 AM Jarek Potiuk > wrote: > > > +1 > > > > On Tue, Sep 1, 2020 at 1:35 PM Kamil Olszewski < > > kamil.olszew...@

Re: Generic Transfer Operator

2020-09-01 Thread Gerard Casas Saez
Sep 1, 2020 at 12:44 PM Austin Bennett wrote: > Are you guys familiar with Beam <https://beam.apache.org>? Esp. if not > doing transforms, it might rather straightforward to rely on the ecosystem > of connectors in that Apache Project to use as the foundations for a > generic trans

Re: Generic Transfer Operator

2020-09-01 Thread Austin Bennett
Are you guys familiar with Beam <https://beam.apache.org>? Esp. if not doing transforms, it might rather straightforward to rely on the ecosystem of connectors in that Apache Project to use as the foundations for a generic transfer operator. On Tue, Sep 1, 2020 at 11:05 AM Jarek Potiuk

Re: Generic Transfer Operator

2020-09-01 Thread Jarek Potiuk
t; > services that Airflow should use (Airflow is an orchestrator, not data > > > processing solution) but for the kind of quick-start scenarios I > foresee > > it > > > might be most useful, being able to apply simple data transformation on > > the > > >

Re: Generic Transfer Operator

2020-09-01 Thread Kamil Olszewski
not data > > processing solution) but for the kind of quick-start scenarios I foresee > it > > might be most useful, being able to apply simple data transformation on > the > > fly by data scientist might be a big plus. > > > > Suggestion: Panda DataFrame as the fo

Re: Generic Transfer Operator

2020-08-27 Thread Tomasz Urbaszek
e as the format of the "data" component > > Kamil - you should have access now. > > J. > > > On Tue, Aug 18, 2020 at 6:53 PM Kamil Olszewski < > kamil.olszew...@polidea.com> > wrote: > > > Hello all, > > in Polidea we have come up with an idea for a

Re: Generic Transfer Operator

2020-08-19 Thread Jarek Potiuk
J. On Tue, Aug 18, 2020 at 6:53 PM Kamil Olszewski wrote: > Hello all, > in Polidea we have come up with an idea for a generic transfer operator > that would be able to transport data between two destinations of various > types (file, database, storage, etc.) - please find the l

Generic Transfer Operator

2020-08-18 Thread Kamil Olszewski
Hello all, in Polidea we have come up with an idea for a generic transfer operator that would be able to transport data between two destinations of various types (file, database, storage, etc.) - please find the link with a short doc with POC <https://docs.google.com/documen