Hi Gabor,

Are Python UDFs (user-defined functions) something that might work for you? If all you really need to write in Python is your DoFn, this is probably your best option. It is still a bit of work, but we support Java UDFs today, so all you would need to do is write a Java wrapper that calls your Python function.
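To make the wrapper idea a bit more concrete, here is a minimal sketch of the Python side only. It assumes the Java UDF wrapper starts a Python process and exchanges one value per line over stdin/stdout. That line protocol and the names `normalize_name` and `serve` are hypothetical, chosen just for illustration; none of this is an existing Beam API.

```python
import io
import sys


def normalize_name(value: str) -> str:
    """Hypothetical user function: the logic you want to keep in Python."""
    return " ".join(part.capitalize() for part in value.split())


def serve(func, stdin=sys.stdin, stdout=sys.stdout) -> None:
    """Apply func to each input line and write one result line per input.

    Assumed protocol (not a Beam API): the Java wrapper writes one value per
    line and reads back exactly one line per value.
    """
    for line in stdin:
        stdout.write(func(line.rstrip("\n")) + "\n")
        stdout.flush()  # flush per record so the caller is not left waiting


# Local demo: in-memory streams stand in for the pipes to the Java process.
out = io.StringIO()
serve(normalize_name, io.StringIO("gabor hermann\nANDREW\n"), out)
print(out.getvalue(), end="")
```

In a real script you would call `serve(normalize_name)` with the default streams, and the Java wrapper would own the process lifecycle. Starting one process per UDF instance rather than per record is probably the first thing to get right, since process startup is expensive.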
Andrew

On Fri, Apr 13, 2018, 7:58 AM Kenneth Knowles <k...@google.com> wrote:

> The most recent work on cross-language pipeline authoring is the design
> brainstorming at https://s.apache.org/beam-mixed-language-pipelines so it
> is still in the preliminary stages. There's no basic mystery, but there are
> a lot of practical considerations about what is easy to run on a pipeline
> author's machine.
>
> Regarding Apache Calcite - it is a Java library. It doesn't really make
> sense to bind it to Python. Today we don't use most of its capabilities. We
> just use it as a parser mostly. It would be easy to find an existing parser
> in Python or write your own (with ply, the basics could be done within a
> day). But still I don't think it makes sense to reimplement and maintain
> the SQL-to-Beam translation in multiple languages.
>
> Kenn
>
> On Fri, Apr 13, 2018 at 2:43 AM Reuven Lax <re...@google.com> wrote:
>
>> If someone implemented it directly in Python then it would be supported
>> directly in Python. I don't know if anyone is actively working on that -
>> the current implementation uses Apache Calcite, and I don't know whether
>> they have a Python API.
>>
>> On Fri, Apr 13, 2018 at 9:40 AM Prabeesh K. <prabsma...@gmail.com> wrote:
>>
>>> What about supporting SQL in Python SDK?
>>>
>>> On 13 April 2018 at 13:32, Reuven Lax <re...@google.com> wrote:
>>>
>>>> The portability work will allow the Python and Java SDKs to be used in
>>>> the same pipeline, though this work is not yet complete.
>>>>
>>> This would be an interesting feature.
>>>
>>>> On Fri, Apr 13, 2018 at 9:15 AM Gabor Hermann <m...@gaborhermann.com> wrote:
>>>>
>>>>> Hey all,
>>>>>
>>>>> Are there any efforts towards supporting SQL from the Python SDK, not
>>>>> just from Java? I couldn't find any info about this in JIRA or mailing
>>>>> lists.
>>>>>
>>>>> How much effort do you think it would take to implement this? Are there
>>>>> some dependencies like supporting more features in Python? I know that
>>>>> the Python SDK is experimental.
>>>>>
>>>>> As an alternative, is there a way to combine Python and Java SDKs in the
>>>>> same pipeline?
>>>>>
>>>>> Thanks for your answers in advance!
>>>>>
>>>>> Cheers,
>>>>> Gabor
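
On Kenn's point that the parsing basics are easy to do in Python: the sketch below shows roughly how small the front end can be for a toy subset. It deliberately uses the standard-library `re` module instead of ply, so it is a pattern-based recognizer rather than a real grammar-driven parser. The supported subset (`SELECT ... FROM ... [WHERE col op literal]`) and the names `Query` and `parse` are assumptions made for illustration; this is not the grammar Beam SQL or Calcite accepts, and it says nothing about the SQL-to-Beam translation, which is the expensive part.

```python
import re
from typing import NamedTuple, Optional

# Assumed toy grammar: SELECT <cols> FROM <table> [WHERE <col> <op> <literal>]
_QUERY = re.compile(
    r"""^\s*SELECT\s+(?P<cols>\*|\w+(?:\s*,\s*\w+)*)
        \s+FROM\s+(?P<table>\w+)
        (?:\s+WHERE\s+(?P<col>\w+)\s*(?P<op>=|<>|<=|>=|<|>)\s*(?P<lit>'[^']*'|\d+))?
        \s*;?\s*$""",
    re.IGNORECASE | re.VERBOSE,
)


class Query(NamedTuple):
    columns: list
    table: str
    where: Optional[tuple]  # (column, operator, literal), or None if absent


def parse(sql: str) -> Query:
    """Parse the toy subset above; raise ValueError for anything else."""
    match = _QUERY.match(sql)
    if match is None:
        raise ValueError(f"unsupported query: {sql!r}")
    columns = [c.strip() for c in match.group("cols").split(",")]
    where = None
    if match.group("col") is not None:
        lit = match.group("lit")
        # Quoted literals become str, bare digits become int.
        value = lit[1:-1] if lit.startswith("'") else int(lit)
        where = (match.group("col"), match.group("op"), value)
    return Query(columns, match.group("table"), where)


print(parse("SELECT name, age FROM users WHERE age >= 21"))
# Query(columns=['name', 'age'], table='users', where=('age', '>=', 21))
```

Anything beyond this subset (joins, expressions, nested queries) is where a grammar-based tool such as ply would start to pay off.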