Lei (Eddy) Xu created BEAM-6296:
-----------------------------------

             Summary: Support Python Spark Runner
                 Key: BEAM-6296
                 URL: https://issues.apache.org/jira/browse/BEAM-6296
             Project: Beam
          Issue Type: New Feature
          Components: runner-spark
    Affects Versions: 2.9.0
            Reporter: Lei (Eddy) Xu
            Assignee: Amit Sela


Hello, everyone,

It would be great to have the Spark runner available to the Python SDK.

While we are happy running Apache Beam on Dataflow, a few of our use cases
require different dependencies and OS environments, which makes a self-managed
Spark cluster a more appropriate place to run them. With a Spark runner for
the Python SDK, we would have the option of defining all of our data pipelines
in a single language.

I would like to hear the community's feedback on this feature.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
