[ https://issues.apache.org/jira/browse/BAHIR-35?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15385706#comment-15385706 ]
Steve Loughran commented on BAHIR-35: ------------------------------------- What's bing proposed here. Just adding the bahir .py files into the binary .jar? > Include Python code in the binary jars for use with "--packages ..." > -------------------------------------------------------------------- > > Key: BAHIR-35 > URL: https://issues.apache.org/jira/browse/BAHIR-35 > Project: Bahir > Issue Type: Task > Components: Build > Affects Versions: 2.0.0 > Reporter: Christian Kadner > Original Estimate: 8h > Remaining Estimate: 8h > > Currently, to make use the PySpark code (i.e streaming-mqtt/python) a user > will have to download the jar from Maven central or clone the code from > GitHub and then have to find individual *.py files, create a zip and add that > to the {{spark-submit}} command with the {{--py-files}} option, or, add them > to the {{PYTHONPATH}} when running locally. > If we include the Python code in the binary build (to the jar that gets > uploaded to Maven central), then users need not do any acrobatics besides > using the {{--packages ...}} option. > An example where the Python code is part of the binary jar is the > [GraphFrames|https://spark-packages.org/package/graphframes/graphframes] > package. -- This message was sent by Atlassian JIRA (v6.3.4#6332)