Mike Sukmanowsky created PIG-4225:
-------------------------------------

             Summary: Allow users to specify Python executable for Pig streaming
                 Key: PIG-4225
                 URL: https://issues.apache.org/jira/browse/PIG-4225
             Project: Pig
          Issue Type: Improvement
          Components: internal-udfs
    Affects Versions: 0.12.0, 0.12.1
            Reporter: Mike Sukmanowsky


The current 
[PythonScriptEngine|https://github.com/apache/pig/blob/release-0.12.0/src/org/apache/pig/scripting/streaming/python/PythonScriptEngine.java#L69]
uses whatever {{python}} is currently on the path to execute scripts.

Python users are accustomed to creating virtual environments (virtualenvs) 
in which a project's requirements are installed in isolation, without 
resorting to "global" installs via, for example, 
{{sudo pip install <some package>}}.

Would it be possible to specify the Python executable either via the 
{{DEFINE}} command syntax or in a Hadoop job configuration variable, perhaps 
{{pig.pythonstreaming.pythonpath}}?
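
To make the request concrete, here is a rough sketch of the lookup being 
asked for, written in Python only for brevity (the real engine is Java). The 
property name is the one suggested above and does not exist in Pig today; 
the function names and the plain dict standing in for the job configuration 
are hypothetical, not Pig APIs. An unset or blank property falls back to 
plain {{python}}, which is the current behaviour.

```python
# Hypothetical property name taken from the suggestion in this issue;
# Pig 0.12 does not define it.
PYTHON_EXECUTABLE_KEY = "pig.pythonstreaming.pythonpath"

# Current behaviour: run whatever "python" is found on the PATH.
DEFAULT_PYTHON = "python"


def resolve_python_executable(job_conf):
    """Return the configured interpreter, or fall back to plain "python".

    job_conf is a plain dict standing in for the Hadoop job configuration.
    """
    configured = job_conf.get(PYTHON_EXECUTABLE_KEY, "").strip()
    return configured or DEFAULT_PYTHON


def build_streaming_command(job_conf, controller_script):
    """Build the argv list that would launch the streaming script."""
    return [resolve_python_executable(job_conf), controller_script]
```

With the property set to a virtualenv interpreter such as 
{{/opt/venvs/etl/bin/python}} (an illustrative path), the command would start 
with that interpreter; with the property unset, nothing changes for existing 
users.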



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
