[ https://issues.apache.org/jira/browse/DATAFU-167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17622945#comment-17622945 ]
Arpit Bhardwaj commented on DATAFU-167: --------------------------------------- Hi [~eyal] Ok let me do a test with latest master. Will come back to you in case of any further question. > Fix Scala Python Bridge support in Spark 2 minor version updates > ----------------------------------------------------------------- > > Key: DATAFU-167 > URL: https://issues.apache.org/jira/browse/DATAFU-167 > Project: DataFu > Issue Type: Bug > Affects Versions: 1.6.1 > Environment: _emphasized text_ > Reporter: Eyal Allweil > Priority: Major > Labels: up-for-grabs > Fix For: 1.7.0 > > > The Scala Python Bridge, which works for versions 2.2.2, 2.3.2 and 2.4.3, > doesn't work for versions 2.2.3, 2.3.3, and 2.4.4 (and up). > This can be reproduced in testing by running the tests with the command > {code:bash} > ./gradlew :datafu-spark:test -PscalaVersion=2.11 -PsparkVersion=2.2.3 > -PscalaCompatVersion=2.11 > {code} > The error message is > {noformat} > AttributeError: 'GatewayParameters' object has no attribute 'auth_token' > at > org.apache.spark.datafu.deploy.SparkPythonRunner.execFile(SparkPythonRunner.scala:137) > {noformat} > > Currently our code runs using the PYSPARK_ALLOW_INSECURE_GATEWAY parameter; > it's possible/probable that using the auth_token parameter will both fix this > problem and be better in general. > > Please note that in order to test Spark 2.4.4, you need to upgrade the > scalatest version used. > > A description of using an auth token with py4j can be found here: > [https://www.py4j.org/advanced_topics.html#authentication] > > It's likely that (at least some of the) changes will need to be made here: > [https://github.com/apache/datafu/blob/master/datafu-spark/src/main/scala/spark/utils/overwrites/SparkPythonRunner.scala#L63] > [https://github.com/apache/datafu/blob/master/datafu-spark/src/main/resources/pyspark_utils/bridge_utils.py#L43] -- This message was sent by Atlassian Jira (v8.20.10#820010)