[
https://issues.apache.org/jira/browse/DATAFU-155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17810299#comment-17810299
]
Eyal Allweil commented on DATAFU-155:
-------------------------------------
Looks like compilation for this is broken now (in Spark 3.1.x). But maybe
fixing it isn't too hard ...
{code:java}
... git/datafu/datafu-spark/src/main/scala/datafu/spark/SparkUDFs.scala:149:
type mismatch;
found : Seq[Boolean]
required:
Seq[Option[org.apache.spark.sql.catalyst.encoders.ExpressionEncoder[_]]]
ScalaUDF(func, dataType, e, inputsNullSafe, inputTypes)
^
... git/datafu/datafu-spark/src/main/scala/datafu/spark/SparkUDFs.scala:149:
type mismatch;
found : Seq[org.apache.spark.sql.types.DataType]
required: Option[org.apache.spark.sql.catalyst.encoders.ExpressionEncoder[_]]
ScalaUDF(func, dataType, e, inputsNullSafe, inputTypes)
^
two errors found{code}
> Ease Scala-Python API expoure
> -----------------------------
>
> Key: DATAFU-155
> URL: https://issues.apache.org/jira/browse/DATAFU-155
> Project: DataFu
> Issue Type: Bug
> Reporter: Ohad Raviv
> Priority: Minor
> Fix For: 2.1.0
>
>
> As one of the key features in datafu-spark is the integration between Scala
> and Python code, we should add a helper method to more easily expose Scala
> APIs in python.
> currently, the wrapper methods need to deal with some non-trivial conversions
> that py4j doesn't handle well as it is oriented to python-java conversions.
> so we want to extend it to cope with Scala collections, Option, etc. to save
> some boilerplate code in both the Scala side and Python side.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)