[jira] [Assigned] (SPARK-6716) Change SparkContext.DRIVER_IDENTIFIER from '<driver>' to 'driver'

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6716: --- Assignee: Apache Spark (was: Josh Rosen) Change SparkContext.DRIVER_IDENTIFIER from

[jira] [Commented] (SPARK-6577) SparseMatrix should be supported in PySpark

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14423720#comment-14423720 ] Xiangrui Meng commented on SPARK-6577: -- We don't require scipy in MLlib. But if scipy

[jira] [Resolved] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6334. -- Resolution: Duplicate I marked this one as duplicated as the solution will be provided by

[jira] [Commented] (SPARK-6227) PCA and SVD for PySpark

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14428301#comment-14428301 ] Xiangrui Meng commented on SPARK-6227: -- [~MeethuMathew] I agree with Joseph that we

[jira] [Assigned] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5832: --- Assignee: Apache Spark (was: Liang-Chi Hsieh) Add Affinity Propagation clustering

[jira] [Assigned] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5832: --- Assignee: Liang-Chi Hsieh (was: Apache Spark) Add Affinity Propagation clustering

[jira] [Updated] (SPARK-6717) Clear shuffle files after checkpointing in ALS

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6717: - Labels: als (was: ) Clear shuffle files after checkpointing in ALS

[jira] [Created] (SPARK-6717) Clear shuffle files after checkpointing in ALS

2015-04-05 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6717: Summary: Clear shuffle files after checkpointing in ALS Key: SPARK-6717 URL: https://issues.apache.org/jira/browse/SPARK-6717 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-6262) Python MLlib API missing items: Statistics

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6262. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5359

[jira] [Assigned] (SPARK-5941) `def table` is not using the unresolved logical plan `UnresolvedRelation`

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5941: --- Assignee: (was: Apache Spark) `def table` is not using the unresolved logical plan

[jira] [Updated] (SPARK-5941) Unit Test loads the table `src` twice for leftsemijoin.q

2015-04-05 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-5941: - Summary: Unit Test loads the table `src` twice for leftsemijoin.q (was: `def table` is not using the

[jira] [Issue Comment Deleted] (SPARK-5941) Unit Test loads the table `src` twice for leftsemijoin.q

2015-04-05 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-5941: - Comment: was deleted (was: Eagerly resolving the table probably causes side effect in some scenarios,

[jira] [Closed] (SPARK-6712) Allow lower the log level in YARN client while keeping AM tracking URL printed

2015-04-05 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park closed SPARK-6712. Resolution: Won't Fix Closing the jira as won't fix. See the discussion in the PR. Allow lower

[jira] [Commented] (SPARK-6716) Change SparkContext.DRIVER_IDENTIFIER from '<driver>' to 'driver'

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14401196#comment-14401196 ] Apache Spark commented on SPARK-6716: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-6716) Change SparkContext.DRIVER_IDENTIFIER from '<driver>' to 'driver'

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6716: --- Assignee: Josh Rosen (was: Apache Spark) Change SparkContext.DRIVER_IDENTIFIER from

[jira] [Updated] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-04-05 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-5261: --- Description: Get data: {code:none} normalize_text() { awk '{print tolower($0);}' | sed -e s/’/'/g

[jira] [Assigned] (SPARK-5941) `def table` is not using the unresolved logical plan `UnresolvedRelation`

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5941: --- Assignee: Apache Spark `def table` is not using the unresolved logical plan

[jira] [Updated] (SPARK-6209) ExecutorClassLoader can leak connections after failing to load classes from the REPL class server

2015-04-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6209: -- Fix Version/s: 1.2.2 I've merged the backport for 1.2.2. ExecutorClassLoader can leak connections

[jira] [Updated] (SPARK-6577) SparseMatrix should be supported in PySpark

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6577: - Assignee: Manoj Kumar SparseMatrix should be supported in PySpark

[jira] [Updated] (SPARK-6577) SparseMatrix should be supported in PySpark

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6577: - Target Version/s: 1.4.0 SparseMatrix should be supported in PySpark

[jira] [Commented] (SPARK-6407) Streaming ALS for Collaborative Filtering

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14423680#comment-14423680 ] Xiangrui Meng commented on SPARK-6407: -- Using ALS for online updates is expensive. I

[jira] [Created] (SPARK-6718) Improve the test on normL1/normL2 of summary statistics

2015-04-05 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6718: Summary: Improve the test on normL1/normL2 of summary statistics Key: SPARK-6718 URL: https://issues.apache.org/jira/browse/SPARK-6718 Project: Spark Issue

[jira] [Updated] (SPARK-6718) Improve the test on normL1/normL2 of summary statistics

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6718: - Priority: Minor (was: Major) Improve the test on normL1/normL2 of summary statistics

[jira] [Created] (SPARK-6716) Change SparkContext.DRIVER_IDENTIFIER from '<driver>' to 'driver'

2015-04-05 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-6716: - Summary: Change SparkContext.DRIVER_IDENTIFIER from '<driver>' to 'driver' Key: SPARK-6716 URL: https://issues.apache.org/jira/browse/SPARK-6716 Project: Spark

[jira] [Commented] (SPARK-3987) NNLS generates incorrect result

2015-04-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14429187#comment-14429187 ] Xiangrui Meng commented on SPARK-3987: -- [~coderxiang] [~debasish83] What is the

[jira] [Commented] (SPARK-3987) NNLS generates incorrect result

2015-04-05 Thread Shuo Xiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14441502#comment-14441502 ] Shuo Xiang commented on SPARK-3987: --- [~mengxr] I think it is done. NNLS generates

[jira] [Updated] (SPARK-5941) Unit Test loads the table `src` twice for leftsemijoin.q

2015-04-05 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-5941: - Description: In leftsemijoin.q, there is a data loading command for table sales already, but in TestHive,

[jira] [Updated] (SPARK-5941) Unit Test loads the table `src` twice for leftsemijoin.q

2015-04-05 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-5941: - Description: In leftsemijoin.q, there is a data loading command for table sales already, but in TestHive,

[jira] [Commented] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-04-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396112#comment-14396112 ] Sean Owen commented on SPARK-5261: -- I think they both come down to a minCount that is too

[jira] [Commented] (SPARK-6699) PySpark Acess Denied error in windows seen only in ver 1.3

2015-04-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396116#comment-14396116 ] Sean Owen commented on SPARK-6699: -- numpy is required:

[jira] [Commented] (SPARK-6593) Provide option for HadoopRDD to skip corrupted files

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396221#comment-14396221 ] Apache Spark commented on SPARK-6593: - User 'tigerquoll' has created a pull request

[jira] [Assigned] (SPARK-6715) Eliminate duplicate filters from pushdown predicates

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6715: --- Assignee: Apache Spark Eliminate duplicate filters from pushdown predicates

[jira] [Created] (SPARK-6715) Eliminate duplicate filters from pushdown predicates

2015-04-05 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-6715: -- Summary: Eliminate duplicate filters from pushdown predicates Key: SPARK-6715 URL: https://issues.apache.org/jira/browse/SPARK-6715 Project: Spark Issue

[jira] [Assigned] (SPARK-6715) Eliminate duplicate filters from pushdown predicates

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6715: --- Assignee: (was: Apache Spark) Eliminate duplicate filters from pushdown predicates

[jira] [Commented] (SPARK-6715) Eliminate duplicate filters from pushdown predicates

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396239#comment-14396239 ] Apache Spark commented on SPARK-6715: - User 'viirya' has created a pull request for

[jira] [Commented] (SPARK-6676) Add hadoop 2.4+ for profiles in POM.xml

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396161#comment-14396161 ] Apache Spark commented on SPARK-6676: - User 'srowen' has created a pull request for

[jira] [Assigned] (SPARK-6713) Iterators in columnSimilarities to allow flatMap spill

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6713: --- Assignee: Apache Spark Iterators in columnSimilarities to allow flatMap spill

[jira] [Commented] (SPARK-6713) Iterators in columnSimilarities to allow flatMap spill

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396163#comment-14396163 ] Apache Spark commented on SPARK-6713: - User 'rezazadeh' has created a pull request for

[jira] [Assigned] (SPARK-6713) Iterators in columnSimilarities to allow flatMap spill

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6713: --- Assignee: (was: Apache Spark) Iterators in columnSimilarities to allow flatMap spill

[jira] [Updated] (SPARK-6713) Iterators in columnSimilarities to allow flatMap spill

2015-04-05 Thread Reza Zadeh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reza Zadeh updated SPARK-6713: -- Description: We should use Iterators in columnSimilarities to allow mapPartitionsWithIndex to spill to

[jira] [Commented] (SPARK-6695) Add an external iterator: a hadoop-like output collector

2015-04-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396174#comment-14396174 ] Sean Owen commented on SPARK-6695: -- See SPARK-6713 for a solution to this particular

[jira] [Commented] (SPARK-6569) Kafka directInputStream logs what appear to be incorrect warnings

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396178#comment-14396178 ] Apache Spark commented on SPARK-6569: - User 'srowen' has created a pull request for

[jira] [Created] (SPARK-6714) additionally overload KafkaUtils.createDirectStream for using a messageHandler without having to specify the offsets

2015-04-05 Thread Juan Rodríguez Hortalá (JIRA)
Juan Rodríguez Hortalá created SPARK-6714: - Summary: additionally overload KafkaUtils.createDirectStream for using a messageHandler without having to specify the offsets Key: SPARK-6714 URL:

[jira] [Assigned] (SPARK-6714) additionally overload KafkaUtils.createDirectStream for using a messageHandler without having to specify the offsets

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6714: --- Assignee: (was: Apache Spark) additionally overload KafkaUtils.createDirectStream for

[jira] [Created] (SPARK-6713) Iterators in columnSimilarities to allow flatMap spill

2015-04-05 Thread Reza Zadeh (JIRA)
Reza Zadeh created SPARK-6713: - Summary: Iterators in columnSimilarities to allow flatMap spill Key: SPARK-6713 URL: https://issues.apache.org/jira/browse/SPARK-6713 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-6630) SparkConf.setIfMissing should only evaluate the assigned value if indeed missing

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6630: --- Assignee: Apache Spark SparkConf.setIfMissing should only evaluate the assigned value if

[jira] [Assigned] (SPARK-6630) SparkConf.setIfMissing should only evaluate the assigned value if indeed missing

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6630: --- Assignee: (was: Apache Spark) SparkConf.setIfMissing should only evaluate the assigned

[jira] [Commented] (SPARK-6630) SparkConf.setIfMissing should only evaluate the assigned value if indeed missing

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396167#comment-14396167 ] Apache Spark commented on SPARK-6630: - User 'srowen' has created a pull request for

[jira] [Resolved] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6706. -- Resolution: Duplicate kmeans|| hangs for a long time if both k and vector dimension are large

[jira] [Assigned] (SPARK-6420) Driver's Block Manager does not use spark.driver.host in Yarn-Client mode

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6420: --- Assignee: Apache Spark Driver's Block Manager does not use spark.driver.host in Yarn-Client

[jira] [Assigned] (SPARK-6420) Driver's Block Manager does not use spark.driver.host in Yarn-Client mode

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6420: --- Assignee: (was: Apache Spark) Driver's Block Manager does not use spark.driver.host in

[jira] [Assigned] (SPARK-6478) new RDD.pipeWithPartition method

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6478: --- Assignee: Apache Spark new RDD.pipeWithPartition method

[jira] [Assigned] (SPARK-6478) new RDD.pipeWithPartition method

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6478: --- Assignee: (was: Apache Spark) new RDD.pipeWithPartition method

[jira] [Commented] (SPARK-6630) SparkConf.setIfMissing should only evaluate the assigned value if indeed missing

2015-04-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396357#comment-14396357 ] Sean Owen commented on SPARK-6630: -- Hm, I don't think we can change this, since it would

[jira] [Commented] (SPARK-6602) Replace direct use of Akka with Spark RPC interface

2015-04-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396285#comment-14396285 ] Apache Spark commented on SPARK-6602: - User 'zsxwing' has created a pull request for

[jira] [Commented] (SPARK-2336) Approximate k-NN Models for MLLib

2015-04-05 Thread Sen Fang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396306#comment-14396306 ] Sen Fang commented on SPARK-2336: - I'm tentatively going to give the hybrid spilltree

[jira] [Commented] (SPARK-6708) Using Hive UDTF may throw ClassNotFoundException

2015-04-05 Thread Adnan Khan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14396382#comment-14396382 ] Adnan Khan commented on SPARK-6708: --- is this related to SPARK-4854? Using Hive UDTF