[jira] [Commented] (SPARK-16777) Parquet schema converter depends on deprecated APIs

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398827#comment-15398827 ] holdenk commented on SPARK-16777: - That's a good point, thanks for the comment/note :) I

[jira] [Commented] (SPARK-16578) Configurable hostname for RBackend

2016-07-28 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398826#comment-15398826 ] Miao Wang commented on SPARK-16578: --- [~shivaram] It seems interesting. I can help inves

[jira] [Commented] (SPARK-16789) Can't run saveAsTable with database name

2016-07-28 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398807#comment-15398807 ] Xiao Li commented on SPARK-16789: - Let me take a look at this. > Can't run saveAsTable w

[jira] [Comment Edited] (SPARK-16777) Parquet schema converter depends on deprecated APIs

2016-07-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398781#comment-15398781 ] Hyukjin Kwon edited comment on SPARK-16777 at 7/29/16 6:29 AM:

[jira] [Commented] (SPARK-16776) Fix Kafka deprecation warnings

2016-07-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398786#comment-15398786 ] Hyukjin Kwon commented on SPARK-16776: -- Please let me leave the logs here just in ca

[jira] [Commented] (SPARK-16777) Parquet schema converter depends on deprecated APIs

2016-07-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398781#comment-15398781 ] Hyukjin Kwon commented on SPARK-16777: -- Please let me leave a note because I actuall

[jira] [Commented] (SPARK-16774) Fix use of deprecated TimeStamp constructor (also providing incorrect results)

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398776#comment-15398776 ] Apache Spark commented on SPARK-16774: -- User 'holdenk' has created a pull request fo

[jira] [Assigned] (SPARK-16774) Fix use of deprecated TimeStamp constructor (also providing incorrect results)

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16774: Assignee: Apache Spark > Fix use of deprecated TimeStamp constructor (also providing incor

[jira] [Assigned] (SPARK-16774) Fix use of deprecated TimeStamp constructor (also providing incorrect results)

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16774: Assignee: (was: Apache Spark) > Fix use of deprecated TimeStamp constructor (also prov

[jira] [Assigned] (SPARK-16771) Infinite recursion loop in org.apache.spark.sql.catalyst.trees.TreeNode when table name collides.

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16771: Assignee: (was: Apache Spark) > Infinite recursion loop in org.apache.spark.sql.cataly

[jira] [Commented] (SPARK-16771) Infinite recursion loop in org.apache.spark.sql.catalyst.trees.TreeNode when table name collides.

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398714#comment-15398714 ] Apache Spark commented on SPARK-16771: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Assigned] (SPARK-16771) Infinite recursion loop in org.apache.spark.sql.catalyst.trees.TreeNode when table name collides.

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16771: Assignee: Apache Spark > Infinite recursion loop in org.apache.spark.sql.catalyst.trees.Tr

[jira] [Assigned] (SPARK-16787) SparkContext.addFile() should not fail if called twice with the same file

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16787: Assignee: Apache Spark (was: Josh Rosen) > SparkContext.addFile() should not fail if call

[jira] [Commented] (SPARK-16787) SparkContext.addFile() should not fail if called twice with the same file

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398709#comment-15398709 ] Apache Spark commented on SPARK-16787: -- User 'JoshRosen' has created a pull request

[jira] [Assigned] (SPARK-16787) SparkContext.addFile() should not fail if called twice with the same file

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16787: Assignee: Josh Rosen (was: Apache Spark) > SparkContext.addFile() should not fail if call

[jira] [Commented] (SPARK-16771) Infinite recursion loop in org.apache.spark.sql.catalyst.trees.TreeNode when table name collides.

2016-07-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398701#comment-15398701 ] Dongjoon Hyun commented on SPARK-16771: --- Hi, [~fpin]. You're right. I can regenerat

[jira] [Commented] (SPARK-15694) Implement ScriptTransformation in sql/core

2016-07-28 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398694#comment-15398694 ] Tejas Patil commented on SPARK-15694: - I spent some time over last weekend working on

[jira] [Commented] (SPARK-16646) LEAST doesn't accept numeric arguments with different data types

2016-07-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398650#comment-15398650 ] Hyukjin Kwon commented on SPARK-16646: -- Sure, I will close the PR for meanwhile. The

[jira] [Commented] (SPARK-16788) Investigate JSR-310 & scala-time alternatives to our own datetime utils

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398632#comment-15398632 ] holdenk commented on SPARK-16788: - cc [~davies] [~ckadner] :) > Investigate JSR-310 & sc

[jira] [Created] (SPARK-16789) Can't run saveAsTable with database name

2016-07-28 Thread SonixLegend (JIRA)
SonixLegend created SPARK-16789: --- Summary: Can't run saveAsTable with database name Key: SPARK-16789 URL: https://issues.apache.org/jira/browse/SPARK-16789 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-16788) Investigate JSR-310 & scala-time alternatives to our own datetime utils

2016-07-28 Thread holdenk (JIRA)
holdenk created SPARK-16788: --- Summary: Investigate JSR-310 & scala-time alternatives to our own datetime utils Key: SPARK-16788 URL: https://issues.apache.org/jira/browse/SPARK-16788 Project: Spark

[jira] [Commented] (SPARK-16746) Spark streaming lost data when ReceiverTracker writes Blockinfo to hdfs timeout

2016-07-28 Thread Hongyao Zhao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398615#comment-15398615 ] Hongyao Zhao commented on SPARK-16746: -- I did some test yesterday, It seems that sp

[jira] [Commented] (SPARK-16748) Errors thrown by UDFs cause TreeNodeException when the query has an ORDER BY clause

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398587#comment-15398587 ] Apache Spark commented on SPARK-16748: -- User 'tdas' has created a pull request for t

[jira] [Assigned] (SPARK-16748) Errors thrown by UDFs cause TreeNodeException when the query has an ORDER BY clause

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16748: Assignee: Tathagata Das (was: Apache Spark) > Errors thrown by UDFs cause TreeNodeExcepti

[jira] [Assigned] (SPARK-16748) Errors thrown by UDFs cause TreeNodeException when the query has an ORDER BY clause

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16748: Assignee: Apache Spark (was: Tathagata Das) > Errors thrown by UDFs cause TreeNodeExcepti

[jira] [Comment Edited] (SPARK-16768) pyspark calls incorrect version of logistic regression

2016-07-28 Thread Colin Beckingham (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398571#comment-15398571 ] Colin Beckingham edited comment on SPARK-16768 at 7/29/16 2:08 AM:

[jira] [Commented] (SPARK-16646) LEAST doesn't accept numeric arguments with different data types

2016-07-28 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398582#comment-15398582 ] Wenchen Fan commented on SPARK-16646: - We are discussing this internally, can you hol

[jira] [Commented] (SPARK-16768) pyspark calls incorrect version of logistic regression

2016-07-28 Thread Colin Beckingham (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398571#comment-15398571 ] Colin Beckingham commented on SPARK-16768: -- This is very strange then. I can lau

[jira] [Commented] (SPARK-16786) LDA topic distributions for new documents in PySpark

2016-07-28 Thread Jordan Beauchamp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398560#comment-15398560 ] Jordan Beauchamp commented on SPARK-16786: -- topicDistribution has been implement

[jira] [Commented] (SPARK-16786) LDA topic distributions for new documents in PySpark

2016-07-28 Thread Jordan Beauchamp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398559#comment-15398559 ] Jordan Beauchamp commented on SPARK-16786: -- One minor difficulty is that downstr

[jira] [Assigned] (SPARK-16786) LDA topic distributions for new documents in PySpark

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16786: Assignee: Apache Spark > LDA topic distributions for new documents in PySpark > --

[jira] [Assigned] (SPARK-16786) LDA topic distributions for new documents in PySpark

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16786: Assignee: (was: Apache Spark) > LDA topic distributions for new documents in PySpark >

[jira] [Commented] (SPARK-16786) LDA topic distributions for new documents in PySpark

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398558#comment-15398558 ] Apache Spark commented on SPARK-16786: -- User 'supremekai' has created a pull request

[jira] [Commented] (SPARK-16753) Spark SQL doesn't handle skewed dataset joins properly

2016-07-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398551#comment-15398551 ] Reynold Xin commented on SPARK-16753: - Got it - definitely good to do skew join. Ther

[jira] [Assigned] (SPARK-16748) Errors thrown by UDFs cause TreeNodeException when the query has an ORDER BY clause

2016-07-28 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reassigned SPARK-16748: - Assignee: Tathagata Das > Errors thrown by UDFs cause TreeNodeException when the query h

[jira] [Updated] (SPARK-16787) SparkContext.addFile() should not fail if called twice with the same file

2016-07-28 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-16787: --- Target Version/s: 2.0.1 (was: 1.6.3, 2.0.1) > SparkContext.addFile() should not fail if called twice

[jira] [Created] (SPARK-16787) SparkContext.addFile() should not fail if called twice with the same file

2016-07-28 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-16787: -- Summary: SparkContext.addFile() should not fail if called twice with the same file Key: SPARK-16787 URL: https://issues.apache.org/jira/browse/SPARK-16787 Project: Spark

[jira] [Comment Edited] (SPARK-16774) Fix use of deprecated TimeStamp constructor (also providing incorrect results)

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398424#comment-15398424 ] holdenk edited comment on SPARK-16774 at 7/28/16 11:38 PM: --- Whi

[jira] [Updated] (SPARK-16774) Fix use of deprecated TimeStamp constructor (also providing incorrect results)

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-16774: Description: The TimeStamp constructor we use inside of DateTime utils has been deprecated since JDK 1.1 -

[jira] [Updated] (SPARK-16774) Fix use of deprecated TimeStamp constructor

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-16774: Description: The TimeStamp constructor we use inside of DateTime utils has been deprecated since JDK 1.1 -

[jira] [Updated] (SPARK-16774) Fix use of deprecated TimeStamp constructor (also providing incorrect results)

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-16774: Summary: Fix use of deprecated TimeStamp constructor (also providing incorrect results) (was: Fix use of d

[jira] [Commented] (SPARK-16774) Fix use of deprecated TimeStamp constructor

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398424#comment-15398424 ] holdenk commented on SPARK-16774: - While diving into this (relatedly I hate timezones) -

[jira] [Commented] (SPARK-16768) pyspark calls incorrect version of logistic regression

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398409#comment-15398409 ] Sean Owen commented on SPARK-16768: --- That's spark.ml.LogisticRegression, right? there's

[jira] [Commented] (SPARK-16785) dapply doesn't return array or raw columns

2016-07-28 Thread Clark Fitzgerald (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398369#comment-15398369 ] Clark Fitzgerald commented on SPARK-16785: -- To fix this I propose to treat the r

[jira] [Comment Edited] (SPARK-16785) dapply doesn't return array or raw columns

2016-07-28 Thread Clark Fitzgerald (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398312#comment-15398312 ] Clark Fitzgerald edited comment on SPARK-16785 at 7/28/16 10:51 PM: ---

[jira] [Commented] (SPARK-16611) Expose several hidden DataFrame/RDD functions

2016-07-28 Thread Clark Fitzgerald (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398346#comment-15398346 ] Clark Fitzgerald commented on SPARK-16611: -- +1 for more direct access to the RDD

[jira] [Updated] (SPARK-16786) LDA topic distributions for new documents in PySpark

2016-07-28 Thread Jordan Beauchamp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jordan Beauchamp updated SPARK-16786: - Target Version/s: (was: 2.0.0) > LDA topic distributions for new documents in PySpark >

[jira] [Updated] (SPARK-16786) LDA topic distributions for new documents in PySpark

2016-07-28 Thread Jordan Beauchamp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jordan Beauchamp updated SPARK-16786: - Fix Version/s: (was: 2.0.0) > LDA topic distributions for new documents in PySpark >

[jira] [Created] (SPARK-16786) LDA topic distributions for new documents in PySpark

2016-07-28 Thread Jordan Beauchamp (JIRA)
Jordan Beauchamp created SPARK-16786: Summary: LDA topic distributions for new documents in PySpark Key: SPARK-16786 URL: https://issues.apache.org/jira/browse/SPARK-16786 Project: Spark

[jira] [Updated] (SPARK-16785) dapply doesn't return array or raw columns

2016-07-28 Thread Clark Fitzgerald (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Clark Fitzgerald updated SPARK-16785: - Priority: Minor (was: Major) > dapply doesn't return array or raw columns >

[jira] [Commented] (SPARK-16785) dapply doesn't return array or raw columns

2016-07-28 Thread Clark Fitzgerald (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398312#comment-15398312 ] Clark Fitzgerald commented on SPARK-16785: -- [~shivaram] and I have had some emai

[jira] [Created] (SPARK-16785) dapply doesn't return array or raw columns

2016-07-28 Thread Clark Fitzgerald (JIRA)
Clark Fitzgerald created SPARK-16785: Summary: dapply doesn't return array or raw columns Key: SPARK-16785 URL: https://issues.apache.org/jira/browse/SPARK-16785 Project: Spark Issue Type

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2016-07-28 Thread Denis Serduik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398289#comment-15398289 ] Denis Serduik commented on SPARK-2984: -- Something like this... The problem occurs whe

[jira] [Closed] (SPARK-16783) make-distri

2016-07-28 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Gummelt closed SPARK-16783. --- Resolution: Not A Problem > make-distri > --- > > Key: SPARK-16783 >

[jira] [Created] (SPARK-16784) Configurable log4j settings

2016-07-28 Thread Michael Gummelt (JIRA)
Michael Gummelt created SPARK-16784: --- Summary: Configurable log4j settings Key: SPARK-16784 URL: https://issues.apache.org/jira/browse/SPARK-16784 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-16772) Correct API doc references to PySpark classes + formatting fixes

2016-07-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16772. - Resolution: Fixed Assignee: Nicholas Chammas Fix Version/s: 2.1.0

[jira] [Updated] (SPARK-16769) httpclient classic dependency - potentially a patch required?

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16769: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Question) I think the issue is that

[jira] [Commented] (SPARK-16745) Spark job completed however have to wait for 13 mins (data size is small)

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398272#comment-15398272 ] Sean Owen commented on SPARK-16745: --- I suspect this is a duplicate of one of the many i

[jira] [Commented] (SPARK-12157) Support numpy types as return values of Python UDFs

2016-07-28 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398267#comment-15398267 ] Nicholas Chammas commented on SPARK-12157: -- I'm looking to define a UDF in PySpa

[jira] [Commented] (SPARK-6305) Add support for log4j 2.x to Spark

2016-07-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398237#comment-15398237 ] Reynold Xin commented on SPARK-6305: Thanks, Mikael. Would love to get some help here.

[jira] [Created] (SPARK-16783) make-distri

2016-07-28 Thread Michael Gummelt (JIRA)
Michael Gummelt created SPARK-16783: --- Summary: make-distri Key: SPARK-16783 URL: https://issues.apache.org/jira/browse/SPARK-16783 Project: Spark Issue Type: Bug Reporter: Micha

[jira] [Commented] (SPARK-16365) Ideas for moving "mllib-local" forward

2016-07-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398178#comment-15398178 ] Joseph K. Bradley commented on SPARK-16365: --- This JIRA is covering multiple pot

[jira] [Commented] (SPARK-16780) spark-streaming-kafka_2.10 version 2.0.0 not on maven central

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398179#comment-15398179 ] Sean Owen commented on SPARK-16780: --- There is also an "0.8" artifact, which is likely w

[jira] [Commented] (SPARK-16334) [SQL] SQL query on parquet table java.lang.ArrayIndexOutOfBoundsException

2016-07-28 Thread Keith Kraus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398130#comment-15398130 ] Keith Kraus commented on SPARK-16334: - [~sameerag] I have just built branch-2.0 which

[jira] [Resolved] (SPARK-16764) Recommend disabling vectorized parquet reader on OutOfMemoryError

2016-07-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16764. - Resolution: Fixed Assignee: Sameer Agarwal Fix Version/s: 2.1.0

[jira] [Commented] (SPARK-16780) spark-streaming-kafka_2.10 version 2.0.0 not on maven central

2016-07-28 Thread Andrew B (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398105#comment-15398105 ] Andrew B commented on SPARK-16780: -- How are the new artifacts used with the example belo

[jira] [Resolved] (SPARK-16762) spark hanging when action method print

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16762. --- Resolution: Invalid Yeah, too many possible things wrong; it should be narrowed down separately firs

[jira] [Commented] (SPARK-16770) Spark shell not usable with german keyboard due to JLine version

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398072#comment-15398072 ] Sean Owen commented on SPARK-16770: --- Sure, if you would please open a pull request to u

[jira] [Commented] (SPARK-16774) Fix use of deprecated TimeStamp constructor

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398063#comment-15398063 ] Sean Owen commented on SPARK-16774: --- Yeah I remember this one. Seems like it would take

[jira] [Commented] (SPARK-16782) Use Sphinx autodoc to eliminate duplication of Python docstrings

2016-07-28 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398059#comment-15398059 ] Nicholas Chammas commented on SPARK-16782: -- [~davies] [~joshrosen] - I can take

[jira] [Created] (SPARK-16782) Use Sphinx autodoc to eliminate duplication of Python docstrings

2016-07-28 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-16782: Summary: Use Sphinx autodoc to eliminate duplication of Python docstrings Key: SPARK-16782 URL: https://issues.apache.org/jira/browse/SPARK-16782 Project: Spa

[jira] [Resolved] (SPARK-16780) spark-streaming-kafka_2.10 version 2.0.0 not on maven central

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16780. --- Resolution: Not A Problem These have become artifacts like spark-streaming-kafka-0-10_2.10 in 2.0.0

[jira] [Commented] (SPARK-16769) httpclient classic dependency - potentially a patch required?

2016-07-28 Thread Adam Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15398018#comment-15398018 ] Adam Roberts commented on SPARK-16769: -- Thanks, looking on Maven central I see the l

[jira] [Commented] (SPARK-16765) Add Pipeline API example for KMeans

2016-07-28 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397994#comment-15397994 ] Bryan Cutler commented on SPARK-16765: -- Was there some specific use of Pipelines wit

[jira] [Comment Edited] (SPARK-16779) Fix unnecessary use of postfix operations

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397984#comment-15397984 ] holdenk edited comment on SPARK-16779 at 7/28/16 6:36 PM: -- I'm s

[jira] [Commented] (SPARK-16779) Fix unnecessary use of postfix operations

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397984#comment-15397984 ] holdenk commented on SPARK-16779: - I'm sort of on the fence with fixing as well - but we

[jira] [Created] (SPARK-16781) java launched by PySpark as gateway may not be the same java used in the spark environment

2016-07-28 Thread Michael Berman (JIRA)
Michael Berman created SPARK-16781: -- Summary: java launched by PySpark as gateway may not be the same java used in the spark environment Key: SPARK-16781 URL: https://issues.apache.org/jira/browse/SPARK-16781

[jira] [Commented] (SPARK-16779) Fix unnecessary use of postfix operations

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397968#comment-15397968 ] Sean Owen commented on SPARK-16779: --- Yeah I tend to agree with fixing this up, because

[jira] [Created] (SPARK-16780) spark-streaming-kafka_2.10 version 2.0.0 not on maven central

2016-07-28 Thread Andrew B (JIRA)
Andrew B created SPARK-16780: Summary: spark-streaming-kafka_2.10 version 2.0.0 not on maven central Key: SPARK-16780 URL: https://issues.apache.org/jira/browse/SPARK-16780 Project: Spark Issue

[jira] [Created] (SPARK-16779) Fix unnecessary use of postfix operations

2016-07-28 Thread holdenk (JIRA)
holdenk created SPARK-16779: --- Summary: Fix unnecessary use of postfix operations Key: SPARK-16779 URL: https://issues.apache.org/jira/browse/SPARK-16779 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-16773) Post Spark 2.0 deprecation & warnings cleanup

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-16773: Summary: Post Spark 2.0 deprecation & warnings cleanup (was: Post Spark 2.0 deprecation cleanup) > Post S

[jira] [Commented] (SPARK-16769) httpclient classic dependency - potentially a patch required?

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397961#comment-15397961 ] Sean Owen commented on SPARK-16769: --- I'm pretty certain it's there only because some de

[jira] [Created] (SPARK-16778) Fix use of deprecated SQLContext constructor

2016-07-28 Thread holdenk (JIRA)
holdenk created SPARK-16778: --- Summary: Fix use of deprecated SQLContext constructor Key: SPARK-16778 URL: https://issues.apache.org/jira/browse/SPARK-16778 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-16775) Reduce internal warnings from deprecated accumulator API

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-16775: Component/s: SQL > Reduce internal warnings from deprecated accumulator API > -

[jira] [Created] (SPARK-16777) Parquet schema converter depends on deprecated APIs

2016-07-28 Thread holdenk (JIRA)
holdenk created SPARK-16777: --- Summary: Parquet schema converter depends on deprecated APIs Key: SPARK-16777 URL: https://issues.apache.org/jira/browse/SPARK-16777 Project: Spark Issue Type: Sub-tas

[jira] [Updated] (SPARK-16775) Reduce internal warnings from deprecated accumulator API

2016-07-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-16775: Component/s: (was: ML) (was: SQL) (was: MLlib) > Reduce inter

[jira] [Created] (SPARK-16776) Fix Kafka deprecation warnings

2016-07-28 Thread holdenk (JIRA)
holdenk created SPARK-16776: --- Summary: Fix Kafka deprecation warnings Key: SPARK-16776 URL: https://issues.apache.org/jira/browse/SPARK-16776 Project: Spark Issue Type: Sub-task Component

[jira] [Created] (SPARK-16775) Reduce internal warnings from deprecated accumulator API

2016-07-28 Thread holdenk (JIRA)
holdenk created SPARK-16775: --- Summary: Reduce internal warnings from deprecated accumulator API Key: SPARK-16775 URL: https://issues.apache.org/jira/browse/SPARK-16775 Project: Spark Issue Type: Su

[jira] [Created] (SPARK-16774) Fix use of deprecated TimeStamp constructor

2016-07-28 Thread holdenk (JIRA)
holdenk created SPARK-16774: --- Summary: Fix use of deprecated TimeStamp constructor Key: SPARK-16774 URL: https://issues.apache.org/jira/browse/SPARK-16774 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-16773) Post Spark 2.0 deprecation cleanup

2016-07-28 Thread holdenk (JIRA)
holdenk created SPARK-16773: --- Summary: Post Spark 2.0 deprecation cleanup Key: SPARK-16773 URL: https://issues.apache.org/jira/browse/SPARK-16773 Project: Spark Issue Type: Improvement Co

[jira] [Commented] (SPARK-16769) httpclient classic dependency - potentially a patch required?

2016-07-28 Thread Adam Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397889#comment-15397889 ] Adam Roberts commented on SPARK-16769: -- [~rxin] [~srowen] Reynold and Sean, interest

[jira] [Commented] (SPARK-11248) Spark hivethriftserver is using the wrong user to while getting HDFS permissions

2016-07-28 Thread Furcy Pin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397882#comment-15397882 ] Furcy Pin commented on SPARK-11248: --- +1 I'm trying on spark 2.0.0, and I've configured

[jira] [Commented] (SPARK-16751) Upgrade derby to 10.12.1.1 from 10.11.1.1

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397865#comment-15397865 ] Sean Owen commented on SPARK-16751: --- Yeah, I see from the PR that it's actually package

[jira] [Resolved] (SPARK-16763) Factor out Tunsten for General Purpose Use

2016-07-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16763. --- Resolution: Invalid I'm going to close this because it's a question for the user@ list. However I do

[jira] [Updated] (SPARK-16772) Correct API doc references to PySpark classes + formatting fixes

2016-07-28 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-16772: - Summary: Correct API doc references to PySpark classes + formatting fixes (was: Correct

[jira] [Updated] (SPARK-16772) Correct API doc references to PySpark classes

2016-07-28 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-16772: - Summary: Correct API doc references to PySpark classes (was: Correct API doc references

[jira] [Commented] (SPARK-16611) Expose several hidden DataFrame/RDD functions

2016-07-28 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397827#comment-15397827 ] Shivaram Venkataraman commented on SPARK-16611: --- 1. lapply: From an API per

[jira] [Resolved] (SPARK-16740) joins.LongToUnsafeRowMap crashes with NegativeArraySizeException

2016-07-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16740. - Resolution: Fixed Assignee: Sylvain Zimmer Fix Version/s: 2.1.0

[jira] [Assigned] (SPARK-16772) Correct API doc references to DataType + other minor doc tweaks

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16772: Assignee: (was: Apache Spark) > Correct API doc references to DataType + other minor d

[jira] [Commented] (SPARK-16772) Correct API doc references to DataType + other minor doc tweaks

2016-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397796#comment-15397796 ] Apache Spark commented on SPARK-16772: -- User 'nchammas' has created a pull request f

  1   2   >