[jira] [Assigned] (SPARK-15321) Encoding/decoding of Array[Timestamp] fails

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15321: Assignee: Apache Spark > Encoding/decoding of Array[Timestamp] fails >

[jira] [Assigned] (SPARK-15321) Encoding/decoding of Array[Timestamp] fails

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15321: Assignee: (was: Apache Spark) > Encoding/decoding of Array[Timestamp] fails >

[jira] [Commented] (SPARK-15321) Encoding/decoding of Array[Timestamp] fails

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283425#comment-15283425 ] Apache Spark commented on SPARK-15321: -- User 'smungee' has created a pull request for this issue:

[jira] [Created] (SPARK-15321) Encoding/decoding of Array[Timestamp] fails

2016-05-13 Thread Sumedh Mungee (JIRA)
Sumedh Mungee created SPARK-15321: - Summary: Encoding/decoding of Array[Timestamp] fails Key: SPARK-15321 URL: https://issues.apache.org/jira/browse/SPARK-15321 Project: Spark Issue Type:

[jira] [Commented] (SPARK-15320) Spark-SQL Cli Ignores Parameter hive.metastore.warehouse.dir

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283414#comment-15283414 ] Apache Spark commented on SPARK-15320: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15320) Spark-SQL Cli Ignores Parameter hive.metastore.warehouse.dir

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15320: Assignee: Apache Spark > Spark-SQL Cli Ignores Parameter hive.metastore.warehouse.dir >

[jira] [Assigned] (SPARK-15320) Spark-SQL Cli Ignores Parameter hive.metastore.warehouse.dir

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15320: Assignee: (was: Apache Spark) > Spark-SQL Cli Ignores Parameter

[jira] [Updated] (SPARK-15320) Spark-SQL Cli Ignores Parameter hive.metastore.warehouse.dir

2016-05-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-15320: Description: When overriding {{hive.metastore.warehouse.dir}} in the spark-sql command line, it does not

[jira] [Created] (SPARK-15320) Spark-SQL Cli Ignores Parameter hive.metastore.warehouse.dir

2016-05-13 Thread Xiao Li (JIRA)
Xiao Li created SPARK-15320: --- Summary: Spark-SQL Cli Ignores Parameter hive.metastore.warehouse.dir Key: SPARK-15320 URL: https://issues.apache.org/jira/browse/SPARK-15320 Project: Spark Issue

[jira] [Commented] (SPARK-15002) Calling unpersist can cause spark to hang indefinitely when writing out a result

2016-05-13 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283391#comment-15283391 ] Vijay Parmar commented on SPARK-15002: -- The code executed smoothly till the below line :-

[jira] [Comment Edited] (SPARK-15000) Spark hangs indefinitely if you cache a dataframe, then show it, then do some further processing on it

2016-05-13 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283379#comment-15283379 ] Vijay Parmar edited comment on SPARK-15000 at 5/14/16 2:16 AM: --- I ran the

[jira] [Commented] (SPARK-15000) Spark hangs indefinitely if you cache a dataframe, then show it, then do some further processing on it

2016-05-13 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283379#comment-15283379 ] Vijay Parmar commented on SPARK-15000: -- I ran the example code provided by you on Scala 1.6.1 and

[jira] [Commented] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local, standalone

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283344#comment-15283344 ] Joseph K. Bradley commented on SPARK-15317: --- Comparing 1.6 and 2.0 more carefully, this is

[jira] [Updated] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local, standalone

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15317: -- Attachment: compare-2.0-10Kpartitions.png compare-2.0-16partitions.png

[jira] [Comment Edited] (SPARK-4820) Spark build encounters "File name too long" on some encrypted filesystems

2016-05-13 Thread Joel Bondurant (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283330#comment-15283330 ] Joel Bondurant edited comment on SPARK-4820 at 5/14/16 12:13 AM: - RE: "If

[jira] [Commented] (SPARK-4820) Spark build encounters "File name too long" on some encrypted filesystems

2016-05-13 Thread Joel Bondurant (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283330#comment-15283330 ] Joel Bondurant commented on SPARK-4820: --- RE: "If you encounter this issue please comment on the JIRA

[jira] [Comment Edited] (SPARK-15159) SparkSession R API

2016-05-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283321#comment-15283321 ] Felix Cheung edited comment on SPARK-15159 at 5/14/16 12:01 AM: Right,

[jira] [Comment Edited] (SPARK-15159) SparkSession R API

2016-05-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283321#comment-15283321 ] Felix Cheung edited comment on SPARK-15159 at 5/13/16 11:59 PM: Right,

[jira] [Commented] (SPARK-15159) SparkSession R API

2016-05-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283321#comment-15283321 ] Felix Cheung commented on SPARK-15159: -- Right, possibly with SQLContext/HiveContext as wrapper of

[jira] [Commented] (SPARK-15118) spark couldn't get hive properyties in hive-site.xml

2016-05-13 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283301#comment-15283301 ] Vijay Parmar commented on SPARK-15118: -- Those configurations are all deprecated and you can remove

[jira] [Assigned] (SPARK-15318) spark.ml Collaborative Filtering example does not work in spark-shell

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15318: Assignee: Apache Spark > spark.ml Collaborative Filtering example does not work in

[jira] [Commented] (SPARK-15318) spark.ml Collaborative Filtering example does not work in spark-shell

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283294#comment-15283294 ] Apache Spark commented on SPARK-15318: -- User 'wangmiao1981' has created a pull request for this

[jira] [Assigned] (SPARK-15318) spark.ml Collaborative Filtering example does not work in spark-shell

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15318: Assignee: (was: Apache Spark) > spark.ml Collaborative Filtering example does not

[jira] [Commented] (SPARK-15319) Fix SparkR doc layout for corr and other DataFrame stats functions

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283291#comment-15283291 ] Apache Spark commented on SPARK-15319: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-15319) Fix SparkR doc layout for corr and other DataFrame stats functions

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15319: Assignee: (was: Apache Spark) > Fix SparkR doc layout for corr and other DataFrame

[jira] [Assigned] (SPARK-15319) Fix SparkR doc layout for corr and other DataFrame stats functions

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15319: Assignee: Apache Spark > Fix SparkR doc layout for corr and other DataFrame stats

[jira] [Commented] (SPARK-15237) SparkR corr function documentation

2016-05-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283283#comment-15283283 ] Felix Cheung commented on SPARK-15237: -- opened SPARK-15319 > SparkR corr function documentation >

[jira] [Created] (SPARK-15319) Fix SparkR doc layout for corr and other DataFrame stats functions

2016-05-13 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-15319: Summary: Fix SparkR doc layout for corr and other DataFrame stats functions Key: SPARK-15319 URL: https://issues.apache.org/jira/browse/SPARK-15319 Project: Spark

[jira] [Commented] (SPARK-15318) spark.ml Collaborative Filtering example does not work in spark-shell

2016-05-13 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283271#comment-15283271 ] Miao Wang commented on SPARK-15318: --- I am working to find a solution now. > spark.ml Collaborative

[jira] [Created] (SPARK-15318) spark.ml Collaborative Filtering example does not work in spark-shell

2016-05-13 Thread Miao Wang (JIRA)
Miao Wang created SPARK-15318: - Summary: spark.ml Collaborative Filtering example does not work in spark-shell Key: SPARK-15318 URL: https://issues.apache.org/jira/browse/SPARK-15318 Project: Spark

[jira] [Updated] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local, standalone

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15317: -- Description: h2. TL;DR Running a small test locally, I found JobProgressListener

[jira] [Updated] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local, standalone

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15317: -- Summary: JobProgressListener takes a huge amount of memory with iterative DataFrame

[jira] [Updated] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local, standalone

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15317: -- Summary: JobProgressListener takes a huge amount of memory with iterative DataFrame

[jira] [Comment Edited] (SPARK-15237) SparkR corr function documentation

2016-05-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283234#comment-15283234 ] Felix Cheung edited comment on SPARK-15237 at 5/13/16 10:38 PM: as per

[jira] [Commented] (SPARK-15237) SparkR corr function documentation

2016-05-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283234#comment-15283234 ] Felix Cheung commented on SPARK-15237: -- as per the @rdname

[jira] [Commented] (SPARK-15129) Clarify conventions for calling Spark and MLlib from R

2016-05-13 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283209#comment-15283209 ] Gayathri Murali commented on SPARK-15129: - ||API Change||User Guide updated|| |Family and Link

[jira] [Comment Edited] (SPARK-15302) Implement FK/PK "rely novalidate" constraints for better CBO

2016-05-13 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283194#comment-15283194 ] Ruslan Dautkhanov edited comment on SPARK-15302 at 5/13/16 9:45 PM:

[jira] [Commented] (SPARK-15302) Implement FK/PK "rely novalidate" constraints for better CBO

2016-05-13 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283194#comment-15283194 ] Ruslan Dautkhanov commented on SPARK-15302: --- Yes, it's a feature request for Spark. See for

[jira] [Updated] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local mode

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15317: -- Description: Running a small test locally, I found JobProgressListener consuming a

[jira] [Updated] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local mode

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15317: -- Attachment: dump-standalone-2.0-4of4.png dump-standalone-2.0-3of4.png

[jira] [Updated] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local mode

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15317: -- Attachment: cc_traces.txt > JobProgressListener takes a huge amount of memory with

[jira] [Updated] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local mode

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15317: -- Description: Running a small test locally, I found JobProgressListener consuming a

[jira] [Created] (SPARK-15317) JobProgressListener takes a huge amount of memory with iterative DataFrame program in local mode

2016-05-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-15317: - Summary: JobProgressListener takes a huge amount of memory with iterative DataFrame program in local mode Key: SPARK-15317 URL:

[jira] [Commented] (SPARK-13850) TimSort Comparison method violates its general contract

2016-05-13 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283110#comment-15283110 ] Sital Kedia commented on SPARK-13850: - I have found a workaround for this issue. Please take a look

[jira] [Assigned] (SPARK-13850) TimSort Comparison method violates its general contract

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13850: Assignee: (was: Apache Spark) > TimSort Comparison method violates its general

[jira] [Commented] (SPARK-13850) TimSort Comparison method violates its general contract

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283106#comment-15283106 ] Apache Spark commented on SPARK-13850: -- User 'sitalkedia' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13850) TimSort Comparison method violates its general contract

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13850: Assignee: Apache Spark > TimSort Comparison method violates its general contract >

[jira] [Assigned] (SPARK-15316) PySpark GeneralizedLinearRegression missing linkPredictionCol param

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15316: Assignee: Apache Spark > PySpark GeneralizedLinearRegression missing linkPredictionCol

[jira] [Commented] (SPARK-15316) PySpark GeneralizedLinearRegression missing linkPredictionCol param

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283104#comment-15283104 ] Apache Spark commented on SPARK-15316: -- User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15316) PySpark GeneralizedLinearRegression missing linkPredictionCol param

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15316: Assignee: (was: Apache Spark) > PySpark GeneralizedLinearRegression missing

[jira] [Updated] (SPARK-15316) PySpark GeneralizedLinearRegression missing linkPredictionCol param

2016-05-13 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-15316: Description: PySpark's GeneralizedLinearRegression is missing the linkPredictionCol param. (was:

[jira] [Created] (SPARK-15316) PySpark GeneralizedLinearRegression missing linkPredictionCol param

2016-05-13 Thread holdenk (JIRA)
holdenk created SPARK-15316: --- Summary: PySpark GeneralizedLinearRegression missing linkPredictionCol param Key: SPARK-15316 URL: https://issues.apache.org/jira/browse/SPARK-15316 Project: Spark

[jira] [Commented] (SPARK-13346) Using DataFrames iteratively leads to massive query plans, which slows execution

2016-05-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283052#comment-15283052 ] Joseph K. Bradley commented on SPARK-13346: --- Sure, the practical applications are pretty much

[jira] [Assigned] (SPARK-15315) CSV datasource writes garbage for complex types instead of rasing error.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15315: Assignee: (was: Apache Spark) > CSV datasource writes garbage for complex types

[jira] [Assigned] (SPARK-15315) CSV datasource writes garbage for complex types instead of rasing error.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15315: Assignee: Apache Spark > CSV datasource writes garbage for complex types instead of

[jira] [Commented] (SPARK-15315) CSV datasource writes garbage for complex types instead of rasing error.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283051#comment-15283051 ] Apache Spark commented on SPARK-15315: -- User 'sureshthalamati' has created a pull request for this

[jira] [Commented] (SPARK-14463) read.text broken for partitioned tables

2016-05-13 Thread Jurriaan Pruis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283044#comment-15283044 ] Jurriaan Pruis commented on SPARK-14463: Actually, this functionality is broken (explicitly

[jira] [Commented] (SPARK-15315) CSV datasource writes garbage for complex types instead of rasing error.

2016-05-13 Thread Suresh Thalamati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283042#comment-15283042 ] Suresh Thalamati commented on SPARK-15315: -- I am working on submitting PR for this issue , will

[jira] [Created] (SPARK-15315) CSV datasource writes garbage for complex types instead of rasing error.

2016-05-13 Thread Suresh Thalamati (JIRA)
Suresh Thalamati created SPARK-15315: Summary: CSV datasource writes garbage for complex types instead of rasing error. Key: SPARK-15315 URL: https://issues.apache.org/jira/browse/SPARK-15315

[jira] [Commented] (SPARK-15075) Cleanup dependencies between SQLContext and SparkSession

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282970#comment-15282970 ] Apache Spark commented on SPARK-15075: -- User 'dilipbiswal' has created a pull request for this

[jira] [Assigned] (SPARK-15075) Cleanup dependencies between SQLContext and SparkSession

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15075: Assignee: Apache Spark > Cleanup dependencies between SQLContext and SparkSession >

[jira] [Assigned] (SPARK-15075) Cleanup dependencies between SQLContext and SparkSession

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15075: Assignee: (was: Apache Spark) > Cleanup dependencies between SQLContext and

[jira] [Commented] (SPARK-13485) (Dataset-oriented) API evolution in Spark 2.0

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282965#comment-15282965 ] Apache Spark commented on SPARK-13485: -- User 'dilipbiswal' has created a pull request for this

[jira] [Commented] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282910#comment-15282910 ] Dongjoon Hyun commented on SPARK-15314: --- Also, PR has been ready 11 days ago. > Enable tests that

[jira] [Closed] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandeep Singh closed SPARK-15314. - Resolution: Duplicate > Enable tests that required save/load for Pipeline API >

[jira] [Commented] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282923#comment-15282923 ] Sandeep Singh commented on SPARK-15314: --- My bad closed. > Enable tests that required save/load for

[jira] [Comment Edited] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282910#comment-15282910 ] Dongjoon Hyun edited comment on SPARK-15314 at 5/13/16 5:43 PM: Also, PR

[jira] [Commented] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282906#comment-15282906 ] Dongjoon Hyun commented on SPARK-15314: --- Hi, [~techaddict]. It's duplicated of SPARK-15058. >

[jira] [Updated] (SPARK-15296) Refactor All Java Tests that use SparkSession

2016-05-13 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandeep Singh updated SPARK-15296: -- Description: There's a lot of Duplicate code in Java tests. {{setUp()}} and {{tearDown()}} of

[jira] [Assigned] (SPARK-15296) Refactor All Java Tests that use SparkSession

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15296: Assignee: (was: Apache Spark) > Refactor All Java Tests that use SparkSession >

[jira] [Assigned] (SPARK-15296) Refactor All Java Tests that use SparkSession

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15296: Assignee: Apache Spark > Refactor All Java Tests that use SparkSession >

[jira] [Commented] (SPARK-15296) Refactor All Java Tests that use SparkSession

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282842#comment-15282842 ] Apache Spark commented on SPARK-15296: -- User 'techaddict' has created a pull request for this issue:

[jira] [Resolved] (SPARK-15267) Refactor and add some classes for options in datasources like CSVOptions or JSONOptions

2016-05-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15267. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.0.0 > Refactor and add

[jira] [Commented] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282688#comment-15282688 ] Apache Spark commented on SPARK-15314: -- User 'techaddict' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15314: Assignee: (was: Apache Spark) > Enable tests that required save/load for Pipeline API

[jira] [Assigned] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15314: Assignee: Apache Spark > Enable tests that required save/load for Pipeline API >

[jira] [Created] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Sandeep Singh (JIRA)
Sandeep Singh created SPARK-15314: - Summary: Enable tests that required save/load for Pipeline API Key: SPARK-15314 URL: https://issues.apache.org/jira/browse/SPARK-15314 Project: Spark

[jira] [Updated] (SPARK-15314) Enable tests that required save/load for Pipeline API

2016-05-13 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandeep Singh updated SPARK-15314: -- Description: Some tests were commented out in

[jira] [Updated] (SPARK-15296) Refactor All Java Tests that use SparkSession

2016-05-13 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandeep Singh updated SPARK-15296: -- Description: There's a lot of Duplicate code in Java tests. {{setUp()}} and {{tearDown()}} of

[jira] [Updated] (SPARK-15296) Refactor All Java Tests that use SparkSession

2016-05-13 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandeep Singh updated SPARK-15296: -- Component/s: (was: SQL) > Refactor All Java Tests that use SparkSession >

[jira] [Reopened] (SPARK-12972) Update org.apache.httpcomponents.httpclient

2016-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-12972: --- Hm, looks like this update actually caused test failures, but not in the PR builder. Reopening to take

[jira] [Closed] (SPARK-14638) Spark task does not have access to a dependency in the classloader of the executor thread

2016-05-13 Thread Younos Aboulnaga (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Younos Aboulnaga closed SPARK-14638. Resolution: Information Provided > Spark task does not have access to a dependency in the

[jira] [Updated] (SPARK-14638) Spark task does not have access to a dependency in the classloader of the executor thread

2016-05-13 Thread Younos Aboulnaga (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Younos Aboulnaga updated SPARK-14638: - Description: [Edit] Editing the description because I can't comment on JIRA since

[jira] [Updated] (SPARK-15186) Add user guide for Generalized Linear Regression.

2016-05-13 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-15186: --- Assignee: Seth Hendrickson > Add user guide for Generalized Linear Regression. >

[jira] [Updated] (SPARK-14979) Add examples for GeneralizedLinearRegression

2016-05-13 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-14979: --- Assignee: Yanbo Liang > Add examples for GeneralizedLinearRegression >

[jira] [Commented] (SPARK-14906) Move VectorUDT and MatrixUDT in PySpark to new ML package

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282607#comment-15282607 ] Apache Spark commented on SPARK-14906: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-14709) spark.ml API for linear SVM

2016-05-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282564#comment-15282564 ] yuhao yang commented on SPARK-14709: I put the SMO version at https://github.com/hhbyyh/SVMOnSpark.

[jira] [Commented] (SPARK-15171) Deprecate registerTempTable and add dataset.createTempView

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282560#comment-15282560 ] Apache Spark commented on SPARK-15171: -- User 'clockfly' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15244) Type of column name created with sqlContext.createDataFrame() is not consistent.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15244: Assignee: (was: Apache Spark) > Type of column name created with

[jira] [Commented] (SPARK-15244) Type of column name created with sqlContext.createDataFrame() is not consistent.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282557#comment-15282557 ] Apache Spark commented on SPARK-15244: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-15244) Type of column name created with sqlContext.createDataFrame() is not consistent.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15244: Assignee: Apache Spark > Type of column name created with sqlContext.createDataFrame() is

[jira] [Commented] (SPARK-15244) Type of column name created with sqlContext.createDataFrame() is not consistent.

2016-05-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282546#comment-15282546 ] Dongjoon Hyun commented on SPARK-15244: --- Hi, [~k-yokoshi]. I can fix this consistently. I'll make

[jira] [Assigned] (SPARK-15313) EmbedSerializerInFilter rule should keep exprIds of output of surrounded SerializeFromObject.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15313: Assignee: Apache Spark > EmbedSerializerInFilter rule should keep exprIds of output of

[jira] [Commented] (SPARK-15313) EmbedSerializerInFilter rule should keep exprIds of output of surrounded SerializeFromObject.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282543#comment-15282543 ] Apache Spark commented on SPARK-15313: -- User 'ueshin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15313) EmbedSerializerInFilter rule should keep exprIds of output of surrounded SerializeFromObject.

2016-05-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15313: Assignee: (was: Apache Spark) > EmbedSerializerInFilter rule should keep exprIds of

[jira] [Created] (SPARK-15313) EmbedSerializerInFilter rule should keep exprIds of output of surrounded SerializeFromObject.

2016-05-13 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-15313: - Summary: EmbedSerializerInFilter rule should keep exprIds of output of surrounded SerializeFromObject. Key: SPARK-15313 URL: https://issues.apache.org/jira/browse/SPARK-15313

[jira] [Commented] (SPARK-13581) LibSVM throws MatchError

2016-05-13 Thread Sandeep Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15282520#comment-15282520 ] Sandeep Singh commented on SPARK-13581: --- Can't seem to reproduce on current master {code} scala>

[jira] [Assigned] (SPARK-12972) Update org.apache.httpcomponents.httpclient

2016-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-12972: - Assignee: Sean Owen > Update org.apache.httpcomponents.httpclient >

[jira] [Updated] (SPARK-15061) Upgrade Py4J to 0.10.1

2016-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15061: -- Target Version/s: (was: 2.1.0) Priority: Minor (was: Major) > Upgrade Py4J to 0.10.1 >

[jira] [Resolved] (SPARK-12972) Update org.apache.httpcomponents.httpclient

2016-05-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12972. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13049
