[jira] [Assigned] (SPARK-12993) Remove usage of ADD_FILES in pyspark

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12993: Assignee: (was: Apache Spark) > Remove usage of ADD_FILES in pyspark >

[jira] [Assigned] (SPARK-12993) Remove usage of ADD_FILES in pyspark

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12993: Assignee: Apache Spark > Remove usage of ADD_FILES in pyspark >

[jira] [Commented] (SPARK-12993) Remove usage of ADD_FILES in pyspark

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116647#comment-15116647 ] Apache Spark commented on SPARK-12993: -- User 'zjffdu' has created a pull request for this issue:

[jira] [Commented] (SPARK-11780) Provide type aliases in org.apache.spark.sql.types for backwards compatibility

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116724#comment-15116724 ] Apache Spark commented on SPARK-11780: -- User 'maropu' has created a pull request for this issue:

[jira] [Updated] (SPARK-12993) Remove usage of ADD_FILES in pyspark

2016-01-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-12993: --- Description: environment variable ADD_FILES is created for adding python files to spark context

[jira] [Assigned] (SPARK-12937) Bloom filter serialization

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12937: Assignee: Apache Spark (was: Wenchen Fan) > Bloom filter serialization >

[jira] [Commented] (SPARK-12937) Bloom filter serialization

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116855#comment-15116855 ] Apache Spark commented on SPARK-12937: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12937) Bloom filter serialization

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12937: Assignee: Wenchen Fan (was: Apache Spark) > Bloom filter serialization >

[jira] [Commented] (SPARK-12984) Not able to read CSV file using Spark 1.4.0

2016-01-25 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116645#comment-15116645 ] Felix Cheung commented on SPARK-12984: -- You should specify 'source' - otherwise it defaults to

[jira] [Resolved] (SPARK-11922) Python API for ml.feature.QuantileDiscretizer

2016-01-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11922. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10085

[jira] [Created] (SPARK-12997) Use cast expression to perform type cast in csv

2016-01-25 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-12997: --- Summary: Use cast expression to perform type cast in csv Key: SPARK-12997 URL: https://issues.apache.org/jira/browse/SPARK-12997 Project: Spark Issue Type:

[jira] [Updated] (SPARK-12977) Factoring out StreamingListener and UI to support history UI

2016-01-25 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-12977: Attachment: screenshot-1.png > Factoring out StreamingListener and UI to support history UI >

[jira] [Created] (SPARK-12994) It is not necessary to create ExecutorAllocationManager in local mode

2016-01-25 Thread Jeff Zhang (JIRA)
Jeff Zhang created SPARK-12994: -- Summary: It is not necessary to create ExecutorAllocationManager in local mode Key: SPARK-12994 URL: https://issues.apache.org/jira/browse/SPARK-12994 Project: Spark

[jira] [Assigned] (SPARK-12995) Remove deprecate APIs from Pregel

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12995: Assignee: (was: Apache Spark) > Remove deprecate APIs from Pregel >

[jira] [Commented] (SPARK-12995) Remove deprecate APIs from Pregel

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116773#comment-15116773 ] Apache Spark commented on SPARK-12995: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12995) Remove deprecate APIs from Pregel

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12995: Assignee: Apache Spark > Remove deprecate APIs from Pregel >

[jira] [Commented] (SPARK-12977) Factoring out StreamingListener and UI to support history UI

2016-01-25 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116844#comment-15116844 ] Saisai Shao commented on SPARK-12977: - Attach the current working progress, still some problems

[jira] [Created] (SPARK-12996) CSVRelation should be based on HadoopFsRelation

2016-01-25 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-12996: --- Summary: CSVRelation should be based on HadoopFsRelation Key: SPARK-12996 URL: https://issues.apache.org/jira/browse/SPARK-12996 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12996) CSVRelation should be based on HadoopFsRelation

2016-01-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116774#comment-15116774 ] Reynold Xin commented on SPARK-12996: - cc [~hyukjin.kwon] would you be interested in fixing this? >

[jira] [Closed] (SPARK-12702) Populate statistics for DataFrame when reading CSV

2016-01-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-12702. --- Resolution: Duplicate Closing this because it is just part of SPARK-12996. > Populate statistics

[jira] [Closed] (SPARK-12670) Use spark internal utilities wherever possible

2016-01-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-12670. --- Resolution: Won't Fix Going to close this one since it is a little bit too broad. > Use spark

[jira] [Assigned] (SPARK-12968) Implement command to set current database

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12968: Assignee: (was: Apache Spark) > Implement command to set current database >

[jira] [Assigned] (SPARK-12968) Implement command to set current database

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12968: Assignee: Apache Spark > Implement command to set current database >

[jira] [Updated] (SPARK-12995) Remove deprecate APIs from Pregel

2016-01-25 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-12995: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-11806 > Remove deprecate

[jira] [Created] (SPARK-12993) Remove usage of ADD_FILES in pyspark

2016-01-25 Thread Jeff Zhang (JIRA)
Jeff Zhang created SPARK-12993: -- Summary: Remove usage of ADD_FILES in pyspark Key: SPARK-12993 URL: https://issues.apache.org/jira/browse/SPARK-12993 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-12994) It is not necessary to create ExecutorAllocationManager in local mode

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12994: Assignee: (was: Apache Spark) > It is not necessary to create

[jira] [Assigned] (SPARK-12994) It is not necessary to create ExecutorAllocationManager in local mode

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12994: Assignee: Apache Spark > It is not necessary to create ExecutorAllocationManager in local

[jira] [Commented] (SPARK-12994) It is not necessary to create ExecutorAllocationManager in local mode

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116693#comment-15116693 ] Apache Spark commented on SPARK-12994: -- User 'zjffdu' has created a pull request for this issue:

[jira] [Commented] (SPARK-12968) Implement command to set current database

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116735#comment-15116735 ] Apache Spark commented on SPARK-12968: -- User 'viirya' has created a pull request for this issue:

[jira] [Created] (SPARK-12995) Remove deprecate APIs from Pregel

2016-01-25 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-12995: Summary: Remove deprecate APIs from Pregel Key: SPARK-12995 URL: https://issues.apache.org/jira/browse/SPARK-12995 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12888) benchmark the new hash expression

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116747#comment-15116747 ] Apache Spark commented on SPARK-12888: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Updated] (SPARK-12834) Use type conversion instead of Ser/De of Pickle to transform JavaArray and JavaList

2016-01-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12834: -- Assignee: Xusen Yin > Use type conversion instead of Ser/De of Pickle to transform

[jira] [Resolved] (SPARK-12834) Use type conversion instead of Ser/De of Pickle to transform JavaArray and JavaList

2016-01-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-12834. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10772

[jira] [Resolved] (SPARK-12973) Support to set priority when submit spark application to YARN

2016-01-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12973. --- Resolution: Duplicate > Support to set priority when submit spark application to YARN >

[jira] [Commented] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115204#comment-15115204 ] Hyukjin Kwon commented on SPARK-12890: -- Actually I don't still understand what is an issue here.

[jira] [Created] (SPARK-12979) Paths are resolved relative to the local file system

2016-01-25 Thread Iulian Dragos (JIRA)
Iulian Dragos created SPARK-12979: - Summary: Paths are resolved relative to the local file system Key: SPARK-12979 URL: https://issues.apache.org/jira/browse/SPARK-12979 Project: Spark Issue

[jira] [Commented] (SPARK-12968) Implement command to set current database

2016-01-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115288#comment-15115288 ] Herman van Hovell commented on SPARK-12968: --- I don't mind if you go ahead and work on this. The

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115148#comment-15115148 ] Liang-Chi Hsieh edited comment on SPARK-12890 at 1/25/16 12:46 PM: --- As

[jira] [Commented] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115148#comment-15115148 ] Liang-Chi Hsieh commented on SPARK-12890: - As {{DataFrame.parquet}} accepts paths as parameter,

[jira] [Commented] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115257#comment-15115257 ] Takeshi Yamamuro commented on SPARK-12890: -- Ah, I see. > Spark SQL query related to only

[jira] [Created] (SPARK-12980) pyspark crash for large dataset - clone

2016-01-25 Thread Christopher Bourez (JIRA)
Christopher Bourez created SPARK-12980: -- Summary: pyspark crash for large dataset - clone Key: SPARK-12980 URL: https://issues.apache.org/jira/browse/SPARK-12980 Project: Spark Issue

[jira] [Updated] (SPARK-12980) pyspark crash for large dataset - clone

2016-01-25 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christopher Bourez updated SPARK-12980: --- Description: I installed spark 1.6 on many different computers. On Windows,

[jira] [Updated] (SPARK-12928) Oracle FLOAT datatype is not properly handled when reading via JDBC

2016-01-25 Thread Greg Michalopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Michalopoulos updated SPARK-12928: --- Description: When trying to read in a table from Oracle and saveAsParquet, an

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115076#comment-15115076 ] Takeshi Yamamuro edited comment on SPARK-12890 at 1/25/16 1:32 PM: --- I

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115204#comment-15115204 ] Hyukjin Kwon edited comment on SPARK-12890 at 1/25/16 1:44 PM: --- Actually I

[jira] [Updated] (SPARK-12980) pyspark crash for large dataset - clone

2016-01-25 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christopher Bourez updated SPARK-12980: --- Description: I installed spark 1.6 on many different computers. On Windows,

[jira] [Commented] (SPARK-10911) Executors should System.exit on clean shutdown

2016-01-25 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115225#comment-15115225 ] Thomas Graves commented on SPARK-10911: --- see the pull request for comments and discussion

[jira] [Commented] (SPARK-3611) Show number of cores for each executor in application web UI

2016-01-25 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115309#comment-15115309 ] Thomas Graves commented on SPARK-3611: -- I know the pull request was closed due to not being able to

[jira] [Assigned] (SPARK-12928) Oracle FLOAT datatype is not properly handled when reading via JDBC

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12928: Assignee: (was: Apache Spark) > Oracle FLOAT datatype is not properly handled when

[jira] [Commented] (SPARK-12928) Oracle FLOAT datatype is not properly handled when reading via JDBC

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115332#comment-15115332 ] Apache Spark commented on SPARK-12928: -- User 'poolis' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12928) Oracle FLOAT datatype is not properly handled when reading via JDBC

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12928: Assignee: Apache Spark > Oracle FLOAT datatype is not properly handled when reading via

[jira] [Commented] (SPARK-12360) Support using 64-bit long type in SparkR

2016-01-25 Thread Dmitriy Selivanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115376#comment-15115376 ] Dmitriy Selivanov commented on SPARK-12360: --- +1 for bit64 > Support using 64-bit long type in

[jira] [Commented] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115144#comment-15115144 ] Liang-Chi Hsieh commented on SPARK-12890: - For the original issue, I think it might because you

[jira] [Comment Edited] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115204#comment-15115204 ] Hyukjin Kwon edited comment on SPARK-12890 at 1/25/16 1:46 PM: --- Actually I

[jira] [Assigned] (SPARK-12492) SQL page of Spark-sql is always blank

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12492: Assignee: (was: Apache Spark) > SQL page of Spark-sql is always blank >

[jira] [Assigned] (SPARK-12492) SQL page of Spark-sql is always blank

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12492: Assignee: Apache Spark > SQL page of Spark-sql is always blank >

[jira] [Commented] (SPARK-12492) SQL page of Spark-sql is always blank

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115334#comment-15115334 ] Apache Spark commented on SPARK-12492: -- User 'KaiXinXiaoLei' has created a pull request for this

[jira] [Updated] (SPARK-12975) Throwing Exception when Bucketing Columns are part of Partitioning Columns

2016-01-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-12975: Description: When users are using partitionBy and bucketBy at the same time, some bucketing columns might

[jira] [Updated] (SPARK-12975) Throwing Exception when Bucketing Columns are part of Partitioning Columns

2016-01-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-12975: Summary: Throwing Exception when Bucketing Columns are part of Partitioning Columns (was: Eliminate

[jira] [Created] (SPARK-12984) Not able to read CSV file using Spark 1.4.0

2016-01-25 Thread Jai Murugesh Rajasekaran (JIRA)
Jai Murugesh Rajasekaran created SPARK-12984: Summary: Not able to read CSV file using Spark 1.4.0 Key: SPARK-12984 URL: https://issues.apache.org/jira/browse/SPARK-12984 Project: Spark

[jira] [Created] (SPARK-12985) Spark Hive thrift server big decimal data issue

2016-01-25 Thread Alex Liu (JIRA)
Alex Liu created SPARK-12985: Summary: Spark Hive thrift server big decimal data issue Key: SPARK-12985 URL: https://issues.apache.org/jira/browse/SPARK-12985 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-12633) Make Parameter Descriptions Consistent for PySpark MLlib Regression

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12633: -- Assignee: Vijay Kiran > Make Parameter Descriptions Consistent for PySpark MLlib Regression >

[jira] [Updated] (SPARK-12631) Make Parameter Descriptions Consistent for PySpark MLlib Clustering

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12631: -- Assignee: Bryan Cutler > Make Parameter Descriptions Consistent for PySpark MLlib Clustering >

[jira] [Created] (SPARK-12986) Fix pydoc warnings in mllib/regression.py

2016-01-25 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-12986: - Summary: Fix pydoc warnings in mllib/regression.py Key: SPARK-12986 URL: https://issues.apache.org/jira/browse/SPARK-12986 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-12631) Make Parameter Descriptions Consistent for PySpark MLlib Clustering

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12631: -- Shepherd: Xiangrui Meng > Make Parameter Descriptions Consistent for PySpark MLlib Clustering

[jira] [Updated] (SPARK-12630) Make Parameter Descriptions Consistent for PySpark MLlib Classification

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12630: -- Assignee: Vijay Kiran > Make Parameter Descriptions Consistent for PySpark MLlib

[jira] [Updated] (SPARK-12632) Make Parameter Descriptions Consistent for PySpark MLlib FPM and Recommendation

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12632: -- Assignee: somil deshmukh > Make Parameter Descriptions Consistent for PySpark MLlib FPM and >

[jira] [Updated] (SPARK-12634) Make Parameter Descriptions Consistent for PySpark MLlib Tree

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12634: -- Assignee: Vijay Kiran > Make Parameter Descriptions Consistent for PySpark MLlib Tree >

[jira] [Updated] (SPARK-12633) Make Parameter Descriptions Consistent for PySpark MLlib Regression

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12633: -- Shepherd: Bryan Cutler > Make Parameter Descriptions Consistent for PySpark MLlib Regression >

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115904#comment-15115904 ] Emlyn Corrin commented on SPARK-9740: - Thanks for the help. I've tried with {{callUDF}} and that gives

[jira] [Updated] (SPARK-12634) Make Parameter Descriptions Consistent for PySpark MLlib Tree

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12634: -- Shepherd: Bryan Cutler Target Version/s: 2.0.0 > Make Parameter Descriptions

[jira] [Updated] (SPARK-12980) pyspark crash for large dataset - clone

2016-01-25 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christopher Bourez updated SPARK-12980: --- Description: I installed spark 1.6 on many different computers. On Windows,

[jira] [Created] (SPARK-12982) SQLContext: temporary table registration does not accept valid identifier

2016-01-25 Thread Grzegorz Chilkiewicz (JIRA)
Grzegorz Chilkiewicz created SPARK-12982: Summary: SQLContext: temporary table registration does not accept valid identifier Key: SPARK-12982 URL: https://issues.apache.org/jira/browse/SPARK-12982

[jira] [Updated] (SPARK-12632) Make Parameter Descriptions Consistent for PySpark MLlib FPM and Recommendation

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12632: -- Target Version/s: 2.0.0 > Make Parameter Descriptions Consistent for PySpark MLlib FPM and >

[jira] [Updated] (SPARK-12631) Make Parameter Descriptions Consistent for PySpark MLlib Clustering

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12631: -- Target Version/s: 2.0.0 > Make Parameter Descriptions Consistent for PySpark MLlib Clustering

[jira] [Updated] (SPARK-12630) Make Parameter Descriptions Consistent for PySpark MLlib Classification

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12630: -- Target Version/s: 2.0.0 > Make Parameter Descriptions Consistent for PySpark MLlib

[jira] [Updated] (SPARK-12633) Make Parameter Descriptions Consistent for PySpark MLlib Regression

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-12633: -- Target Version/s: 2.0.0 > Make Parameter Descriptions Consistent for PySpark MLlib Regression

[jira] [Commented] (SPARK-12945) ERROR LiveListenerBus: Listener JobProgressListener threw an exception

2016-01-25 Thread Ben Huntley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115986#comment-15115986 ] Ben Huntley commented on SPARK-12945: - Also seeing this issue in 1.6.0, not limited to Web UI, as

[jira] [Commented] (SPARK-12911) Cacheing a dataframe causes array comparisons to fail (in filter / where) after 1.6

2016-01-25 Thread Stephen DiCocco (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115459#comment-15115459 ] Stephen DiCocco commented on SPARK-12911: - So we have determined one way to work around the issue

[jira] [Comment Edited] (SPARK-12970) Error in documentation on creating rows with schemas defined by structs

2016-01-25 Thread Haidar Hadi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114539#comment-15114539 ] Haidar Hadi edited comment on SPARK-12970 at 1/25/16 7:11 PM: -- sure

[jira] [Resolved] (SPARK-11965) Update user guide for RFormula feature interactions

2016-01-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11965. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10222

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115907#comment-15115907 ] Yin Huai commented on SPARK-9740: - Can you provide the full stack trace? > first/last aggregate NULL

[jira] [Comment Edited] (SPARK-12945) ERROR LiveListenerBus: Listener JobProgressListener threw an exception

2016-01-25 Thread Ben Huntley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115986#comment-15115986 ] Ben Huntley edited comment on SPARK-12945 at 1/25/16 8:41 PM: -- Also seeing

[jira] [Updated] (SPARK-12981) Dataframe distinct() followed by a filter(udf) in pyspark throws a casting error

2016-01-25 Thread Tom Arnfeld (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tom Arnfeld updated SPARK-12981: Description: We noticed a regression when testing out an upgrade of Spark 1.6 for our systems,

[jira] [Updated] (SPARK-12982) SQLContext: temporary table registration does not accept valid identifier

2016-01-25 Thread Grzegorz Chilkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grzegorz Chilkiewicz updated SPARK-12982: - Description: We have encountered very strange behavior of SparkSQL temporary

[jira] [Commented] (SPARK-12890) Spark SQL query related to only partition fields should not scan the whole data.

2016-01-25 Thread Simeon Simeonov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115558#comment-15115558 ] Simeon Simeonov commented on SPARK-12890: - [~viirya] If schema merging is the cause of the

[jira] [Updated] (SPARK-12981) Dataframe distinct() followed by a filter(udf) in pyspark throws a casting error

2016-01-25 Thread Tom Arnfeld (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tom Arnfeld updated SPARK-12981: Description: We noticed a regression when testing out an upgrade of Spark 1.6 for our systems,

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115480#comment-15115480 ] Yin Huai commented on SPARK-9740: - Can you attach your code? Also, can you try to use

[jira] [Resolved] (SPARK-12980) pyspark crash for large dataset - clone

2016-01-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12980. --- Resolution: Invalid Why is this a clone of another issue? I don't think you've specified clearly

[jira] [Closed] (SPARK-12980) pyspark crash for large dataset - clone

2016-01-25 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christopher Bourez closed SPARK-12980. -- > pyspark crash for large dataset - clone > --- > >

[jira] [Commented] (SPARK-12261) pyspark crash for large dataset

2016-01-25 Thread Christopher Bourez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115433#comment-15115433 ] Christopher Bourez commented on SPARK-12261: I think the issue is not resolved I installed

[jira] [Updated] (SPARK-12982) SQLContext: temporary table registration does not accept valid identifier

2016-01-25 Thread Grzegorz Chilkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grzegorz Chilkiewicz updated SPARK-12982: - Description: We have encountered very strange behavior of SparkSQL temporary

[jira] [Created] (SPARK-12981) Dataframe distinct() followed by a filter(udf) in pyspark throws a casting error

2016-01-25 Thread Tom Arnfeld (JIRA)
Tom Arnfeld created SPARK-12981: --- Summary: Dataframe distinct() followed by a filter(udf) in pyspark throws a casting error Key: SPARK-12981 URL: https://issues.apache.org/jira/browse/SPARK-12981

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115650#comment-15115650 ] Herman van Hovell commented on SPARK-9740: -- We are probably resolving the Hive function by

[jira] [Commented] (SPARK-12941) Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype

2016-01-25 Thread Thomas Sebastian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115716#comment-15115716 ] Thomas Sebastian commented on SPARK-12941: -- Added a pull request

[jira] [Commented] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2016-01-25 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115756#comment-15115756 ] Bryan Cutler commented on SPARK-11219: -- Regarding overall style in PySpark, I generally see single

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115659#comment-15115659 ] Herman van Hovell commented on SPARK-9740: -- Hmmm... It does have a suitable constructor. Please

[jira] [Commented] (SPARK-12941) Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype

2016-01-25 Thread Jayadevan M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115682#comment-15115682 ] Jayadevan M commented on SPARK-12941: - Working on JdbcDialect.scala > Spark-SQL JDBC Oracle dialect

[jira] [Created] (SPARK-12983) Correct metrics.properties.template

2016-01-25 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-12983: --- Summary: Correct metrics.properties.template Key: SPARK-12983 URL: https://issues.apache.org/jira/browse/SPARK-12983 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12983) Correct metrics.properties.template

2016-01-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115661#comment-15115661 ] Apache Spark commented on SPARK-12983: -- User 'BenFradet' has created a pull request for this issue:

  1   2   3   >