[jira] [Commented] (SPARK-14231) JSON data source fails to infer floats as decimal when precision is bigger than 38 or scale is bigger than precision.

2016-03-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215486#comment-15215486 ] Hyukjin Kwon commented on SPARK-14231: -- [~rxin] I can work on this but would you maybe confirm

[jira] [Updated] (SPARK-14231) JSON data source fails to infer floats as decimal when precision is bigger than 38 or scale is bigger than precision.

2016-03-28 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-14231: - Description: Currently, JSON data source supports {{floatAsBigDecimal}} option, which reads

[jira] [Created] (SPARK-14231) JSON data source fails to infer floats as decimal when precision is bigger than 38 or scale is bigger than precision.

2016-03-28 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-14231: Summary: JSON data source fails to infer floats as decimal when precision is bigger than 38 or scale is bigger than precision. Key: SPARK-14231 URL:

[jira] [Commented] (SPARK-14165) NoSuchElementException: None.get when joining DataFrames with Seq of fields of different case

2016-03-28 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215477#comment-15215477 ] Jacek Laskowski commented on SPARK-14165: - Go ahead! Thanks! > NoSuchElementException: None.get

[jira] [Resolved] (SPARK-14071) Change MLWritable.write to be a property

2016-03-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14071. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11945

[jira] [Updated] (SPARK-14071) Change MLWritable.write to be a property

2016-03-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14071: -- Assignee: Miao Wang > Change MLWritable.write to be a property >

[jira] [Resolved] (SPARK-11730) Feature Importance for GBT

2016-03-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11730. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11961

[jira] [Commented] (SPARK-13834) Update sbt and sbt plugins for 2.x.

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215460#comment-15215460 ] Apache Spark commented on SPARK-13834: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Updated] (SPARK-14216) ML tree models should have a standardized, reusable feature importance test

2016-03-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14216: -- Summary: ML tree models should have a standardized, reusable feature importance test

[jira] [Resolved] (SPARK-14210) Add timing metric for how long the query spent in scan

2016-03-28 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14210. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12007

[jira] [Assigned] (SPARK-14230) Config the start time (jitter) for streaming jobs

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14230: Assignee: (was: Apache Spark) > Config the start time (jitter) for streaming jobs >

[jira] [Commented] (SPARK-14230) Config the start time (jitter) for streaming jobs

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215403#comment-15215403 ] Apache Spark commented on SPARK-14230: -- User 'liyintang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14230) Config the start time (jitter) for streaming jobs

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14230: Assignee: Apache Spark > Config the start time (jitter) for streaming jobs >

[jira] [Commented] (SPARK-14153) My dataset does not provide proper predictions in ALS

2016-03-28 Thread Dulaj Rajitha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215393#comment-15215393 ] Dulaj Rajitha commented on SPARK-14153: --- The problem is by changing the dataset, why the best

[jira] [Comment Edited] (SPARK-14153) My dataset does not provide proper predictions in ALS

2016-03-28 Thread Dulaj Rajitha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215390#comment-15215390 ] Dulaj Rajitha edited comment on SPARK-14153 at 3/29/16 4:15 AM: I changed

[jira] [Commented] (SPARK-14153) My dataset does not provide proper predictions in ALS

2016-03-28 Thread Dulaj Rajitha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215390#comment-15215390 ] Dulaj Rajitha commented on SPARK-14153: --- I changed only this line which will chane the data set

[jira] [Commented] (SPARK-14153) My dataset does not provide proper predictions in ALS

2016-03-28 Thread Dulaj Rajitha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215388#comment-15215388 ] Dulaj Rajitha commented on SPARK-14153: --- This is the code:

[jira] [Updated] (SPARK-14219) Fix `pickRandomVertex` not to fall into infinite loops for graphs with one vertex

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14219: Fix Version/s: 1.6.2 > Fix `pickRandomVertex` not to fall into infinite loops for graphs with one

[jira] [Commented] (SPARK-4563) Allow spark driver to bind to different ip then advertise ip

2016-03-28 Thread Joe Eloff (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215371#comment-15215371 ] Joe Eloff commented on SPARK-4563: -- I have the same issue developing against Spark running on AWS while

[jira] [Created] (SPARK-14230) Config the start time (jitter) for streaming jobs

2016-03-28 Thread Liyin Tang (JIRA)
Liyin Tang created SPARK-14230: -- Summary: Config the start time (jitter) for streaming jobs Key: SPARK-14230 URL: https://issues.apache.org/jira/browse/SPARK-14230 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-13981) Improve Filter generated code to defer variable evaluation within operator

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13981. - Resolution: Fixed Assignee: Nong Li Fix Version/s: 2.0.0 > Improve Filter

[jira] [Commented] (SPARK-14207) Transformer for splitting a Vector/Array column into individual columns

2016-03-28 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215351#comment-15215351 ] yuhao yang commented on SPARK-14207: I'm thinking the first version can implement the basic function

[jira] [Resolved] (SPARK-14213) Migrate HiveQl parsing to ANTLR4 parser

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14213. - Resolution: Fixed Assignee: Herman van Hovell > Migrate HiveQl parsing to ANTLR4 parser >

[jira] [Commented] (SPARK-13784) Model export/import for spark.ml: RandomForests

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215346#comment-15215346 ] Apache Spark commented on SPARK-13784: -- User 'GayathriMurali' has created a pull request for this

[jira] [Assigned] (SPARK-13784) Model export/import for spark.ml: RandomForests

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13784: Assignee: Apache Spark > Model export/import for spark.ml: RandomForests >

[jira] [Assigned] (SPARK-13784) Model export/import for spark.ml: RandomForests

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13784: Assignee: (was: Apache Spark) > Model export/import for spark.ml: RandomForests >

[jira] [Commented] (SPARK-14165) NoSuchElementException: None.get when joining DataFrames with Seq of fields of different case

2016-03-28 Thread Subhobrata Dey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215333#comment-15215333 ] Subhobrata Dey commented on SPARK-14165: Hi [~jlaskowski], If nobody is working on this issue, I

[jira] [Created] (SPARK-14229) PySpark DataFrame.rdd's can't be saved to an arbitrary Hadoop OutputFormat

2016-03-28 Thread Russell Jurney (JIRA)
Russell Jurney created SPARK-14229: -- Summary: PySpark DataFrame.rdd's can't be saved to an arbitrary Hadoop OutputFormat Key: SPARK-14229 URL: https://issues.apache.org/jira/browse/SPARK-14229

[jira] [Assigned] (SPARK-14227) [SQL] Add method for printing out generated code for debugging

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14227: Assignee: Apache Spark > [SQL] Add method for printing out generated code for debugging >

[jira] [Assigned] (SPARK-14227) [SQL] Add method for printing out generated code for debugging

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14227: Assignee: (was: Apache Spark) > [SQL] Add method for printing out generated code for

[jira] [Commented] (SPARK-14227) [SQL] Add method for printing out generated code for debugging

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215332#comment-15215332 ] Apache Spark commented on SPARK-14227: -- User 'ericl' has created a pull request for this issue:

[jira] [Created] (SPARK-14228) Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped

2016-03-28 Thread meiyoula (JIRA)
meiyoula created SPARK-14228: Summary: Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped Key: SPARK-14228 URL:

[jira] [Created] (SPARK-14227) [SQL] Add method for printing out generated code for debugging

2016-03-28 Thread Eric Liang (JIRA)
Eric Liang created SPARK-14227: -- Summary: [SQL] Add method for printing out generated code for debugging Key: SPARK-14227 URL: https://issues.apache.org/jira/browse/SPARK-14227 Project: Spark

[jira] [Commented] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-28 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215315#comment-15215315 ] Jakob Odersky commented on SPARK-7992: -- [~mengxr], I just submitted [another

[jira] [Commented] (SPARK-12792) Refactor RRDD to support R UDF

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215301#comment-15215301 ] Apache Spark commented on SPARK-12792: -- User 'sun-rui' has created a pull request for this issue:

[jira] [Commented] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-28 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215288#comment-15215288 ] Josh Rosen commented on SPARK-11416: To my knowledge, we're going to want to to use a version of Kryo

[jira] [Commented] (SPARK-14222) Remove jackson-module-scala dependency or cross-publish Jackson for Scala 2.12

2016-03-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215286#comment-15215286 ] Marcelo Vanzin commented on SPARK-14222: bq. the extra verbosity required by other approaches is

[jira] [Commented] (SPARK-14037) count(df) is very slow for dataframe constrcuted using SparkR::createDataFrame

2016-03-28 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215278#comment-15215278 ] Sun Rui commented on SPARK-14037: - It's weird if you haven't changed any configuration. Could you check

[jira] [Commented] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-28 Thread Oscar Boykin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215277#comment-15215277 ] Oscar Boykin commented on SPARK-11416: -- Why? We can publish a chill on the old kryo for scala 2.12

[jira] [Commented] (SPARK-14192) Executor is dead in Driver but alive in AM when driver losts rpc with executor, but executor is alive.

2016-03-28 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215273#comment-15215273 ] meiyoula commented on SPARK-14192: -- The master branch of 2016/3/28 > Executor is dead in Driver but

[jira] [Commented] (SPARK-14057) sql time stamps do not respect time zones

2016-03-28 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215249#comment-15215249 ] Vijay Parmar commented on SPARK-14057: -- Thanks Andrew. I will start looking into it. I just have a

[jira] [Commented] (SPARK-14057) sql time stamps do not respect time zones

2016-03-28 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215248#comment-15215248 ] Vijay Parmar commented on SPARK-14057: -- Thanks Andrew. I will start looking into it. I just have a

[jira] [Issue Comment Deleted] (SPARK-14057) sql time stamps do not respect time zones

2016-03-28 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vijay Parmar updated SPARK-14057: - Comment: was deleted (was: Thanks Andrew. I will start looking into it. I just have a question

[jira] [Resolved] (SPARK-14205) remove trait Queryable

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14205. - Resolution: Fixed Fix Version/s: 2.0.0 > remove trait Queryable > --

[jira] [Created] (SPARK-14226) Caching a table with 1,100 columns and a few million rows fails

2016-03-28 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-14226: -- Summary: Caching a table with 1,100 columns and a few million rows fails Key: SPARK-14226 URL: https://issues.apache.org/jira/browse/SPARK-14226 Project: Spark

[jira] [Commented] (SPARK-14225) Cap the length of toCommentSafeString at 128 chars

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215211#comment-15215211 ] Reynold Xin commented on SPARK-14225: - cc [~sameerag] too we probably need to fix the code formatter

[jira] [Commented] (SPARK-14224) Cannot project all columns from a table with ~1,100 columns

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215210#comment-15215210 ] Reynold Xin commented on SPARK-14224: - code to reproduce {code}

[jira] [Updated] (SPARK-14224) Cannot project all columns from a table with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hossein Falaki updated SPARK-14224: --- Target Version/s: 2.0.0 > Cannot project all columns from a table with ~1,100 columns >

[jira] [Commented] (SPARK-14224) Cannot project all columns from a table with ~1,100 columns

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215189#comment-15215189 ] Reynold Xin commented on SPARK-14224: - The Parquet file used for this query can be found in

[jira] [Commented] (SPARK-14225) Cap the length of toCommentSafeString at 128 chars

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215184#comment-15215184 ] Apache Spark commented on SPARK-14225: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14225) Cap the length of toCommentSafeString at 128 chars

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14225: Assignee: Apache Spark (was: Reynold Xin) > Cap the length of toCommentSafeString at 128

[jira] [Assigned] (SPARK-14225) Cap the length of toCommentSafeString at 128 chars

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14225: Assignee: Reynold Xin (was: Apache Spark) > Cap the length of toCommentSafeString at 128

[jira] [Created] (SPARK-14225) Cap the length of toCommentSafeString at 128 chars

2016-03-28 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-14225: --- Summary: Cap the length of toCommentSafeString at 128 chars Key: SPARK-14225 URL: https://issues.apache.org/jira/browse/SPARK-14225 Project: Spark Issue Type:

[jira] [Commented] (SPARK-14219) Fix `pickRandomVertex` not to fall into infinite loops for graphs with one vertex

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215183#comment-15215183 ] Apache Spark commented on SPARK-14219: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Updated] (SPARK-14224) Cannot project all columns from a table with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hossein Falaki updated SPARK-14224: --- Description: I created a temporary table from 1000 genomes dataset and cached it. When I try

[jira] [Updated] (SPARK-14223) Cannot project all columns from a parquet files with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hossein Falaki updated SPARK-14223: --- Description: The parquet file is generated by saving first 10 rows of the 1000 genomes

[jira] [Updated] (SPARK-14223) Cannot project all columns from a parquet files with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hossein Falaki updated SPARK-14223: --- Description: The parquet file is generated by saving first 10 rows of the 1000 genomes

[jira] [Updated] (SPARK-14223) Cannot project all columns from a parquet files with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hossein Falaki updated SPARK-14223: --- Description: The parquet file is generated by saving first 10 rows of the 1000 genomes

[jira] [Updated] (SPARK-14223) Cannot project all columns from a parquet files with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hossein Falaki updated SPARK-14223: --- Affects Version/s: 2.0.0 Target Version/s: (was: 2.0.0) > Cannot project all columns

[jira] [Created] (SPARK-14224) Cannot project all columns from a table with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-14224: -- Summary: Cannot project all columns from a table with ~1,100 columns Key: SPARK-14224 URL: https://issues.apache.org/jira/browse/SPARK-14224 Project: Spark

[jira] [Updated] (SPARK-14223) Cannot project all columns from a parquet files with ~1,100 columns

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14223: Priority: Critical (was: Major) > Cannot project all columns from a parquet files with ~1,100

[jira] [Updated] (SPARK-14223) Cannot project all columns from a parquet files with ~1,100 columns

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14223: Component/s: SQL > Cannot project all columns from a parquet files with ~1,100 columns >

[jira] [Updated] (SPARK-14223) Cannot project all columns from a parquet files with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hossein Falaki updated SPARK-14223: --- Attachment: 1000genomes.gz.parquet > Cannot project all columns from a parquet files with

[jira] [Created] (SPARK-14223) Cannot project all columns from a parquet files with ~1,100 columns

2016-03-28 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-14223: -- Summary: Cannot project all columns from a parquet files with ~1,100 columns Key: SPARK-14223 URL: https://issues.apache.org/jira/browse/SPARK-14223 Project:

[jira] [Assigned] (SPARK-13786) Pyspark ml.tuning support export/import

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13786: Assignee: (was: Apache Spark) > Pyspark ml.tuning support export/import >

[jira] [Commented] (SPARK-13786) Pyspark ml.tuning support export/import

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215167#comment-15215167 ] Apache Spark commented on SPARK-13786: -- User 'yinxusen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13786) Pyspark ml.tuning support export/import

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13786: Assignee: Apache Spark > Pyspark ml.tuning support export/import >

[jira] [Resolved] (SPARK-14219) Fix `pickRandomVertex` not to fall into infinite loops for graphs with one vertex

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14219. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Fix

[jira] [Created] (SPARK-14222) Remove jackson-module-scala dependency or cross-publish Jackson for Scala 2.12

2016-03-28 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-14222: -- Summary: Remove jackson-module-scala dependency or cross-publish Jackson for Scala 2.12 Key: SPARK-14222 URL: https://issues.apache.org/jira/browse/SPARK-14222 Project:

[jira] [Commented] (SPARK-14218) dataset show() does not display column names in the correct order if underlying data frame schema order is different from the encoder schema order.

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215131#comment-15215131 ] Apache Spark commented on SPARK-14218: -- User 'sureshthalamati' has created a pull request for this

[jira] [Assigned] (SPARK-14218) dataset show() does not display column names in the correct order if underlying data frame schema order is different from the encoder schema order.

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14218: Assignee: (was: Apache Spark) > dataset show() does not display column names in the

[jira] [Assigned] (SPARK-14218) dataset show() does not display column names in the correct order if underlying data frame schema order is different from the encoder schema order.

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14218: Assignee: Apache Spark > dataset show() does not display column names in the correct

[jira] [Commented] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-28 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215125#comment-15215125 ] Josh Rosen commented on SPARK-11416: Yes, please open a pull request against master. > Upgrade kryo

[jira] [Resolved] (SPARK-13447) Fix AM failure situation for dynamic allocation disabled situation

2016-03-28 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-13447. --- Resolution: Fixed Assignee: Saisai Shao Fix Version/s: 2.0.0 Target

[jira] [Commented] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-28 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215124#comment-15215124 ] Josh Rosen commented on SPARK-11416: Upgrading to Kryo 3 is now a blocker for building Spark against

[jira] [Created] (SPARK-14221) Cross-publish Chill for Scala 2.12

2016-03-28 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-14221: -- Summary: Cross-publish Chill for Scala 2.12 Key: SPARK-14221 URL: https://issues.apache.org/jira/browse/SPARK-14221 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-14220) Build and test Spark against Scala 2.12

2016-03-28 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-14220: -- Summary: Build and test Spark against Scala 2.12 Key: SPARK-14220 URL: https://issues.apache.org/jira/browse/SPARK-14220 Project: Spark Issue Type: Umbrella

[jira] [Updated] (SPARK-13845) BlockStatus and StreamBlockId keep on growing result driver OOM

2016-03-28 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-13845: -- Target Version/s: 1.6.2, 2.0.0 (was: 1.6.1, 2.0.0) > BlockStatus and StreamBlockId keep on growing

[jira] [Assigned] (SPARK-14219) Fix `pickRandomVertex` not to fall into infinite loops for graphs with one vertex

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14219: Assignee: (was: Apache Spark) > Fix `pickRandomVertex` not to fall into infinite

[jira] [Commented] (SPARK-14219) Fix `pickRandomVertex` not to fall into infinite loops for graphs with one vertex

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215117#comment-15215117 ] Apache Spark commented on SPARK-14219: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-14219) Fix `pickRandomVertex` not to fall into infinite loops for graphs with one vertex

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14219: Assignee: Apache Spark > Fix `pickRandomVertex` not to fall into infinite loops for

[jira] [Updated] (SPARK-13845) BlockStatus and StreamBlockId keep on growing result driver OOM

2016-03-28 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-13845: -- Target Version/s: 1.6.1, 2.0.0 Fix Version/s: 2.0.0 > BlockStatus and StreamBlockId keep on

[jira] [Created] (SPARK-14219) Fix `pickRandomVertex` not to fall into infinite loops for graphs with one vertex

2016-03-28 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-14219: - Summary: Fix `pickRandomVertex` not to fall into infinite loops for graphs with one vertex Key: SPARK-14219 URL: https://issues.apache.org/jira/browse/SPARK-14219

[jira] [Commented] (SPARK-14218) dataset show() does not display column names in the correct order if underlying data frame schema order is different from the encoder schema order.

2016-03-28 Thread Suresh Thalamati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215098#comment-15215098 ] Suresh Thalamati commented on SPARK-14218: -- I am giving a shot at submitting PR for this

[jira] [Created] (SPARK-14218) dataset show() does not display column names in the correct order if underlying data frame schema order is different from the encoder schema order.

2016-03-28 Thread Suresh Thalamati (JIRA)
Suresh Thalamati created SPARK-14218: Summary: dataset show() does not display column names in the correct order if underlying data frame schema order is different from the encoder schema order. Key: SPARK-14218

[jira] [Resolved] (SPARK-14169) Add UninterruptibleThread

2016-03-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-14169. Resolution: Fixed Fix Version/s: 2.0.0 > Add UninterruptibleThread >

[jira] [Resolved] (SPARK-14155) Hide UserDefinedType in Spark 2.0

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14155. - Resolution: Fixed Fix Version/s: 2.0.0 > Hide UserDefinedType in Spark 2.0 >

[jira] [Resolved] (SPARK-14086) Add DDL commands to ANTLR4 Parser

2016-03-28 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14086. - Resolution: Fixed Fix Version/s: 2.0.0 > Add DDL commands to ANTLR4 Parser >

[jira] [Commented] (SPARK-14217) Vectorized parquet reader produces wrong result if data used dictionary encoding fallback

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215086#comment-15215086 ] Apache Spark commented on SPARK-14217: -- User 'nongli' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14217) Vectorized parquet reader produces wrong result if data used dictionary encoding fallback

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14217: Assignee: Apache Spark > Vectorized parquet reader produces wrong result if data used

[jira] [Assigned] (SPARK-14217) Vectorized parquet reader produces wrong result if data used dictionary encoding fallback

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14217: Assignee: (was: Apache Spark) > Vectorized parquet reader produces wrong result if

[jira] [Resolved] (SPARK-14134) SQLListenerSuite fails on maven builds

2016-03-28 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-14134. Resolution: Cannot Reproduce Tests are now passing again without the need to change this,

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-28 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215076#comment-15215076 ] Shubhanshu Mishra commented on SPARK-14103: --- [~srowen] I just checked the Spark Code on github

[jira] [Created] (SPARK-14217) Vectorized parquet reader produces wrong result if data used dictionary encoding fallback

2016-03-28 Thread Nong Li (JIRA)
Nong Li created SPARK-14217: --- Summary: Vectorized parquet reader produces wrong result if data used dictionary encoding fallback Key: SPARK-14217 URL: https://issues.apache.org/jira/browse/SPARK-14217

[jira] [Reopened] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-28 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shubhanshu Mishra reopened SPARK-14103: --- [~srowen] I am reopening the issue as it is not yet resolved. I have added more details

[jira] [Comment Edited] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-28 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215064#comment-15215064 ] Shubhanshu Mishra edited comment on SPARK-14103 at 3/28/16 11:08 PM: -

[jira] [Commented] (SPARK-14103) Python DataFrame CSV load on large file is writing to console in Ipython

2016-03-28 Thread Shubhanshu Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215064#comment-15215064 ] Shubhanshu Mishra commented on SPARK-14103: --- [~srowen]thanks for the reply. As I have mentioned

[jira] [Commented] (SPARK-14163) SumEvaluator and countApprox cannot reliably handle RDDs of size 1

2016-03-28 Thread Marcin Tustin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215039#comment-15215039 ] Marcin Tustin commented on SPARK-14163: --- Reworked PR here:

[jira] [Commented] (SPARK-14163) SumEvaluator and countApprox cannot reliably handle RDDs of size 1

2016-03-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215041#comment-15215041 ] Apache Spark commented on SPARK-14163: -- User 'mtustin-handy' has created a pull request for this

[jira] [Resolved] (SPARK-11893) Model export/import for spark.ml: TrainValidationSplit

2016-03-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11893. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 9971

  1   2   3   >