[jira] [Created] (SPARK-15693) Write schema definition out for file-based data sources

2016-05-31 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15693: --- Summary: Write schema definition out for file-based data sources Key: SPARK-15693 URL: https://issues.apache.org/jira/browse/SPARK-15693 Project: Spark Issue T

[jira] [Closed] (SPARK-15055) Remove setValue method on accumulators

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-15055. --- Resolution: Won't Fix > Remove setValue method on accumulators >

[jira] [Commented] (SPARK-15086) Update Java API once the Scala one is finalized

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309415#comment-15309415 ] Reynold Xin commented on SPARK-15086: - yea go ahead. I don't think we will be doing

[jira] [Commented] (SPARK-15670) Add deprecate annotation for acumulator V1 interface in JavaSparkContext class

2016-05-31 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309379#comment-15309379 ] Weichen Xu commented on SPARK-15670: OK, I'll follow SPARK-15086 jira, thanks! > Add

[jira] [Commented] (SPARK-15692) Improves the explain output of several physical plans by displaying embedded logical plan in tree style

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309371#comment-15309371 ] Apache Spark commented on SPARK-15692: -- User 'clockfly' has created a pull request f

[jira] [Commented] (SPARK-15322) update deprecate accumulator usage into accumulatorV2 in mllib

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309372#comment-15309372 ] Apache Spark commented on SPARK-15322: -- User 'HyukjinKwon' has created a pull reques

[jira] [Assigned] (SPARK-15692) Improves the explain output of several physical plans by displaying embedded logical plan in tree style

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15692: Assignee: (was: Apache Spark) > Improves the explain output of several physical plans

[jira] [Closed] (SPARK-8655) DataFrameReader#option supports more than String as value

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-8655. -- Resolution: Auto Closed Marking this as auto-closed for now. The main problem with broadening the type

[jira] [Assigned] (SPARK-15692) Improves the explain output of several physical plans by displaying embedded logical plan in tree style

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15692: Assignee: Apache Spark > Improves the explain output of several physical plans by displayi

[jira] [Closed] (SPARK-11873) Regression for TPC-DS query 63 when used with decimal datatype and windows function

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-11873. --- Resolution: Auto Closed Marking as auto closed since the code path has changed quite a bit. > Regre

[jira] [Commented] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309368#comment-15309368 ] Reynold Xin commented on SPARK-14525: - [~JustinPihony] sorry for the delay. I think t

[jira] [Created] (SPARK-15692) Improves the explain output of several physical plans by displaying embedded logical plan in tree style

2016-05-31 Thread Sean Zhong (JIRA)
Sean Zhong created SPARK-15692: -- Summary: Improves the explain output of several physical plans by displaying embedded logical plan in tree style Key: SPARK-15692 URL: https://issues.apache.org/jira/browse/SPARK-1569

[jira] [Closed] (SPARK-2973) Use LocalRelation for all ExecutedCommands, avoid job for take/collect()

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-2973. -- Resolution: Later > Use LocalRelation for all ExecutedCommands, avoid job for take/collect() > -

[jira] [Updated] (SPARK-15691) Refactor and improve Hive support

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15691: Description: Hive support is important to Spark SQL, as many Spark users use it to read from Hive.

[jira] [Updated] (SPARK-15691) Refactor and improve Hive support

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15691: Description: Hive support is important to Spark SQL, as many Spark users use it to read from Hive.

[jira] [Updated] (SPARK-15691) Refactor and improve Hive support

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15691: Summary: Refactor and improve Hive support (was: Refactor Hive support) > Refactor and improve Hiv

[jira] [Assigned] (SPARK-14343) Dataframe operations on a partitioned dataset (using partition discovery) return invalid results

2016-05-31 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reassigned SPARK-14343: -- Assignee: Cheng Lian > Dataframe operations on a partitioned dataset (using partition discover

[jira] [Updated] (SPARK-15691) Refactor Hive support

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15691: Description: Hive support is important to Spark SQL, as many Spark users use it to read from Hive.

[jira] [Created] (SPARK-15691) Refactor Hive support

2016-05-31 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15691: --- Summary: Refactor Hive support Key: SPARK-15691 URL: https://issues.apache.org/jira/browse/SPARK-15691 Project: Spark Issue Type: New Feature Compone

[jira] [Resolved] (SPARK-14441) Consolidate DDL tests

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14441. -- Resolution: Later seems we will not do it for 2.0.0. Let's resolve it as "Later". > Consolidate DDL te

[jira] [Resolved] (SPARK-14118) Implement DDL/DML commands for Spark 2.0

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14118. -- Resolution: Fixed Fix Version/s: 2.0.0 > Implement DDL/DML commands for Spark 2.0 >

[jira] [Updated] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15690: Description: Spark's current shuffle implementation sorts all intermediate data by their partition

[jira] [Comment Edited] (SPARK-15582) Support for Groovy closures

2016-05-31 Thread Catalin Alexandru Zamfir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308036#comment-15308036 ] Catalin Alexandru Zamfir edited comment on SPARK-15582 at 6/1/16 5:45 AM: -

[jira] [Updated] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15690: Summary: Fast single-node (single-process) in-memory shuffle (was: Fast single-node in-memory shu

[jira] [Comment Edited] (SPARK-15582) Support for Groovy closures

2016-05-31 Thread Catalin Alexandru Zamfir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308036#comment-15308036 ] Catalin Alexandru Zamfir edited comment on SPARK-15582 at 6/1/16 5:43 AM: -

[jira] [Created] (SPARK-15690) Fast single-node in-memory shuffle

2016-05-31 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15690: --- Summary: Fast single-node in-memory shuffle Key: SPARK-15690 URL: https://issues.apache.org/jira/browse/SPARK-15690 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-14343) Dataframe operations on a partitioned dataset (using partition discovery) return invalid results

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309317#comment-15309317 ] Apache Spark commented on SPARK-14343: -- User 'liancheng' has created a pull request

[jira] [Assigned] (SPARK-14343) Dataframe operations on a partitioned dataset (using partition discovery) return invalid results

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14343: Assignee: (was: Apache Spark) > Dataframe operations on a partitioned dataset (using p

[jira] [Assigned] (SPARK-14343) Dataframe operations on a partitioned dataset (using partition discovery) return invalid results

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14343: Assignee: Apache Spark > Dataframe operations on a partitioned dataset (using partition di

[jira] [Comment Edited] (SPARK-15582) Support for Groovy closures

2016-05-31 Thread Catalin Alexandru Zamfir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308036#comment-15308036 ] Catalin Alexandru Zamfir edited comment on SPARK-15582 at 6/1/16 5:41 AM: -

[jira] [Closed] (SPARK-11448) We should skip caching part-files in ParquetRelation when configured to merge schema and respect summaries

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-11448. --- Resolution: Auto Closed > We should skip caching part-files in ParquetRelation when configured to mer

[jira] [Comment Edited] (SPARK-15582) Support for Groovy closures

2016-05-31 Thread Catalin Alexandru Zamfir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308036#comment-15308036 ] Catalin Alexandru Zamfir edited comment on SPARK-15582 at 6/1/16 5:36 AM: -

[jira] [Created] (SPARK-15689) Data source API v2

2016-05-31 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15689: --- Summary: Data source API v2 Key: SPARK-15689 URL: https://issues.apache.org/jira/browse/SPARK-15689 Project: Spark Issue Type: New Feature Components

[jira] [Closed] (SPARK-14827) Spark SQL run on Hive table reports The specified datastore driver ("com.mysql.jdbc.Driver") was not found in the CLASSPATH.

2016-05-31 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang closed SPARK-14827. -- Resolution: Won't Fix > Spark SQL run on Hive table reports The specified datastore driver > ("com.mys

[jira] [Commented] (SPARK-14827) Spark SQL run on Hive table reports The specified datastore driver ("com.mysql.jdbc.Driver") was not found in the CLASSPATH.

2016-05-31 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309304#comment-15309304 ] Jeff Zhang commented on SPARK-14827: It is not a bug as the exception is clear that j

[jira] [Updated] (SPARK-15687) Columnar execution engine

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15687: Description: This ticket tracks progress in making the entire engine columnar, especially in the c

[jira] [Commented] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-05-31 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309299#comment-15309299 ] Xiao Li commented on SPARK-15688: - Yeah, will ask my teammate to do it ASAP. Thanks! > R

[jira] [Updated] (SPARK-15687) Columnar execution engine

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15687: Description: This ticket tracks progress in making the entire engine columnar, especially in the c

[jira] [Commented] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309293#comment-15309293 ] Yin Huai commented on SPARK-15688: -- [~smilegator] Anyone from your side can take this?

[jira] [Updated] (SPARK-15687) Columnar execution engine

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15687: Summary: Columnar execution engine (was: Fully columnar execution engine) > Columnar execution eng

[jira] [Updated] (SPARK-15687) Columnar execution engine

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15687: Priority: Critical (was: Major) > Columnar execution engine > - > >

[jira] [Created] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-05-31 Thread Yin Huai (JIRA)
Yin Huai created SPARK-15688: Summary: RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions. Key: SPARK-15688 URL: https://issues.apache.org/jira/browse

[jira] [Updated] (SPARK-15687) Fully columnar execution engine

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15687: Summary: Fully columnar execution engine (was: Columnar execution engine) > Fully columnar executi

[jira] [Created] (SPARK-15687) Columnar execution engine

2016-05-31 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15687: --- Summary: Columnar execution engine Key: SPARK-15687 URL: https://issues.apache.org/jira/browse/SPARK-15687 Project: Spark Issue Type: New Feature Com

[jira] [Updated] (SPARK-15685) StackOverflowError (VirtualMachineError) or NoClassDefFoundError (LinkageError) should not System.exit() in local mode

2016-05-31 Thread Brett Randall (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brett Randall updated SPARK-15685: -- Description: Spark, when running in local mode, can encounter certain types of {{Error}} excep

[jira] [Commented] (SPARK-15530) Partitioning discovery logic HadoopFsRelation should use a higher setting of parallelism

2016-05-31 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309261#comment-15309261 ] Takeshi Yamamuro commented on SPARK-15530: -- Oh, I see. Okay, then I'll check the

[jira] [Commented] (SPARK-15032) When we create a new JDBC session, we may need to create a new session of executionHive

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309253#comment-15309253 ] Yin Huai commented on SPARK-15032: -- I did a quick check today. Seems our current master

[jira] [Resolved] (SPARK-15032) When we create a new JDBC session, we may need to create a new session of executionHive

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15032. -- Resolution: Not A Problem > When we create a new JDBC session, we may need to create a new session of

[jira] [Commented] (SPARK-15530) Partitioning discovery logic HadoopFsRelation should use a higher setting of parallelism

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309250#comment-15309250 ] Yin Huai commented on SPARK-15530: -- The doc of that conf says "The degree of parallelism

[jira] [Commented] (SPARK-15672) R programming guide update

2016-05-31 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309249#comment-15309249 ] Shivaram Venkataraman commented on SPARK-15672: --- Thanks [~GayathriMurali] -

[jira] [Commented] (SPARK-15530) Partitioning discovery logic HadoopFsRelation should use a higher setting of parallelism

2016-05-31 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309246#comment-15309246 ] Takeshi Yamamuro commented on SPARK-15530: -- You suggest we should rename the val

[jira] [Commented] (SPARK-15530) Partitioning discovery logic HadoopFsRelation should use a higher setting of parallelism

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309240#comment-15309240 ] Yin Huai commented on SPARK-15530: -- Your change looks reasonable. How about we just take

[jira] [Commented] (SPARK-15683) spark sql local FS spark.sql.warehouse.dir throws on YARN

2016-05-31 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309139#comment-15309139 ] Saisai Shao commented on SPARK-15683: - Hi [~tgraves], please see this JIRA SPARK-1565

[jira] [Updated] (SPARK-12988) Can't drop columns that contain dots

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-12988: - Assignee: Sean Zhong > Can't drop columns that contain dots > > >

[jira] [Resolved] (SPARK-12988) Can't drop columns that contain dots

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-12988. -- Resolution: Fixed Fix Version/s: 2.0.0 > Can't drop columns that contain dots >

[jira] [Commented] (SPARK-12988) Can't drop columns that contain dots

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309135#comment-15309135 ] Yin Huai commented on SPARK-12988: -- This issue has been resolved by https://github.com/a

[jira] [Commented] (SPARK-14381) Review spark.ml parity for feature transformers

2016-05-31 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309112#comment-15309112 ] Gayathri Murali commented on SPARK-14381: - I will work on this > Review spark.ml

[jira] [Commented] (SPARK-15672) R programming guide update

2016-05-31 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309100#comment-15309100 ] Gayathri Murali commented on SPARK-15672: - [~shivaram] I am working on changing R

[jira] [Updated] (SPARK-15685) StackOverflowError (VirtualMachineError) or NoClassDefFoundError (LinkageError) should not System.exit() in local mode

2016-05-31 Thread Brett Randall (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brett Randall updated SPARK-15685: -- Description: Spark, when running in local mode, can encounter certain types of {{Error}} excep

[jira] [Commented] (SPARK-15686) Move user-facing structured streaming classes into sql.streaming

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309075#comment-15309075 ] Apache Spark commented on SPARK-15686: -- User 'rxin' has created a pull request for t

[jira] [Assigned] (SPARK-15686) Move user-facing structured streaming classes into sql.streaming

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15686: Assignee: Reynold Xin (was: Apache Spark) > Move user-facing structured streaming classes

[jira] [Assigned] (SPARK-15686) Move user-facing structured streaming classes into sql.streaming

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15686: Assignee: Apache Spark (was: Reynold Xin) > Move user-facing structured streaming classes

[jira] [Created] (SPARK-15686) Move user-facing structured streaming classes into sql.streaming

2016-05-31 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15686: --- Summary: Move user-facing structured streaming classes into sql.streaming Key: SPARK-15686 URL: https://issues.apache.org/jira/browse/SPARK-15686 Project: Spark

[jira] [Created] (SPARK-15685) StackOverflowError (VirtualMachineError) or NoClassDefFoundError (LinkageError) should not System.exit() in local mode

2016-05-31 Thread Brett Randall (JIRA)
Brett Randall created SPARK-15685: - Summary: StackOverflowError (VirtualMachineError) or NoClassDefFoundError (LinkageError) should not System.exit() in local mode Key: SPARK-15685 URL: https://issues.apache.org/j

[jira] [Updated] (SPARK-15681) Allow case-insensitiveness in sc.setLogLevel

2016-05-31 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Wu updated SPARK-15681: --- Description: Currently SparkContext API setLogLevel(level: String) can not handle lower case or mixed case i

[jira] [Updated] (SPARK-15681) Allow case-insensitiveness in sc.setLogLevel

2016-05-31 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Wu updated SPARK-15681: --- Summary: Allow case-insensitiveness in sc.setLogLevel (was: Allow case-insensitiveness in sc.setLogLevel and

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-05-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309033#comment-15309033 ] Joseph K. Bradley commented on SPARK-15581: --- The roadmap links to [SPARK-4591],

[jira] [Updated] (SPARK-15581) MLlib 2.1 Roadmap

2016-05-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15581: -- Description: This is a master list for MLlib improvements we are working on for the nex

[jira] [Commented] (SPARK-12347) Write script to run all MLlib examples for testing

2016-05-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15309031#comment-15309031 ] Joseph K. Bradley commented on SPARK-12347: --- [~junzheng] Feel free to work on i

[jira] [Updated] (SPARK-12347) Write script to run all MLlib examples for testing

2016-05-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12347: -- Target Version/s: 2.1.0 Priority: Critical (was: Major) > Write script to

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15605: -- Priority: Major (was: Minor) > ML JavaDeveloperApiExample is broken >

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15605: -- Affects Version/s: 2.0.0 > ML JavaDeveloperApiExample is broken > -

[jira] [Updated] (SPARK-15678) Not use cache on appends and overwrites

2016-05-31 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-15678: --- Summary: Not use cache on appends and overwrites (was: Drop cache on appends and overwrites)

[jira] [Commented] (SPARK-15675) Implicit resolution doesn't work in multiple Statements in Spark Repl

2016-05-31 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308998#comment-15308998 ] Prashant Sharma commented on SPARK-15675: - Looks like https://github.com/scala/sc

[jira] [Updated] (SPARK-14343) Dataframe operations on a partitioned dataset (using partition discovery) return invalid results

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-14343: - Issue Type: Bug (was: Sub-task) Parent: (was: SPARK-15631) > Dataframe operations on a parti

[jira] [Updated] (SPARK-14343) Dataframe operations on a partitioned dataset (using partition discovery) return invalid results

2016-05-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-14343: - Target Version/s: 2.0.0 Priority: Critical (was: Blocker) > Dataframe operations on a partit

[jira] [Resolved] (SPARK-15601) CircularBuffer's toString() to print only the contents written if buffer isn't full

2016-05-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15601. --- Resolution: Fixed Fix Version/s: 1.6.2 2.0.0 Issue resolved by pull request

[jira] [Commented] (SPARK-15675) Implicit resolution doesn't work in multiple Statements in Spark Repl

2016-05-31 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308989#comment-15308989 ] Prashant Sharma commented on SPARK-15675: - Interesting, I am looking into it. I w

[jira] [Resolved] (SPARK-15236) No way to disable Hive support in REPL

2016-05-31 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15236. --- Resolution: Fixed Fix Version/s: 2.0.0 > No way to disable Hive support in REPL >

[jira] [Resolved] (SPARK-15618) Use SparkSession.builder.sparkContext(...) in tests where possible

2016-05-31 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15618. --- Resolution: Fixed Fix Version/s: 2.0.0 > Use SparkSession.builder.sparkContext(...) in tests w

[jira] [Updated] (SPARK-15236) No way to disable Hive support in REPL

2016-05-31 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15236: -- Assignee: Xin Wu > No way to disable Hive support in REPL > -- > >

[jira] [Assigned] (SPARK-12666) spark-shell --packages cannot load artifacts which are publishLocal'd by SBT

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12666: Assignee: Apache Spark > spark-shell --packages cannot load artifacts which are publishLoc

[jira] [Commented] (SPARK-12666) spark-shell --packages cannot load artifacts which are publishLocal'd by SBT

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308965#comment-15308965 ] Apache Spark commented on SPARK-12666: -- User 'BryanCutler' has created a pull reques

[jira] [Assigned] (SPARK-12666) spark-shell --packages cannot load artifacts which are publishLocal'd by SBT

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12666: Assignee: (was: Apache Spark) > spark-shell --packages cannot load artifacts which are

[jira] [Updated] (SPARK-15670) Add deprecate annotation for acumulator V1 interface in JavaSparkContext class

2016-05-31 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15670: -- Assignee: Weichen Xu > Add deprecate annotation for acumulator V1 interface in JavaSparkContext class >

[jira] [Resolved] (SPARK-15670) Add deprecate annotation for acumulator V1 interface in JavaSparkContext class

2016-05-31 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15670. --- Resolution: Fixed Fix Version/s: 2.0.0 Target Version/s: 2.0.0 > Add deprecate annot

[jira] [Resolved] (SPARK-15680) Disable comments in generated code in order to avoid performance issues

2016-05-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15680. - Resolution: Fixed Fix Version/s: 2.0.0 > Disable comments in generated code in order to av

[jira] [Resolved] (SPARK-15662) Add since annotation for classes in sql.catalog

2016-05-31 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-15662. --- Resolution: Fixed Fix Version/s: 2.0.0 > Add since annotation for classes in sql.catalog > ---

[jira] [Created] (SPARK-15684) Not mask startsWith and endsWith in R

2016-05-31 Thread Miao Wang (JIRA)
Miao Wang created SPARK-15684: - Summary: Not mask startsWith and endsWith in R Key: SPARK-15684 URL: https://issues.apache.org/jira/browse/SPARK-15684 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-12666) spark-shell --packages cannot load artifacts which are publishLocal'd by SBT

2016-05-31 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308950#comment-15308950 ] Bryan Cutler commented on SPARK-12666: -- This seems like it is more of an issue with

[jira] [Commented] (SPARK-15684) Not mask startsWith and endsWith in R

2016-05-31 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308940#comment-15308940 ] Miao Wang commented on SPARK-15684: --- [~yanboliang] Per our discussion, this one is simi

[jira] [Resolved] (SPARK-15451) Spark PR builder should fail if code doesn't compile against JDK 7

2016-05-31 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-15451. Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 2.0.0 > Spark PR

[jira] [Updated] (SPARK-11474) Options to jdbc load are lower cased

2016-05-31 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-11474: Component/s: (was: Input/Output) SQL > Options to jdbc load are lower cased >

[jira] [Commented] (SPARK-15545) R remove non-exported unused methods, like jsonRDD

2016-05-31 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308892#comment-15308892 ] Miao Wang commented on SPARK-15545: --- I will give a try. Thanks > R remove non-exporte

[jira] [Comment Edited] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-31 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307043#comment-15307043 ] Takeshi Yamamuro edited comment on SPARK-15654 at 5/31/16 11:10 PM: ---

[jira] [Commented] (SPARK-6320) Adding new query plan strategy to SQLContext

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308824#comment-15308824 ] Apache Spark commented on SPARK-6320: - User 'ueshin' has created a pull request for th

[jira] [Commented] (SPARK-15441) dataset outer join seems to return incorrect result

2016-05-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308815#comment-15308815 ] Apache Spark commented on SPARK-15441: -- User 'cloud-fan' has created a pull request

[jira] [Commented] (SPARK-15447) Performance test for ALS in Spark 2.0

2016-05-31 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15308797#comment-15308797 ] Nick Pentreath commented on SPARK-15447: Created a Google sheet with initial resu

[jira] [Updated] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-31 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-15557: --- Assignee: Dilip Biswal > expression ((cast(99 as decimal) + '3') * '2.3' ) return null >

[jira] [Resolved] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-31 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-15557. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13368 [https://github.

  1   2   3   >