[jira] [Updated] (SPARK-3880) HBase as data source to SparkSQL

2015-01-16 Thread Yan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan updated SPARK-3880: --- Attachment: SparkSQLOnHBase_v2.docx Version 2 > HBase as data source to SparkSQL > >

[jira] [Commented] (SPARK-3880) HBase as data source to SparkSQL

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281236#comment-14281236 ] Apache Spark commented on SPARK-3880: - User 'yzhou2001' has created a pull request for

[jira] [Created] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-01-16 Thread Corey J. Nolet (JIRA)
Corey J. Nolet created SPARK-5296: - Summary: Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters Key: SPARK-5296 URL: https://issues.apache.org/jira/browse/SPARK-5296 Pr

[jira] [Updated] (SPARK-5295) Only expose leaf data types

2015-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5295: --- Description: 1. We expose all the stuff in data types right now, including NumericTypes, etc. These s

[jira] [Updated] (SPARK-5295) Stabilize data types

2015-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5295: --- Summary: Stabilize data types (was: Only expose leaf data types) > Stabilize data types > ---

[jira] [Created] (SPARK-5295) Only expose leaf data types

2015-01-16 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5295: -- Summary: Only expose leaf data types Key: SPARK-5295 URL: https://issues.apache.org/jira/browse/SPARK-5295 Project: Spark Issue Type: Sub-task Componen

[jira] [Resolved] (SPARK-5193) Make Spark SQL API usable in Java and remove the Java-specific API

2015-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5193. Resolution: Fixed Fix Version/s: 1.3.0 > Make Spark SQL API usable in Java and remove the Jav

[jira] [Updated] (SPARK-5278) check ambiguous reference to fields in Spark SQL is incompleted

2015-01-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-5278: --- Summary: check ambiguous reference to fields in Spark SQL is incompleted (was: ambiguous reference to

[jira] [Commented] (SPARK-5294) Hide tables in AllStagePages for "Active Stages, Completed Stages and Failed Stages" when they are empty

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281173#comment-14281173 ] Apache Spark commented on SPARK-5294: - User 'sarutak' has created a pull request for t

[jira] [Updated] (SPARK-5293) Enable Spark user applications to use different versions of Akka

2015-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5293: --- Description: A lot of Spark user applications are using (or want to use) Akka. Akka as a whole can co

[jira] [Updated] (SPARK-5293) Enable Spark user applications to use different versions of Akka

2015-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5293: --- Description: A lot of Spark user applications are using (or want to use) Akka. Akka as a whole can co

[jira] [Created] (SPARK-5294) Hide tables in AllStagePages for "Active Stages, Completed Stages and Failed Stages" when they are empty

2015-01-16 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-5294: - Summary: Hide tables in AllStagePages for "Active Stages, Completed Stages and Failed Stages" when they are empty Key: SPARK-5294 URL: https://issues.apache.org/jira/browse/SPAR

[jira] [Commented] (SPARK-5251) Using `tableIdentifier` in hive metastore

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281170#comment-14281170 ] Apache Spark commented on SPARK-5251: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-1021) sortByKey() launches a cluster job when it shouldn't

2015-01-16 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281168#comment-14281168 ] Xuefu Zhang commented on SPARK-1021: This problem also occurred on Hive on Spark (HIVE

[jira] [Updated] (SPARK-5214) Add EventLoop and change DAGScheduler to an EventLoop

2015-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5214: --- Assignee: Shixiong Zhu > Add EventLoop and change DAGScheduler to an EventLoop > -

[jira] [Created] (SPARK-5293) Enable Spark user applications to use different versions of Akka

2015-01-16 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5293: -- Summary: Enable Spark user applications to use different versions of Akka Key: SPARK-5293 URL: https://issues.apache.org/jira/browse/SPARK-5293 Project: Spark I

[jira] [Created] (SPARK-5292) optimize join for table that are already sharded/support for hive bucket

2015-01-16 Thread gagan taneja (JIRA)
gagan taneja created SPARK-5292: --- Summary: optimize join for table that are already sharded/support for hive bucket Key: SPARK-5292 URL: https://issues.apache.org/jira/browse/SPARK-5292 Project: Spark

[jira] [Commented] (SPARK-5291) Add timestamp and reason why an executor is removed to SparkListenerExecutorAdded and SparkListenerExecutorRemoved

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14281143#comment-14281143 ] Apache Spark commented on SPARK-5291: - User 'sarutak' has created a pull request for t

[jira] [Updated] (SPARK-5291) Add timestamp and reason why an executor is removed to SparkListenerExecutorAdded and SparkListenerExecutorRemoved

2015-01-16 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-5291: -- Description: Recently SparkListenerExecutorAdded and SparkListenerExecutorRemoved are added. I t

[jira] [Updated] (SPARK-5291) Add timestamp and reason why an executor is removed to SparkListenerExecutorAdded and SparkListenerExecutorRemoved

2015-01-16 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-5291: -- Summary: Add timestamp and reason why an executor is removed to SparkListenerExecutorAdded and S

[jira] [Created] (SPARK-5291) Add timestamp and reason why an executor is removed

2015-01-16 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-5291: - Summary: Add timestamp and reason why an executor is removed Key: SPARK-5291 URL: https://issues.apache.org/jira/browse/SPARK-5291 Project: Spark Issue Typ

[jira] [Updated] (SPARK-5282) RowMatrix easily gets int overflow in the memory size warning

2015-01-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5282: - Assignee: yuhao yang > RowMatrix easily gets int overflow in the memory size warning > ---

[jira] [Commented] (SPARK-5287) NativeType.defaultSizeOf should have default sizes of all NativeTypes.

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280986#comment-14280986 ] Apache Spark commented on SPARK-5287: - User 'yhuai' has created a pull request for thi

[jira] [Commented] (SPARK-5289) Backport publishing of repl, yarn into branch-1.2

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280927#comment-14280927 ] Apache Spark commented on SPARK-5289: - User 'pwendell' has created a pull request for

[jira] [Created] (SPARK-5290) Executing functions in sparkSQL registered in sqlcontext gives scala.reflect.internal.MissingRequirementError: class org.apache.spark.sql.catalyst.ScalaReflection

2015-01-16 Thread Manoj Samel (JIRA)
Manoj Samel created SPARK-5290: -- Summary: Executing functions in sparkSQL registered in sqlcontext gives scala.reflect.internal.MissingRequirementError: class org.apache.spark.sql.catalyst.ScalaReflection Key: SPARK-5290

[jira] [Updated] (SPARK-5289) Backport publishing of repl, yarn into branch-1.2

2015-01-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5289: --- Description: In SPARK-3452 we did some clean-up of published artifacts that turned out to adv

[jira] [Updated] (SPARK-5289) Backport publishing of repl, yarn into branch-1.2

2015-01-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5289: --- Description: In SPARK-3452 we did some clean-up of published artifacts that turned out to adve

[jira] [Updated] (SPARK-5289) Backport publishing of repl, yarn into branch-1.2

2015-01-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5289: --- Summary: Backport publishing of repl, yarn into branch-1.2 (was: Backport publishing of repl,

[jira] [Updated] (SPARK-5289) Backport publishing of repl, yarn into branch-1.2

2015-01-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5289: --- Description: In SPARK-3452 we did some clean-up of published artifacts that turned out to adv

[jira] [Created] (SPARK-5289) Backport publishing of repl, yarn, and hive-thriftserver into branch-1.2

2015-01-16 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-5289: -- Summary: Backport publishing of repl, yarn, and hive-thriftserver into branch-1.2 Key: SPARK-5289 URL: https://issues.apache.org/jira/browse/SPARK-5289 Project: S

[jira] [Updated] (SPARK-5287) NativeType.defaultSizeOf should have default sizes of all NativeTypes.

2015-01-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5287: Summary: NativeType.defaultSizeOf should have default sizes of all NativeTypes. (was: NativeType.defaultSiz

[jira] [Created] (SPARK-5288) Stabilize Spark SQL data type API followup

2015-01-16 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5288: --- Summary: Stabilize Spark SQL data type API followup Key: SPARK-5288 URL: https://issues.apache.org/jira/browse/SPARK-5288 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-5287) NativeType.defaultSizeOf should have all data types.

2015-01-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5287: Description: Otherwise, we will failed to do stats estimation. (was: NativeType.all and NativeType.default

[jira] [Updated] (SPARK-5287) NativeType.defaultSizeOf should have all data types.

2015-01-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5287: Summary: NativeType.defaultSizeOf should have all data types. (was: NativeType's companion object should in

[jira] [Closed] (SPARK-733) Add documentation on use of accumulators in lazy transformation

2015-01-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid closed SPARK-733. -- Resolution: Fixed Fixed by https://github.com/apache/spark/pull/4022 > Add documentation on use of accum

[jira] [Updated] (SPARK-733) Add documentation on use of accumulators in lazy transformation

2015-01-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-733: --- Fix Version/s: 1.2.1 1.3.0 > Add documentation on use of accumulators in lazy transf

[jira] [Created] (SPARK-5287) NativeType's companion object should include all native types.

2015-01-16 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5287: --- Summary: NativeType's companion object should include all native types. Key: SPARK-5287 URL: https://issues.apache.org/jira/browse/SPARK-5287 Project: Spark Issue Typ

[jira] [Commented] (SPARK-5284) Insert into Hive throws NPE when a inner complex type field has a null value

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280835#comment-14280835 ] Apache Spark commented on SPARK-5284: - User 'yhuai' has created a pull request for thi

[jira] [Commented] (SPARK-5286) Fail to drop an invalid table when using the data source API

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280827#comment-14280827 ] Apache Spark commented on SPARK-5286: - User 'yhuai' has created a pull request for thi

[jira] [Updated] (SPARK-5286) Fail to drop an invalid table when using the data source API

2015-01-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5286: Summary: Fail to drop an invalid table when using the data source API (was: Fail to drop a invalid table wh

[jira] [Created] (SPARK-5286) Fail to drop a invalid table when using the data source API

2015-01-16 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5286: --- Summary: Fail to drop a invalid table when using the data source API Key: SPARK-5286 URL: https://issues.apache.org/jira/browse/SPARK-5286 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-16 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280802#comment-14280802 ] Corey J. Nolet edited comment on SPARK-5260 at 1/16/15 8:52 PM:

[jira] [Updated] (SPARK-4502) Spark SQL reads unneccesary nested fields from Parquet

2015-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-4502: --- Summary: Spark SQL reads unneccesary nested fields from Parquet (was: Spark SQL reads unneccesary fie

[jira] [Commented] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-16 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280802#comment-14280802 ] Corey J. Nolet commented on SPARK-5260: --- bq. you can make the change and create a p

[jira] [Comment Edited] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-16 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280802#comment-14280802 ] Corey J. Nolet edited comment on SPARK-5260 at 1/16/15 8:48 PM:

[jira] [Updated] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5260: --- Fix Version/s: (was: 1.3.0) > Expose JsonRDD.allKeysWithValueTypes() in a utility class >

[jira] [Updated] (SPARK-5270) Elegantly check if RDD is empty

2015-01-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5270: --- Target Version/s: 1.3.0 > Elegantly check if RDD is empty > --- >

[jira] [Resolved] (SPARK-4357) Modify release publishing to work with Scala 2.11

2015-01-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4357. Resolution: Fixed Sorry this is actually working now. We now publish artifacts for Scala 2.1

[jira] [Commented] (SPARK-5285) Removed GroupExpression in catalyst

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280767#comment-14280767 ] Apache Spark commented on SPARK-5285: - User 'scwf' has created a pull request for this

[jira] [Created] (SPARK-5285) Removed GroupExpression in catalyst

2015-01-16 Thread wangfei (JIRA)
wangfei created SPARK-5285: -- Summary: Removed GroupExpression in catalyst Key: SPARK-5285 URL: https://issues.apache.org/jira/browse/SPARK-5285 Project: Spark Issue Type: Improvement Comp

[jira] [Created] (SPARK-5284) Insert into Hive throws NPE when a inner complex type field has a null value

2015-01-16 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5284: --- Summary: Insert into Hive throws NPE when a inner complex type field has a null value Key: SPARK-5284 URL: https://issues.apache.org/jira/browse/SPARK-5284 Project: Spark

[jira] [Commented] (SPARK-4259) Add Spectral Clustering Algorithm with Gaussian Similarity Function

2015-01-16 Thread Andrew Musselman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280731#comment-14280731 ] Andrew Musselman commented on SPARK-4259: - Thinking of picking this up; has there

[jira] [Commented] (SPARK-4520) SparkSQL exception when reading certain columns from a parquet file

2015-01-16 Thread Tyler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280705#comment-14280705 ] Tyler commented on SPARK-4520: -- No rush. Just interested. I figured my problem was something

[jira] [Commented] (SPARK-4520) SparkSQL exception when reading certain columns from a parquet file

2015-01-16 Thread sadhan sood (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280698#comment-14280698 ] sadhan sood commented on SPARK-4520: Tyler, Alex - the problem is not with parquet but

[jira] [Reopened] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reopened SPARK-3726: -- Assignee: Manoj Kumar (was: Manish Amde) This wasn't really fixed actually; my mistak

[jira] [Resolved] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-3726. -- Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Manish Amde (was: Manoj Kumar)

[jira] [Commented] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280663#comment-14280663 ] Joseph K. Bradley commented on SPARK-3726: -- IMO I think it should be closed. I'l

[jira] [Commented] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-16 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280658#comment-14280658 ] Manoj Kumar commented on SPARK-3726: Ah I see. I had my doubts when I started looking

[jira] [Commented] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280651#comment-14280651 ] Joseph K. Bradley commented on SPARK-3726: -- Sorry! I had forgotten that this was

[jira] [Commented] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280604#comment-14280604 ] Joseph K. Bradley commented on SPARK-4766: -- That is a good point, but I'll put it

[jira] [Created] (SPARK-5283) ML sharedParams should be public

2015-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5283: Summary: ML sharedParams should be public Key: SPARK-5283 URL: https://issues.apache.org/jira/browse/SPARK-5283 Project: Spark Issue Type: Improvemen

[jira] [Closed] (SPARK-5231) History Server shows wrong job submission time.

2015-01-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5231. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Kousuke Saruta > History Server shows wrong

[jira] [Commented] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-01-16 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280571#comment-14280571 ] Peter Rudenko commented on SPARK-4766: -- Also make a traits that extends Params public

[jira] [Updated] (SPARK-5201) ParallelCollectionRDD.slice(seq, numSlices) has int overflow when dealing with inclusive range

2015-01-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5201: - Affects Version/s: (was: 1.2.0) 1.0.0 > ParallelCollectionRDD.slice(seq, numSli

[jira] [Closed] (SPARK-5201) ParallelCollectionRDD.slice(seq, numSlices) has int overflow when dealing with inclusive range

2015-01-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5201. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Ye Xianjin Target Version/s: 1.

[jira] [Closed] (SPARK-1507) Spark on Yarn: Add support for user to specify # cores for ApplicationMaster

2015-01-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-1507. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: WangTaoTheTonic Target Version/

[jira] [Commented] (SPARK-5270) Elegantly check if RDD is empty

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280491#comment-14280491 ] Apache Spark commented on SPARK-5270: - User 'srowen' has created a pull request for th

[jira] [Commented] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280486#comment-14280486 ] Apache Spark commented on SPARK-3726: - User 'MechCoder' has created a pull request for

[jira] [Commented] (SPARK-5270) Elegantly check if RDD is empty

2015-01-16 Thread Al M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280260#comment-14280260 ] Al M commented on SPARK-5270: - I don't mind at all. I'd be really happy to have such a utilit

[jira] [Commented] (SPARK-4630) Dynamically determine optimal number of partitions

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280173#comment-14280173 ] Apache Spark commented on SPARK-4630: - User 'lianhuiwang' has created a pull request f

[jira] [Commented] (SPARK-5282) RowMatrix easily gets int overflow in the memory size warning

2015-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280167#comment-14280167 ] Apache Spark commented on SPARK-5282: - User 'hhbyyh' has created a pull request for th

[jira] [Commented] (SPARK-5282) RowMatrix easily gets int overflow in the memory size warning

2015-01-16 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280159#comment-14280159 ] yuhao yang commented on SPARK-5282: --- typical wrong message: Row matrix: 17000 cloumns wi

[jira] [Created] (SPARK-5282) RowMatrix easily gets int overflow in the memory size warning

2015-01-16 Thread yuhao yang (JIRA)
yuhao yang created SPARK-5282: - Summary: RowMatrix easily gets int overflow in the memory size warning Key: SPARK-5282 URL: https://issues.apache.org/jira/browse/SPARK-5282 Project: Spark Issue

[jira] [Updated] (SPARK-5281) Registering table on RDD is giving MissingRequirementError

2015-01-16 Thread sarsol (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sarsol updated SPARK-5281: -- Component/s: SQL > Registering table on RDD is giving MissingRequirementError >

[jira] [Commented] (SPARK-5270) Elegantly check if RDD is empty

2015-01-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280145#comment-14280145 ] Sean Owen commented on SPARK-5270: -- I think it would be nice to have a utility method lik

[jira] [Updated] (SPARK-5281) Registering table on RDD is giving MissingRequirementError

2015-01-16 Thread sarsol (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sarsol updated SPARK-5281: -- Priority: Critical (was: Major) > Registering table on RDD is giving MissingRequirementError >

[jira] [Closed] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-16 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-5234. - fixed > examples for ml don't have sparkContext.stop > > >

[jira] [Created] (SPARK-5281) Registering table on RDD is giving MissingRequirementError

2015-01-16 Thread sarsol (JIRA)
sarsol created SPARK-5281: - Summary: Registering table on RDD is giving MissingRequirementError Key: SPARK-5281 URL: https://issues.apache.org/jira/browse/SPARK-5281 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4357) Modify release publishing to work with Scala 2.11

2015-01-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280037#comment-14280037 ] François Garillot commented on SPARK-4357: -- Scala 2.11.5 [has been released|http:

[jira] [Commented] (SPARK-5147) write ahead logs from streaming receiver are not purged because cleanupOldBlocks in WriteAheadLogBasedBlockHandler is never called

2015-01-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-5147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280034#comment-14280034 ] François Garillot commented on SPARK-5147: -- I see. Thanks for your answers ! For

[jira] [Commented] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with large size

2015-01-16 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280025#comment-14280025 ] yuhao yang commented on SPARK-5186: --- I just updated the PR with a hashCode fix. Please h

[jira] [Created] (SPARK-5280) Import RDF graphs into GraphX

2015-01-16 Thread lukovnikov (JIRA)
lukovnikov created SPARK-5280: - Summary: Import RDF graphs into GraphX Key: SPARK-5280 URL: https://issues.apache.org/jira/browse/SPARK-5280 Project: Spark Issue Type: New Feature Compo

[jira] [Commented] (SPARK-4867) UDF clean up

2015-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14279983#comment-14279983 ] Reynold Xin commented on SPARK-4867: BTW if we plan to implement most SQL functions us

[jira] [Commented] (SPARK-5270) Elegantly check if RDD is empty

2015-01-16 Thread Al M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14279962#comment-14279962 ] Al M commented on SPARK-5270: - Good point it's not a catch-all solution. The rdd.partitions.s