[jira] [Commented] (SPARK-3129) Prevent data loss in Spark Streaming

2014-09-19 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140049#comment-14140049 ] Hari Shreedharan commented on SPARK-3129: - Do these numbers look ok enough to you

[jira] [Updated] (SPARK-3600) RDD[Double] doesn't use primitive arrays for caching

2014-09-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3600: - Summary: RDD[Double] doesn't use primitive arrays for caching (was: RandomRDDs doesn't create

[jira] [Updated] (SPARK-3600) RDD[Double] doesn't use primitive arrays for caching

2014-09-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3600: - Issue Type: Improvement (was: Bug) RDD[Double] doesn't use primitive arrays for caching

[jira] [Updated] (SPARK-3600) RDD[Double] doesn't use primitive arrays for caching

2014-09-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3600: - Component/s: (was: MLlib) RDD[Double] doesn't use primitive arrays for caching

[jira] [Updated] (SPARK-3600) RDD[Double] doesn't use primitive arrays for caching

2014-09-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3600: - Target Version/s: (was: 1.1.1, 1.2.0) RDD[Double] doesn't use primitive arrays for caching

[jira] [Commented] (SPARK-3298) [SQL] registerAsTable / registerTempTable overwrites old tables

2014-09-19 Thread Ravindra Pesala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140124#comment-14140124 ] Ravindra Pesala commented on SPARK-3298: I guess, we should add some API like

[jira] [Commented] (SPARK-3403) NaiveBayes crashes with blas/lapack native libraries for breeze (netlib-java)

2014-09-19 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140128#comment-14140128 ] Alexander Ulanov commented on SPARK-3403: - Posted to netlib-java:

[jira] [Comment Edited] (SPARK-3403) NaiveBayes crashes with blas/lapack native libraries for breeze (netlib-java)

2014-09-19 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138829#comment-14138829 ] Alexander Ulanov edited comment on SPARK-3403 at 9/19/14 7:16 AM:

[jira] [Created] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering.

2014-09-19 Thread mohan gaddam (JIRA)
mohan gaddam created SPARK-3601: --- Summary: Kryo NPE for output operations on Avro complex Objects even after registering. Key: SPARK-3601 URL: https://issues.apache.org/jira/browse/SPARK-3601 Project:

[jira] [Updated] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering.

2014-09-19 Thread mohan gaddam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mohan gaddam updated SPARK-3601: Description: Kryo serializer works well when avro objects has simple data. but when the same avro

[jira] [Updated] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering.

2014-09-19 Thread mohan gaddam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mohan gaddam updated SPARK-3601: Description: Kryo serializer works well when avro objects has simple data. but when the same avro

[jira] [Updated] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering.

2014-09-19 Thread mohan gaddam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mohan gaddam updated SPARK-3601: Description: Kryo serializer works well when avro objects has simple data. but when the same avro

[jira] [Updated] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering.

2014-09-19 Thread mohan gaddam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mohan gaddam updated SPARK-3601: Description: Kryo serializer works well when avro objects has simple data. but when the same avro

[jira] [Updated] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering.

2014-09-19 Thread mohan gaddam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mohan gaddam updated SPARK-3601: Description: Kryo serializer works well when avro objects has simple data. but when the same avro

[jira] [Updated] (SPARK-3601) Kryo NPE for output operations on Avro complex Objects even after registering.

2014-09-19 Thread mohan gaddam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mohan gaddam updated SPARK-3601: Description: Kryo serializer works well when avro objects has simple data. but when the same avro

[jira] [Commented] (SPARK-3536) SELECT on empty parquet table throws exception

2014-09-19 Thread Ravindra Pesala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140145#comment-14140145 ] Ravindra Pesala commented on SPARK-3536: It return null metadata from parquet if

[jira] [Commented] (SPARK-3434) Distributed block matrix

2014-09-19 Thread Gaurav Mishra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140160#comment-14140160 ] Gaurav Mishra commented on SPARK-3434: -- A matrix being represented by multiple RDDs

[jira] [Created] (SPARK-3602) Can't run cassandra_inputformat.py

2014-09-19 Thread Frens Jan Rumph (JIRA)
Frens Jan Rumph created SPARK-3602: -- Summary: Can't run cassandra_inputformat.py Key: SPARK-3602 URL: https://issues.apache.org/jira/browse/SPARK-3602 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-3602) Can't run cassandra_inputformat.py

2014-09-19 Thread Frens Jan Rumph (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140236#comment-14140236 ] Frens Jan Rumph edited comment on SPARK-3602 at 9/19/14 9:26 AM:

[jira] [Commented] (SPARK-3530) Pipeline and Parameters

2014-09-19 Thread Egor Pakhomov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140260#comment-14140260 ] Egor Pakhomov commented on SPARK-3530: -- Nice doc. Parameters passing as part of grid

[jira] [Commented] (SPARK-3536) SELECT on empty parquet table throws exception

2014-09-19 Thread Ravindra Pesala (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140269#comment-14140269 ] Ravindra Pesala commented on SPARK-3536: [~isaias.barroso] I have submitted the PR

[jira] [Commented] (SPARK-3403) NaiveBayes crashes with blas/lapack native libraries for breeze (netlib-java)

2014-09-19 Thread Sam Halliday (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140319#comment-14140319 ] Sam Halliday commented on SPARK-3403: - thanks guys. This looks like its even more

[jira] [Created] (SPARK-3603) InvalidClassException on a Linux VM - probably problem with serialization

2014-09-19 Thread Tomasz Dudziak (JIRA)
Tomasz Dudziak created SPARK-3603: - Summary: InvalidClassException on a Linux VM - probably problem with serialization Key: SPARK-3603 URL: https://issues.apache.org/jira/browse/SPARK-3603 Project:

[jira] [Commented] (SPARK-2365) Add IndexedRDD, an efficient updatable key-value store

2014-09-19 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140647#comment-14140647 ] Imran Rashid commented on SPARK-2365: - This looks fantastic. I think it will also see

[jira] [Commented] (SPARK-2706) Enable Spark to support Hive 0.13

2014-09-19 Thread Greg Senia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140657#comment-14140657 ] Greg Senia commented on SPARK-2706: --- We have been using this fix for a few weeks now

[jira] [Commented] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-19 Thread Eric Friedman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140713#comment-14140713 ] Eric Friedman commented on SPARK-3604: -- many more frames of the same content than

[jira] [Commented] (SPARK-3598) cast to timestamp should be the same as hive

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140900#comment-14140900 ] Apache Spark commented on SPARK-3598: - User 'adrian-wang' has created a pull request

[jira] [Commented] (SPARK-3536) SELECT on empty parquet table throws exception

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140898#comment-14140898 ] Apache Spark commented on SPARK-3536: - User 'ravipesala' has created a pull request

[jira] [Commented] (SPARK-3250) More Efficient Sampling

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140897#comment-14140897 ] Apache Spark commented on SPARK-3250: - User 'erikerlandson' has created a pull request

[jira] [Commented] (SPARK-3599) Avoid loading and printing properties file content frequently

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140896#comment-14140896 ] Apache Spark commented on SPARK-3599: - User 'WangTaoTheTonic' has created a pull

[jira] [Commented] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140940#comment-14140940 ] Patrick Wendell commented on SPARK-3604: Yeah good catch, we should fix this.

[jira] [Updated] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3604: --- Target Version/s: 1.2.0 unbounded recursion in getNumPartitions triggers stack overflow for

[jira] [Updated] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3604: --- Priority: Critical (was: Blocker) unbounded recursion in getNumPartitions triggers stack

[jira] [Commented] (SPARK-2175) Null values when using App trait.

2014-09-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140947#comment-14140947 ] Patrick Wendell commented on SPARK-2175: Thanks for reporting this - can someone

[jira] [Created] (SPARK-3605) Typo in SchemaRDD JavaDoc

2014-09-19 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-3605: - Summary: Typo in SchemaRDD JavaDoc Key: SPARK-3605 URL: https://issues.apache.org/jira/browse/SPARK-3605 Project: Spark Issue Type: Bug Components: SQL

[jira] [Commented] (SPARK-3605) Typo in SchemaRDD JavaDoc

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141028#comment-14141028 ] Apache Spark commented on SPARK-3605: - User 'sryza' has created a pull request for

[jira] [Commented] (SPARK-3573) Dataset

2014-09-19 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141063#comment-14141063 ] Sandy Ryza commented on SPARK-3573: --- Currently SchemaRDD does depend on Catalyst. Are

[jira] [Updated] (SPARK-3536) SELECT on empty parquet table throws exception

2014-09-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3536: Assignee: Ravindra Pesala SELECT on empty parquet table throws exception

[jira] [Commented] (SPARK-951) Gaussian Mixture Model

2014-09-19 Thread Anant Daksh Asthana (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141124#comment-14141124 ] Anant Daksh Asthana commented on SPARK-951: --- caizhua Could you please elaborate a

[jira] [Commented] (SPARK-2175) Null values when using App trait.

2014-09-19 Thread Brandon Amos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141217#comment-14141217 ] Brandon Amos commented on SPARK-2175: - Hi, does the following snippet from the mailing

[jira] [Commented] (SPARK-3573) Dataset

2014-09-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141271#comment-14141271 ] Xiangrui Meng commented on SPARK-3573: -- [~sandyr] SQL/Streaming/GraphX provide

[jira] [Created] (SPARK-3606) Spark-on-Yarn AmIpFilter does not work with Yarn HA.

2014-09-19 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-3606: - Summary: Spark-on-Yarn AmIpFilter does not work with Yarn HA. Key: SPARK-3606 URL: https://issues.apache.org/jira/browse/SPARK-3606 Project: Spark Issue

[jira] [Commented] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141313#comment-14141313 ] Apache Spark commented on SPARK-3604: - User 'harishreedharan' has created a pull

[jira] [Commented] (SPARK-1853) Show Streaming application code context (file, line number) in Spark Stages UI

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141344#comment-14141344 ] Apache Spark commented on SPARK-1853: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-3129) Prevent data loss in Spark Streaming

2014-09-19 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141382#comment-14141382 ] Matei Zaharia commented on SPARK-3129: -- So Hari, what is the maximum sustainable rate

[jira] [Created] (SPARK-3607) ConnectionManager threads.max configs on the thread pools don't work

2014-09-19 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-3607: Summary: ConnectionManager threads.max configs on the thread pools don't work Key: SPARK-3607 URL: https://issues.apache.org/jira/browse/SPARK-3607 Project: Spark

[jira] [Created] (SPARK-3608) Spark EC2 Script does not correctly break when AWS tagging succeeds.

2014-09-19 Thread Vida Ha (JIRA)
Vida Ha created SPARK-3608: -- Summary: Spark EC2 Script does not correctly break when AWS tagging succeeds. Key: SPARK-3608 URL: https://issues.apache.org/jira/browse/SPARK-3608 Project: Spark

[jira] [Commented] (SPARK-3608) Spark EC2 Script does not correctly break when AWS tagging succeeds.

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141438#comment-14141438 ] Apache Spark commented on SPARK-3608: - User 'vidaha' has created a pull request for

[jira] [Resolved] (SPARK-3501) Hive SimpleUDF will create duplicated type cast which cause exception in constant folding

2014-09-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3501. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2368

[jira] [Resolved] (SPARK-2594) Add CACHE TABLE name AS SELECT ...

2014-09-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-2594. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2397

[jira] [Resolved] (SPARK-3592) applySchema to an RDD of Row

2014-09-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3592. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2448

[jira] [Created] (SPARK-3609) Add sizeInBytes statistics to Limit operator

2014-09-19 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-3609: - Summary: Add sizeInBytes statistics to Limit operator Key: SPARK-3609 URL: https://issues.apache.org/jira/browse/SPARK-3609 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-3610) Unable to load app logs for MLLib programs in history server

2014-09-19 Thread SK (JIRA)
SK created SPARK-3610: - Summary: Unable to load app logs for MLLib programs in history server Key: SPARK-3610 URL: https://issues.apache.org/jira/browse/SPARK-3610 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-3606) Spark-on-Yarn AmIpFilter does not work with Yarn HA.

2014-09-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141573#comment-14141573 ] Apache Spark commented on SPARK-3606: - User 'vanzin' has created a pull request for