[jira] [Resolved] (SPARK-6141) Upgrade Breeze to 0.11 to fix convergence bug

2015-03-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6141. -- Resolution: Fixed Fix Version/s: 1.3.1 Issue resolved by pull request 4879 [https://githu

[jira] [Commented] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatibility issues caused by Guava 15

2015-03-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346519#comment-14346519 ] Patrick Wendell commented on SPARK-6149: To be more specific, I am suggesting that

[jira] [Commented] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatibility issues caused by Guava 15

2015-03-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346515#comment-14346515 ] Patrick Wendell commented on SPARK-6149: Yes - because of this I think simply excl

[jira] [Created] (SPARK-6155) Build with Scala 2.11.5 failed for Spark v1.3.0-rc2

2015-03-03 Thread Jianshi Huang (JIRA)
Jianshi Huang created SPARK-6155: Summary: Build with Scala 2.11.5 failed for Spark v1.3.0-rc2 Key: SPARK-6155 URL: https://issues.apache.org/jira/browse/SPARK-6155 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-4368) Ceph integration?

2015-03-03 Thread kishorekumar neelamegam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346499#comment-14346499 ] kishorekumar neelamegam edited comment on SPARK-4368 at 3/4/15 6:58 AM:

[jira] [Commented] (SPARK-4368) Ceph integration?

2015-03-03 Thread kishorekumar neelamegam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346499#comment-14346499 ] kishorekumar neelamegam commented on SPARK-4368: If no changes or needed i

[jira] [Commented] (SPARK-2087) Clean Multi-user semantics for thrift JDBC/ODBC server.

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346443#comment-14346443 ] Apache Spark commented on SPARK-2087: - User 'chenghao-intel' has created a pull reques

[jira] [Created] (SPARK-6154) Build error with Scala 2.11 for v1.3.0-rc2

2015-03-03 Thread Jianshi Huang (JIRA)
Jianshi Huang created SPARK-6154: Summary: Build error with Scala 2.11 for v1.3.0-rc2 Key: SPARK-6154 URL: https://issues.apache.org/jira/browse/SPARK-6154 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6153) intellij import from maven cannot debug sparksqlclidriver

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346428#comment-14346428 ] Apache Spark commented on SPARK-6153: - User 'adrian-wang' has created a pull request f

[jira] [Created] (SPARK-6153) intellij import from maven cannot debug sparksqlclidriver

2015-03-03 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-6153: -- Summary: intellij import from maven cannot debug sparksqlclidriver Key: SPARK-6153 URL: https://issues.apache.org/jira/browse/SPARK-6153 Project: Spark Issue Typ

[jira] [Commented] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatibility issues caused by Guava 15

2015-03-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346414#comment-14346414 ] Cheng Lian commented on SPARK-6149: --- Pointed out by [~pwendell], this is a Maven-vs-SBT

[jira] [Closed] (SPARK-6150) Validate indices before constructing a SparseVector

2015-03-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-6150. Resolution: Won't Fix > Validate indices before constructing a SparseVector > --

[jira] [Commented] (SPARK-3437) Adapt maven build to work without the need of hardcoding scala binary version in artifact id.

2015-03-03 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346405#comment-14346405 ] Prashant Sharma commented on SPARK-3437: Yes, you are right - Idea here is to not

[jira] [Issue Comment Deleted] (SPARK-3437) Adapt maven build to work without the need of hardcoding scala binary version in artifact id.

2015-03-03 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Sharma updated SPARK-3437: --- Comment: was deleted (was: Yes, you are right - Idea here is to not write _2.xx in artifact ID

[jira] [Commented] (SPARK-3437) Adapt maven build to work without the need of hardcoding scala binary version in artifact id.

2015-03-03 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346404#comment-14346404 ] Prashant Sharma commented on SPARK-3437: Yes, you are right - Idea here is to not

[jira] [Commented] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346396#comment-14346396 ] Andrew Or commented on SPARK-6144: -- I believe this is a regression from 1.2 caused by ht

[jira] [Updated] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6144: - Assignee: Trystan Leftwich > When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail > ---

[jira] [Commented] (SPARK-4184) Improve Spark Streaming documentation to address commonly-asked questions

2015-03-03 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346371#comment-14346371 ] Chris Fregly commented on SPARK-4184: - Hey [~sowen]! I'm gonna move this to 1.3.1. T

[jira] [Updated] (SPARK-4184) Improve Spark Streaming documentation to address commonly-asked questions

2015-03-03 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Fregly updated SPARK-4184: Summary: Improve Spark Streaming documentation to address commonly-asked questions (was: Improve S

[jira] [Updated] (SPARK-4184) Improve Spark Streaming documentation

2015-03-03 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Fregly updated SPARK-4184: Target Version/s: 1.3.1 (was: 1.2.0) > Improve Spark Streaming documentation >

[jira] [Updated] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatibility issues caused by Guava 15

2015-03-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6149: -- Summary: Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatib

[jira] [Updated] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatibility issues caused by Guava (15?)

2015-03-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6149: -- Summary: Spark SQL CLI doesn't work when compiled against Hive 12 with SBT because of runtime incompatib

[jira] [Updated] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6144: --- Target Version/s: 1.3.0 > When in cluster mode using ADD JAR with a hdfs:// sourced jar will f

[jira] [Updated] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6144: --- Priority: Blocker (was: Major) > When in cluster mode using ADD JAR with a hdfs:// sourced ja

[jira] [Commented] (SPARK-5834) spark 1.2.1 officical package bundled with httpclient 4.1.2 is too old

2015-03-03 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346349#comment-14346349 ] Littlestar commented on SPARK-5834: --- Spark 1.3.0 RC1/RC2 bundle with httpclient 4.3.6, w

[jira] [Created] (SPARK-6152) Spark does not support Java 8 compiled Scala classes

2015-03-03 Thread Ronald Chen (JIRA)
Ronald Chen created SPARK-6152: -- Summary: Spark does not support Java 8 compiled Scala classes Key: SPARK-6152 URL: https://issues.apache.org/jira/browse/SPARK-6152 Project: Spark Issue Type: Bu

[jira] [Updated] (SPARK-6151) schemaRDD to parquetfile with saveAsParquetFile control the HDFS block size

2015-03-03 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Littlestar updated SPARK-6151: -- Component/s: SQL > schemaRDD to parquetfile with saveAsParquetFile control the HDFS block size > ---

[jira] [Created] (SPARK-6151) schemaRDD to parquetfile with saveAsParquetFile control the HDFS block size

2015-03-03 Thread Littlestar (JIRA)
Littlestar created SPARK-6151: - Summary: schemaRDD to parquetfile with saveAsParquetFile control the HDFS block size Key: SPARK-6151 URL: https://issues.apache.org/jira/browse/SPARK-6151 Project: Spark

[jira] [Commented] (SPARK-6150) Validate indices before constructing a SparseVector

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346280#comment-14346280 ] Apache Spark commented on SPARK-6150: - User 'yu-iskw' has created a pull request for t

[jira] [Created] (SPARK-6150) Validate indices before constructing a SparseVector

2015-03-03 Thread Yu Ishikawa (JIRA)
Yu Ishikawa created SPARK-6150: -- Summary: Validate indices before constructing a SparseVector Key: SPARK-6150 URL: https://issues.apache.org/jira/browse/SPARK-6150 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6132) Context cleaner race condition across SparkContexts

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346248#comment-14346248 ] Apache Spark commented on SPARK-6132: - User 'andrewor14' has created a pull request fo

[jira] [Commented] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346176#comment-14346176 ] Apache Spark commented on SPARK-6144: - User 'trystanleftwich' has created a pull reque

[jira] [Commented] (SPARK-6148) cachedDataSourceTables may store outdated metadata if the table is updated from another HiveContext

2015-03-03 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346165#comment-14346165 ] Yin Huai commented on SPARK-6148: - To work around this issue, users need to use refreshTabl

[jira] [Updated] (SPARK-6148) cachedDataSourceTables may store outdated metadata if the table is updated from another HiveContext

2015-03-03 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6148: Description: If we have two HiveContexts and we change a table through one (e.g. append and overwrite with n

[jira] [Updated] (SPARK-6148) cachedDataSourceTables may store outdated metadata if the table is updated from another HiveContext

2015-03-03 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6148: Description: If we have two HiveContexts and we change a table through one, > cachedDataSourceTables may sto

[jira] [Updated] (SPARK-6148) cachedDataSourceTables may store outdated metadata if the table is updated from another HiveContext

2015-03-03 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6148: Summary: cachedDataSourceTables may store outdated metadata if the table is updated from another HiveContext

[jira] [Updated] (SPARK-6148) After saveAsTable, non-nullable columns cannot be read back

2015-03-03 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6148: Priority: Major (was: Blocker) > After saveAsTable, non-nullable columns cannot be read back >

[jira] [Updated] (SPARK-6148) After saveAsTable, non-nullable columns cannot be read back

2015-03-03 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6148: Target Version/s: (was: 1.3.0) > After saveAsTable, non-nullable columns cannot be read back > ---

[jira] [Created] (SPARK-6149) Spark SQL CLI doesn't work when compiled against Hive 12 because of runtime incompatibility issues caused by Guava (15?)

2015-03-03 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-6149: - Summary: Spark SQL CLI doesn't work when compiled against Hive 12 because of runtime incompatibility issues caused by Guava (15?) Key: SPARK-6149 URL: https://issues.apache.org/jira/br

[jira] [Created] (SPARK-6147) Move JDBC data source integration tests to the Spark integration tests project

2015-03-03 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-6147: - Summary: Move JDBC data source integration tests to the Spark integration tests project Key: SPARK-6147 URL: https://issues.apache.org/jira/browse/SPARK-6147 Project: Spark

[jira] [Created] (SPARK-6148) After saveAsTable, non-nullable columns cannot be read back

2015-03-03 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6148: --- Summary: After saveAsTable, non-nullable columns cannot be read back Key: SPARK-6148 URL: https://issues.apache.org/jira/browse/SPARK-6148 Project: Spark Issue Type:

[jira] [Reopened] (SPARK-5423) ExternalAppendOnlyMap won't delete temp spilled file if some exception happens during using it

2015-03-03 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nishkam Ravi reopened SPARK-5423: - The commit was reverted due to performance issues. Re-opening this JIRA. > ExternalAppendOnlyMap won'

[jira] [Resolved] (SPARK-6142) 10-12% Performance regression with "finalize"

2015-03-03 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nishkam Ravi resolved SPARK-6142. - Resolution: Fixed > 10-12% Performance regression with "finalize" > --

[jira] [Commented] (SPARK-6142) 10-12% Performance regression with "finalize"

2015-03-03 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346086#comment-14346086 ] Nishkam Ravi commented on SPARK-6142: - Fixed (master 9af001749a37a86ccbf78063ec514a218

[jira] [Updated] (SPARK-5949) Driver program has to register roaring bitmap classes used by spark with Kryo when number of partitions is greater than 2000

2015-03-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5949: --- Fix Version/s: (was: 1.3.0) 1.4.0 > Driver program has to register roaring bitm

[jira] [Resolved] (SPARK-5949) Driver program has to register roaring bitmap classes used by spark with Kryo when number of partitions is greater than 2000

2015-03-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5949. Resolution: Fixed Fix Version/s: 1.3.0 > Driver program has to register roaring bitmap classe

[jira] [Created] (SPARK-6146) Support more datatype in SqlParser

2015-03-03 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6146: --- Summary: Support more datatype in SqlParser Key: SPARK-6146 URL: https://issues.apache.org/jira/browse/SPARK-6146 Project: Spark Issue Type: Bug Components:

[jira] [Updated] (SPARK-6146) Support more datatype in SqlParser

2015-03-03 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6146: Issue Type: Improvement (was: Bug) > Support more datatype in SqlParser > -

[jira] [Updated] (SPARK-6133) SparkContext#stop is not idempotent

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6133: - Fix Version/s: 1.2.2 > SparkContext#stop is not idempotent > --- > >

[jira] [Updated] (SPARK-6133) SparkContext#stop is not idempotent

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6133: - Fix Version/s: 1.4.0 > SparkContext#stop is not idempotent > --- > >

[jira] [Updated] (SPARK-6133) SparkContext#stop is not idempotent

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6133: - Target Version/s: 1.2.2, 1.4.0 (was: 1.4.0) > SparkContext#stop is not idempotent > -

[jira] [Updated] (SPARK-6133) SparkContext#stop is not idempotent

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6133: - Target Version/s: 1.2.2, 1.4.0, 1.3.1 (was: 1.2.2, 1.4.0) > SparkContext#stop is not idempotent > ---

[jira] [Updated] (SPARK-6145) ORDER BY fails to resolve nested fields

2015-03-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6145: Description: {code} sqlContext.jsonRDD(sc.parallelize( """{"a": {"b": 1}, "c": 1}""" :: Ni

[jira] [Created] (SPARK-6145) ORDER BY fails to resolve nested fields

2015-03-03 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-6145: --- Summary: ORDER BY fails to resolve nested fields Key: SPARK-6145 URL: https://issues.apache.org/jira/browse/SPARK-6145 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345908#comment-14345908 ] Apache Spark commented on SPARK-6144: - User 'trystanleftwich' has created a pull reque

[jira] [Updated] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-03 Thread Trystan Leftwich (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Trystan Leftwich updated SPARK-6144: Description: While in cluster mode if you use ADD JAR with a HDFS sourced jar it will fail

[jira] [Created] (SPARK-6144) When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail

2015-03-03 Thread Trystan Leftwich (JIRA)
Trystan Leftwich created SPARK-6144: --- Summary: When in cluster mode using ADD JAR with a hdfs:// sourced jar will fail Key: SPARK-6144 URL: https://issues.apache.org/jira/browse/SPARK-6144 Project:

[jira] [Updated] (SPARK-6132) Context cleaner race condition across SparkContexts

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6132: - Target Version/s: 1.1.2, 1.2.2, 1.4.0, 1.3.1 (was: 1.4.0, 1.3.1) > Context cleaner race condition across

[jira] [Updated] (SPARK-6132) Context cleaner race condition across SparkContexts

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6132: - Target Version/s: 1.4.0, 1.3.1 (was: 1.4.0) > Context cleaner race condition across SparkContexts > -

[jira] [Updated] (SPARK-6132) Context cleaner race condition across SparkContexts

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6132: - Labels: backport-needed (was: ) > Context cleaner race condition across SparkContexts > -

[jira] [Updated] (SPARK-6132) Context cleaner race condition across SparkContexts

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6132: - Fix Version/s: 1.4.0 > Context cleaner race condition across SparkContexts > -

[jira] [Updated] (SPARK-6143) Improve FP-Growth for mining closed-forms of frequent patterns

2015-03-03 Thread Denis Dus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Denis Dus updated SPARK-6143: - Description: It is more convenient for a person to analyze closed forms of frequent itemsets (and patterns

[jira] [Updated] (SPARK-6143) Improve FP-Growth for mining closed-forms of frequent patterns

2015-03-03 Thread Denis Dus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Denis Dus updated SPARK-6143: - Description: It is more convenient for a person to analyze closed forms of frequent itemsets (and patterns

[jira] [Created] (SPARK-6143) Improve FP-Growth for mining closed-forms of frequent patterns

2015-03-03 Thread Denis Dus (JIRA)
Denis Dus created SPARK-6143: Summary: Improve FP-Growth for mining closed-forms of frequent patterns Key: SPARK-6143 URL: https://issues.apache.org/jira/browse/SPARK-6143 Project: Spark Issue T

[jira] [Commented] (SPARK-5949) Driver program has to register roaring bitmap classes used by spark with Kryo when number of partitions is greater than 2000

2015-03-03 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345696#comment-14345696 ] Imran Rashid commented on SPARK-5949: - Thanks [~ptorok]. I've updated the PR. Can yo

[jira] [Commented] (SPARK-5433) Spark EC2 doesn't mount local disks for all instance types

2015-03-03 Thread Uri Laserson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345669#comment-14345669 ] Uri Laserson commented on SPARK-5433: - Confirming that I ran into this problem with an

[jira] [Commented] (SPARK-6137) G-Means clustering algorithm implementation

2015-03-03 Thread Denis Dus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345663#comment-14345663 ] Denis Dus commented on SPARK-6137: -- I really have no experience with X-Means, but from st

[jira] [Commented] (SPARK-6137) G-Means clustering algorithm implementation

2015-03-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345635#comment-14345635 ] Joseph K. Bradley commented on SPARK-6137: -- Yeah, I'm not aware of a theoretical

[jira] [Created] (SPARK-6142) 10-12% Performance regression with "finalize"

2015-03-03 Thread Nishkam Ravi (JIRA)
Nishkam Ravi created SPARK-6142: --- Summary: 10-12% Performance regression with "finalize" Key: SPARK-6142 URL: https://issues.apache.org/jira/browse/SPARK-6142 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-6141) Upgrade Breeze to 0.11 to fix convergence bug

2015-03-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6141: - Priority: Critical (was: Blocker) > Upgrade Breeze to 0.11 to fix convergence bug > -

[jira] [Comment Edited] (SPARK-6137) G-Means clustering algorithm implementation

2015-03-03 Thread Denis Dus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345575#comment-14345575 ] Denis Dus edited comment on SPARK-6137 at 3/3/15 7:27 PM: -- As I s

[jira] [Updated] (SPARK-6141) Upgrade Breeze to 0.11 to fix convergence bug

2015-03-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6141: - Target Version/s: 1.3.0 > Upgrade Breeze to 0.11 to fix convergence bug >

[jira] [Commented] (SPARK-6137) G-Means clustering algorithm implementation

2015-03-03 Thread Denis Dus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345575#comment-14345575 ] Denis Dus commented on SPARK-6137: -- As I see, splitting rules are different. 1) GMeans us

[jira] [Updated] (SPARK-6141) Upgrade Breeze to 0.11 to fix convergence bug

2015-03-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6141: - Assignee: DB Tsai > Upgrade Breeze to 0.11 to fix convergence bug > --

[jira] [Commented] (SPARK-6141) Upgrade Breeze to 0.11 to fix convergence bug

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345556#comment-14345556 ] Apache Spark commented on SPARK-6141: - User 'dbtsai' has created a pull request for th

[jira] [Created] (SPARK-6141) Upgrade Breeze to 0.11 to fix convergence bug

2015-03-03 Thread DB Tsai (JIRA)
DB Tsai created SPARK-6141: -- Summary: Upgrade Breeze to 0.11 to fix convergence bug Key: SPARK-6141 URL: https://issues.apache.org/jira/browse/SPARK-6141 Project: Spark Issue Type: Bug Com

[jira] [Commented] (SPARK-6067) Spark sql hive dynamic partitions job will fail if task fails

2015-03-03 Thread Jason Hubbard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345519#comment-14345519 ] Jason Hubbard commented on SPARK-6067: -- Hi baishuo. To reproduce, create a Hive tabl

[jira] [Updated] (SPARK-6067) Spark sql hive dynamic partitions job will fail if task fails

2015-03-03 Thread Jason Hubbard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Hubbard updated SPARK-6067: - Attachment: job.log > Spark sql hive dynamic partitions job will fail if task fails >

[jira] [Updated] (SPARK-3789) Python bindings for GraphX

2015-03-03 Thread Kushal Datta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kushal Datta updated SPARK-3789: Target Version/s: 1.4.0 > Python bindings for GraphX > -- > >

[jira] [Commented] (SPARK-6137) G-Means clustering algorithm implementation

2015-03-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345499#comment-14345499 ] Joseph K. Bradley commented on SPARK-6137: -- I understand that GMeans picks K. Wh

[jira] [Commented] (SPARK-5920) Use a BufferedInputStream to read local shuffle data

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345495#comment-14345495 ] Apache Spark commented on SPARK-5920: - User 'ravipesala' has created a pull request fo

[jira] [Commented] (SPARK-3789) Python bindings for GraphX

2015-03-03 Thread Kushal Datta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345491#comment-14345491 ] Kushal Datta commented on SPARK-3789: - PyGraphX depends on Java API for GraphX > Pyth

[jira] [Commented] (SPARK-6137) G-Means clustering algorithm implementation

2015-03-03 Thread Denis Dus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345484#comment-14345484 ] Denis Dus commented on SPARK-6137: -- Streaming K-Means (as I see) is just a variation of r

[jira] [Commented] (SPARK-6137) G-Means clustering algorithm implementation

2015-03-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345408#comment-14345408 ] Joseph K. Bradley commented on SPARK-6137: -- There is a method for splitting clust

[jira] [Commented] (SPARK-5949) Driver program has to register roaring bitmap classes used by spark with Kryo when number of partitions is greater than 2000

2015-03-03 Thread Peter Torok (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345379#comment-14345379 ] Peter Torok commented on SPARK-5949: Actually, there were two more classes discovered

[jira] [Comment Edited] (SPARK-3859) Use consistent config names for duration (with units!)

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345365#comment-14345365 ] Andrew Or edited comment on SPARK-3859 at 3/3/15 5:28 PM: -- The pr

[jira] [Comment Edited] (SPARK-3859) Use consistent config names for duration (with units!)

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345365#comment-14345365 ] Andrew Or edited comment on SPARK-3859 at 3/3/15 5:27 PM: -- The pr

[jira] [Closed] (SPARK-3859) Use consistent config names for duration (with units!)

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-3859. Resolution: Duplicate > Use consistent config names for duration (with units!) > ---

[jira] [Comment Edited] (SPARK-3859) Use consistent config names for duration (with units!)

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345365#comment-14345365 ] Andrew Or edited comment on SPARK-3859 at 3/3/15 5:27 PM: -- The pr

[jira] [Comment Edited] (SPARK-3859) Use consistent config names for duration (with units!)

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345365#comment-14345365 ] Andrew Or edited comment on SPARK-3859 at 3/3/15 5:26 PM: -- The pr

[jira] [Commented] (SPARK-3859) Use consistent config names for duration (with units!)

2015-03-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345365#comment-14345365 ] Andrew Or commented on SPARK-3859: -- The problem is we keep adding more and more of these

[jira] [Commented] (SPARK-5949) Driver program has to register roaring bitmap classes used by spark with Kryo when number of partitions is greater than 2000

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345259#comment-14345259 ] Apache Spark commented on SPARK-5949: - User 'squito' has created a pull request for th

[jira] [Created] (SPARK-6140) Retain longForm call site information even when shortForm is overridden.

2015-03-03 Thread Christopher Ng (JIRA)
Christopher Ng created SPARK-6140: - Summary: Retain longForm call site information even when shortForm is overridden. Key: SPARK-6140 URL: https://issues.apache.org/jira/browse/SPARK-6140 Project: Spa

[jira] [Commented] (SPARK-5143) spark-network-yarn 2.11 depends on spark-network-shuffle 2.10

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345232#comment-14345232 ] Apache Spark commented on SPARK-5143: - User 'srowen' has created a pull request for th

[jira] [Closed] (SPARK-603) add simple Counter API

2015-03-03 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid closed SPARK-603. -- Resolution: Won't Fix > add simple Counter API > -- > > Key: SPARK-60

[jira] [Commented] (SPARK-603) add simple Counter API

2015-03-03 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345204#comment-14345204 ] Imran Rashid commented on SPARK-603: Hi [~srowen] I don't think anyone is actively wor

[jira] [Updated] (SPARK-5707) Enabling spark.sql.codegen throws ClassNotFound exception

2015-03-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5707: -- Description: Exception thrown: {noformat} org.apache.spark.SparkException: Job aborted due to stage fail

[jira] [Updated] (SPARK-5707) Enabling spark.sql.codegen throws ClassNotFound exception

2015-03-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5707: -- Description: Exception thrown: {noformat} org.apache.spark.SparkException: Job aborted due to stage fail

[jira] [Created] (SPARK-6139) Allow pre-populate sliding window with initial data

2015-03-03 Thread Matus Faro (Kik) (JIRA)
Matus Faro (Kik) created SPARK-6139: --- Summary: Allow pre-populate sliding window with initial data Key: SPARK-6139 URL: https://issues.apache.org/jira/browse/SPARK-6139 Project: Spark Issue

[jira] [Commented] (SPARK-6139) Allow pre-populate sliding window with initial data

2015-03-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345195#comment-14345195 ] Apache Spark commented on SPARK-6139: - User 'matuskik' has created a pull request for
