[jira] [Commented] (SPARK-5207) StandardScalerModel mean and variance re-use

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285960#comment-14285960 ] Apache Spark commented on SPARK-5207: - User 'ogeagla' has created a pull request for

[jira] [Resolved] (SPARK-5301) Add missing linear algebra utilities to IndexedRowMatrix and CoordinateMatrix

2015-01-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5301. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4089

[jira] [Created] (SPARK-5352) Add getPartitionStrategy in Graph

2015-01-21 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-5352: --- Summary: Add getPartitionStrategy in Graph Key: SPARK-5352 URL: https://issues.apache.org/jira/browse/SPARK-5352 Project: Spark Issue Type:

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2015-01-21 Thread Tobias Schlatter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286069#comment-14286069 ] Tobias Schlatter commented on SPARK-2620: - It is a hack in an attempt to have

[jira] [Updated] (SPARK-4793) way to find assembly jar is too strict

2015-01-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4793: - Target Version/s: 1.3.0 (was: 1.3.0, 1.1.2, 1.2.1) way to find assembly jar is too strict

[jira] [Updated] (SPARK-4759) Deadlock in complex spark job in local mode

2015-01-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4759: - Fix Version/s: (was: 1.2.0) 1.2.1 Deadlock in complex spark job in local mode

[jira] [Created] (SPARK-5355) SparkConf is not thread-safe

2015-01-21 Thread Davies Liu (JIRA)
Davies Liu created SPARK-5355: - Summary: SparkConf is not thread-safe Key: SPARK-5355 URL: https://issues.apache.org/jira/browse/SPARK-5355 Project: Spark Issue Type: Bug Affects Versions:

[jira] [Commented] (SPARK-5352) Add getPartitionStrategy in Graph

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285898#comment-14285898 ] Apache Spark commented on SPARK-5352: - User 'maropu' has created a pull request for

[jira] [Updated] (SPARK-5346) Parquet filter pushdown is not enabled when parquet.task.side.metadata is set to true (default value)

2015-01-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5346: -- Description: When computing Parquet splits, reading Parquet metadata from executor side is more memory

[jira] [Closed] (SPARK-4215) Allow requesting executors only on Yarn (for now)

2015-01-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4215. Resolution: Fixed Target Version/s: 1.3.0 (was: 1.3.0, 1.2.1) Allow requesting executors only

[jira] [Closed] (SPARK-4759) Deadlock in complex spark job in local mode

2015-01-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4759. Resolution: Fixed Fix Version/s: 1.2.0 Deadlock in complex spark job in local mode

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-21 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285980#comment-14285980 ] RJ Nowling commented on SPARK-4894: --- [~mengxr] Since [~lmcguire] has submitted the

[jira] [Closed] (SPARK-4569) Rename externalSorting in Aggregator

2015-01-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4569. Resolution: Fixed Fix Version/s: 1.2.1 1.1.2 Assignee: Ilya Ganelin

[jira] [Commented] (SPARK-2669) Hadoop configuration is not localised when submitting job in yarn-cluster mode

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286153#comment-14286153 ] Apache Spark commented on SPARK-2669: - User 'vanzin' has created a pull request for

[jira] [Commented] (SPARK-4337) Add ability to cancel pending requests to YARN

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286142#comment-14286142 ] Apache Spark commented on SPARK-4337: - User 'sryza' has created a pull request for

[jira] [Updated] (SPARK-4793) way to find assembly jar is too strict

2015-01-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4793: - Labels: (was: backport-needed) way to find assembly jar is too strict

[jira] [Updated] (SPARK-5301) Add missing linear algebra utilities to IndexedRowMatrix and CoordinateMatrix

2015-01-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5301: - Assignee: Reza Zadeh Add missing linear algebra utilities to IndexedRowMatrix and

[jira] [Updated] (SPARK-4215) Allow requesting executors only on Yarn (for now)

2015-01-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4215: - Labels: (was: backport-needed) Allow requesting executors only on Yarn (for now)

[jira] [Commented] (SPARK-5353) Log failures in ExceutorClassLoader

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285901#comment-14285901 ] Apache Spark commented on SPARK-5353: - User 'gzm0' has created a pull request for this

[jira] [Updated] (SPARK-5346) Parquet filter pushdown is not enabled when parquet.task.side.metadata is set to true (default value)

2015-01-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5346: -- Priority: Blocker (was: Critical) Parquet filter pushdown is not enabled when

[jira] [Closed] (SPARK-4793) way to find assembly jar is too strict

2015-01-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4793. Resolution: Fixed way to find assembly jar is too strict --

[jira] [Resolved] (SPARK-4749) Allow initializing KMeans clusters using a seed

2015-01-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4749. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3610

[jira] [Created] (SPARK-5354) When possible, correctly set outputPartitioning for leaf SparkPlans

2015-01-21 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5354: --- Summary: When possible, correctly set outputPartitioning for leaf SparkPlans Key: SPARK-5354 URL: https://issues.apache.org/jira/browse/SPARK-5354 Project: Spark

[jira] [Updated] (SPARK-5352) Add getPartitionStrategy in Graph

2015-01-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-5352: Description: Graph remembers an applied partition strategy in partitionBy() and returns it

[jira] [Commented] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285899#comment-14285899 ] Apache Spark commented on SPARK-5309: - User 'MickDavies' has created a pull request

[jira] [Updated] (SPARK-4569) Rename externalSorting in Aggregator

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4569: -- Labels: (was: backport-needed) It looks like all backports have been completed, so I'm removing the

[jira] [Resolved] (SPARK-1714) Take advantage of AMRMClient APIs to simplify logic in YarnAllocationHandler

2015-01-21 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-1714. -- Resolution: Fixed Fix Version/s: 1.3.0 Take advantage of AMRMClient APIs to simplify

[jira] [Updated] (SPARK-5349) Multiple spark shells should be able to share resources

2015-01-21 Thread Tobias Bertelsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tobias Bertelsen updated SPARK-5349: Description: The resource requirements of an interactive shell varies heavily. Sometimes

[jira] [Updated] (SPARK-5351) Can't zip RDDs with unequal numbers of partitions in ReplicatedVertexView.upgrade()

2015-01-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-5351: Description: If the value of 'spark.default.parallelism' does not match the number of

[jira] [Commented] (SPARK-5351) Can't zip RDDs with unequal numbers of partitions in ReplicatedVertexView.upgrade()

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285820#comment-14285820 ] Apache Spark commented on SPARK-5351: - User 'maropu' has created a pull request for

[jira] [Commented] (SPARK-5176) Thrift server fails with confusing error message when deploy-mode is cluster

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285843#comment-14285843 ] Apache Spark commented on SPARK-5176: - User 'tpanningnextcen' has created a pull

[jira] [Created] (SPARK-5351) Can't zip RDDs with unequal numbers of partitions in ReplicatedVertexView.upgrade()

2015-01-21 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-5351: --- Summary: Can't zip RDDs with unequal numbers of partitions in ReplicatedVertexView.upgrade() Key: SPARK-5351 URL: https://issues.apache.org/jira/browse/SPARK-5351

[jira] [Created] (SPARK-5360) For CoGroupedRDD, rdds for narrow dependencies and shuffle handles are included twice in serialized task

2015-01-21 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-5360: - Summary: For CoGroupedRDD, rdds for narrow dependencies and shuffle handles are included twice in serialized task Key: SPARK-5360 URL:

[jira] [Commented] (SPARK-5355) SparkConf is not thread-safe

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286179#comment-14286179 ] Apache Spark commented on SPARK-5355: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4959: Fix Version/s: 1.2.1 Attributes are case sensitive when using a select query from a

[jira] [Commented] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286250#comment-14286250 ] Yin Huai commented on SPARK-5260: - [~sonixbp] Unfortunately, I failed to come up with a

[jira] [Resolved] (SPARK-5009) allCaseVersions function in SqlLexical leads to StackOverflow Exception

2015-01-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5009. - Resolution: Fixed Fix Version/s: (was: 1.2.1) Issue resolved by pull request

[jira] [Resolved] (SPARK-5064) GraphX rmatGraph hangs

2015-01-21 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-5064. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull request

[jira] [Updated] (SPARK-5275) pyspark.streaming is not included in assembly jar

2015-01-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5275: --- Fix Version/s: 1.2.1 1.3.0 pyspark.streaming is not included in assembly

[jira] [Updated] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-01-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4939: --- Target Version/s: 1.3.0 (was: 1.3.0, 1.2.1) Python updateStateByKey example hang in local

[jira] [Resolved] (SPARK-5244) add parser for COALESCE()

2015-01-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5244. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4040

[jira] [Commented] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-01-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286234#comment-14286234 ] Patrick Wendell commented on SPARK-4939: [~tdas] [~davies] [~kayousterhout]

[jira] [Created] (SPARK-5357) Upgrade from commons-codec 1.5

2015-01-21 Thread Matthew Whelan (JIRA)
Matthew Whelan created SPARK-5357: - Summary: Upgrade from commons-codec 1.5 Key: SPARK-5357 URL: https://issues.apache.org/jira/browse/SPARK-5357 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5006) spark.port.maxRetries doesn't work

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5006: -- Target Version/s: 1.3.0, 1.2.1 (was: 1.3.0) spark.port.maxRetries doesn't work

[jira] [Updated] (SPARK-5006) spark.port.maxRetries doesn't work

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5006: -- Fix Version/s: 1.2.1 spark.port.maxRetries doesn't work --

[jira] [Updated] (SPARK-4587) Model export/import

2015-01-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4587: - Description: This is an umbrella JIRA for one of the most requested features on the user

[jira] [Commented] (SPARK-5142) Possibly data may be ruined in Spark Streaming's WAL mechanism.

2015-01-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286387#comment-14286387 ] Tathagata Das commented on SPARK-5142: -- This is definitely a tricky issue. One thing

[jira] [Commented] (SPARK-5360) For CoGroupedRDD, rdds for narrow dependencies and shuffle handles are included twice in serialized task

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286386#comment-14286386 ] Apache Spark commented on SPARK-5360: - User 'kayousterhout' has created a pull request

[jira] [Commented] (SPARK-3424) KMeans Plus Plus is too slow

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286227#comment-14286227 ] Apache Spark commented on SPARK-3424: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-01-21 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286236#comment-14286236 ] Kay Ousterhout commented on SPARK-4939: --- [~pwendell] just want to make sure you

[jira] [Updated] (SPARK-5346) Parquet filter pushdown is not enabled when parquet.task.side.metadata is set to true (default value)

2015-01-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5346: Target Version/s: 1.3.0, 1.2.2 (was: 1.3.0, 1.2.1) Parquet filter pushdown is not enabled

[jira] [Updated] (SPARK-5360) For CoGroupedRDD, rdds for narrow dependencies and shuffle handles are included twice in serialized task

2015-01-21 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-5360: -- Description: CoGroupPartition, part of CoGroupedRDD, includes references to each RDD that the

[jira] [Closed] (SPARK-5359) ML model import/export

2015-01-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-5359. Resolution: Duplicate ML model import/export --

[jira] [Updated] (SPARK-3702) Standardize MLlib classes for learners, models

2015-01-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3702: - Description: Summary: Create a class hierarchy for learning algorithms and the models

[jira] [Commented] (SPARK-1714) Take advantage of AMRMClient APIs to simplify logic in YarnAllocationHandler

2015-01-21 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286407#comment-14286407 ] Ted Yu commented on SPARK-1714: --- {code} if (completedContainer.getExitStatus ==

[jira] [Commented] (SPARK-1714) Take advantage of AMRMClient APIs to simplify logic in YarnAllocationHandler

2015-01-21 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286416#comment-14286416 ] Ted Yu commented on SPARK-1714: --- allocatedHostToContainersMap.synchronized is absent for the

[jira] [Resolved] (SPARK-3958) Possible stream-corruption issues in TorrentBroadcast

2015-01-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3958. Resolution: Fixed Target Version/s: (was: 1.2.1) At this point I'm not aware of

[jira] [Resolved] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-01-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4105. Resolution: Fixed Target Version/s: (was: 1.2.1) At this point I'm not aware of

[jira] [Updated] (SPARK-5361) add in tuple handling for converting python RDD back to JavaRDD

2015-01-21 Thread Winston Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Winston Chen updated SPARK-5361: Description: Existing `SerDeUtil.pythonToJava` implementation does not count in tuple cases:

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-01-21 Thread Victor Tso (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286465#comment-14286465 ] Victor Tso commented on SPARK-4105: --- What's the fix version? FAILED_TO_UNCOMPRESS(5)

[jira] [Closed] (SPARK-944) Give example of writing to HBase from Spark Streaming

2015-01-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das closed SPARK-944. --- Resolution: Not a Problem Give example of writing to HBase from Spark Streaming

[jira] [Updated] (SPARK-5361) python tuple not supported while converting PythonRDD back to JavaRDD

2015-01-21 Thread Winston Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Winston Chen updated SPARK-5361: Summary: python tuple not supported while converting PythonRDD back to JavaRDD (was: add in tuple

[jira] [Commented] (SPARK-4520) SparkSQL exception when reading certain columns from a parquet file

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286578#comment-14286578 ] Apache Spark commented on SPARK-4520: - User 'sadhan' has created a pull request for

[jira] [Commented] (SPARK-5347) InputMetrics bug when inputSplit is not instanceOf FileSplit

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286688#comment-14286688 ] Apache Spark commented on SPARK-5347: - User 'shenh062326' has created a pull request

[jira] [Updated] (SPARK-5063) Display more helpful error messages for several invalid operations

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5063: -- Description: Spark does not support nested RDDs or performing Spark actions inside of transformations;

[jira] [Commented] (SPARK-4506) Update documentation to clarify whether standalone-cluster mode is now officially supported

2015-01-21 Thread Asim Jalis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286466#comment-14286466 ] Asim Jalis commented on SPARK-4506: --- This bug should be reopened. The doc needs some

[jira] [Resolved] (SPARK-4984) add a pop-up containing the full for job description when it is very long

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4984. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3819

[jira] [Updated] (SPARK-4984) add a pop-up containing the full for job description when it is very long

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4984: -- Assignee: wangfei add a pop-up containing the full for job description when it is very long

[jira] [Updated] (SPARK-4984) add a pop-up containing the full for job description when it is very long

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4984: -- Component/s: (was: Spark Core) Web UI add a pop-up containing the full for job

[jira] [Updated] (SPARK-5227) InputOutputMetricsSuite input metrics when reading text file with multiple splits test fails in branch-1.2 SBT Jenkins build w/hadoop1.0 and hadoop2.0 profiles

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5227: -- Priority: Blocker (was: Major) InputOutputMetricsSuite input metrics when reading text file with

[jira] [Commented] (SPARK-5227) InputOutputMetricsSuite input metrics when reading text file with multiple splits test fails in branch-1.2 SBT Jenkins build w/hadoop1.0 and hadoop2.0 profiles

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286681#comment-14286681 ] Josh Rosen commented on SPARK-5227: --- I've bumped this up to a 1.2.1 blocker to see if we

[jira] [Created] (SPARK-5362) Gradient and Optimizer to support generic output (instead of label) and data batches

2015-01-21 Thread Alexander Ulanov (JIRA)
Alexander Ulanov created SPARK-5362: --- Summary: Gradient and Optimizer to support generic output (instead of label) and data batches Key: SPARK-5362 URL: https://issues.apache.org/jira/browse/SPARK-5362

[jira] [Created] (SPARK-5361) add in tuple handling for converting python RDD back to JavaRDD

2015-01-21 Thread Winston Chen (JIRA)
Winston Chen created SPARK-5361: --- Summary: add in tuple handling for converting python RDD back to JavaRDD Key: SPARK-5361 URL: https://issues.apache.org/jira/browse/SPARK-5361 Project: Spark

[jira] [Commented] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-21 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286543#comment-14286543 ] Hari Shreedharan commented on SPARK-5342: - Looks like SPARK-3883 is adding SSL

[jira] [Commented] (SPARK-5147) write ahead logs from streaming receiver are not purged because cleanupOldBlocks in WriteAheadLogBasedBlockHandler is never called

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286596#comment-14286596 ] Apache Spark commented on SPARK-5147: - User 'tdas' has created a pull request for this

[jira] [Resolved] (SPARK-5355) SparkConf is not thread-safe

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5355. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull request

[jira] [Updated] (SPARK-5355) SparkConf is not thread-safe

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5355: -- Assignee: Davies Liu SparkConf is not thread-safe Key:

[jira] [Commented] (SPARK-5362) Gradient and Optimizer to support generic output (instead of label) and data batches

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286701#comment-14286701 ] Apache Spark commented on SPARK-5362: - User 'avulanov' has created a pull request for

[jira] [Commented] (SPARK-5362) Gradient and Optimizer to support generic output (instead of label) and data batches

2015-01-21 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286703#comment-14286703 ] Alexander Ulanov commented on SPARK-5362: -

[jira] [Commented] (SPARK-5361) python tuple not supported while converting PythonRDD back to JavaRDD

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286459#comment-14286459 ] Apache Spark commented on SPARK-5361: - User 'wingchen' has created a pull request for

[jira] [Resolved] (SPARK-4631) Add real unit test for MQTT

2015-01-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4631. -- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Add real unit test for

[jira] [Updated] (SPARK-5063) Display more helpful error messages for several invalid operations

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5063: -- Target Version/s: 1.2.1 Display more helpful error messages for several invalid operations

[jira] [Commented] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-01-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286447#comment-14286447 ] Davies Liu commented on SPARK-4939: --- Sent out a PR to change the local scheduler to

[jira] [Commented] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286445#comment-14286445 ] Apache Spark commented on SPARK-4939: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-944) Give example of writing to HBase from Spark Streaming

2015-01-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286639#comment-14286639 ] Tathagata Das commented on SPARK-944: - I am closing this JIRA because this is not

[jira] [Commented] (SPARK-4586) Python API for ML Pipeline

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286693#comment-14286693 ] Apache Spark commented on SPARK-4586: - User 'mengxr' has created a pull request for

[jira] [Updated] (SPARK-5063) Display more helpful error messages for several invalid operations

2015-01-21 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5063: -- Summary: Display more helpful error messages for several invalid operations (was: Raise more helpful

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-21 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286706#comment-14286706 ] Alexander Ulanov commented on SPARK-5256: - I've implemented my proposition with

[jira] [Commented] (SPARK-2546) Configuration object thread safety issue

2015-01-21 Thread Tsuyoshi OZAWA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286971#comment-14286971 ] Tsuyoshi OZAWA commented on SPARK-2546: --- Now HADOOP-11209, the problem reported by

[jira] [Resolved] (SPARK-3424) KMeans Plus Plus is too slow

2015-01-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-3424. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4144

[jira] [Commented] (SPARK-5297) File Streams do not work with custom key/values

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286750#comment-14286750 ] Apache Spark commented on SPARK-5297: - User 'jerryshao' has created a pull request for

[jira] [Commented] (SPARK-4786) Parquet filter pushdown for BYTE and SHORT types

2015-01-21 Thread Yash Datta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287004#comment-14287004 ] Yash Datta commented on SPARK-4786: --- https://github.com/apache/spark/pull/4156 Parquet

[jira] [Commented] (SPARK-4786) Parquet filter pushdown for BYTE and SHORT types

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287003#comment-14287003 ] Apache Spark commented on SPARK-4786: - User 'saucam' has created a pull request for

[jira] [Issue Comment Deleted] (SPARK-4786) Parquet filter pushdown for BYTE and SHORT types

2015-01-21 Thread Yash Datta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yash Datta updated SPARK-4786: -- Comment: was deleted (was: https://github.com/apache/spark/pull/4156) Parquet filter pushdown for

[jira] [Commented] (SPARK-5342) Allow long running Spark apps to run on secure YARN/HDFS

2015-01-21 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287018#comment-14287018 ] Hari Shreedharan commented on SPARK-5342: - I am considering just copying the

[jira] [Created] (SPARK-5363) Spark 1.2 freeze without error notification

2015-01-21 Thread Tassilo Klein (JIRA)
Tassilo Klein created SPARK-5363: Summary: Spark 1.2 freeze without error notification Key: SPARK-5363 URL: https://issues.apache.org/jira/browse/SPARK-5363 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5347) InputMetrics bug when inputSplit is not instanceOf FileSplit

2015-01-21 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286738#comment-14286738 ] Hong Shen commented on SPARK-5347: -- In addition, we will use some other inputFormat and

[jira] [Issue Comment Deleted] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-01-21 Thread Victor Tso (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Victor Tso updated SPARK-4105: -- Comment: was deleted (was: What's the fix version?) FAILED_TO_UNCOMPRESS(5) errors when fetching

[jira] [Updated] (SPARK-5351) Can't zip RDDs with unequal numbers of partitions in ReplicatedVertexView.upgrade()

2015-01-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-5351: Description: If the value of 'spark.default.parallelism' does not match the number of

[jira] [Commented] (SPARK-5357) Upgrade from commons-codec 1.5

2015-01-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14286728#comment-14286728 ] Apache Spark commented on SPARK-5357: - User 'MattWhelan' has created a pull request

  1   2   >