[jira] [Created] (SPARK-3556) Monitoring and debugging improvements (Spark 1.2)

2014-09-16 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-3556: - Summary: Monitoring and debugging improvements (Spark 1.2) Key: SPARK-3556 URL: https://issues.apache.org/jira/browse/SPARK-3556 Project: Spark Issue Type: Umbrell

[jira] [Commented] (SPARK-3535) Spark on Mesos not correctly setting heap overhead

2014-09-16 Thread Brenden Matthews (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136321#comment-14136321 ] Brenden Matthews commented on SPARK-3535: - I've updated the patch to include the s

[jira] [Updated] (SPARK-3556) Monitoring and debugging improvements (Spark 1.2)

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-3556: -- Issue Type: Epic (was: Umbrella) > Monitoring and debugging improvements (Spark 1.2) >

[jira] [Updated] (SPARK-3556) Monitoring and debugging improvements (Spark 1.2)

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-3556: -- Epic Name: Monitoring and debugging improvements (Spark 1.2) > Monitoring and debugging improvements (Sp

[jira] [Created] (SPARK-3557) Yarn client config prioritization is backwards

2014-09-16 Thread Andrew Or (JIRA)
Andrew Or created SPARK-3557: Summary: Yarn client config prioritization is backwards Key: SPARK-3557 URL: https://issues.apache.org/jira/browse/SPARK-3557 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-3067) JobProgressPage could not show Fair Scheduler Pools section sometimes

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-3067: -- Component/s: Web UI > JobProgressPage could not show Fair Scheduler Pools section sometimes > --

[jira] [Commented] (SPARK-3067) JobProgressPage could not show Fair Scheduler Pools section sometimes

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136387#comment-14136387 ] Josh Rosen commented on SPARK-3067: --- Do you think SPARK-1208 sounds related to this? >

[jira] [Resolved] (SPARK-2414) Remove jquery

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-2414. --- Resolution: Won't Fix Resolving this as "Won't Fix", since several of the web UI visualization PRs wi

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2014-09-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136407#comment-14136407 ] Michael Armbrust commented on SPARK-2883: - #1 should be easy, that information is

[jira] [Created] (SPARK-3558) Throw exception for concurrently-running SparkContexts / StreamingContexts in the same JVM

2014-09-16 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-3558: - Summary: Throw exception for concurrently-running SparkContexts / StreamingContexts in the same JVM Key: SPARK-3558 URL: https://issues.apache.org/jira/browse/SPARK-3558 Pr

[jira] [Commented] (SPARK-1455) Determine which test suites to run based on code changes

2014-09-16 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136418#comment-14136418 ] Nicholas Chammas commented on SPARK-1455: - Which approach do y'all prefer? # Have

[jira] [Resolved] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-2463. --- Resolution: Invalid Going to resolve this as "Invalid", since we don't currently support concurrently

[jira] [Commented] (SPARK-1455) Determine which test suites to run based on code changes

2014-09-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136424#comment-14136424 ] Michael Armbrust commented on SPARK-1455: - I'd prefer the first. For testing of i

[jira] [Commented] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-09-16 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136432#comment-14136432 ] Nicholas Chammas commented on SPARK-2463: - [~joshrosen] - Though the most recent c

[jira] [Reopened] (SPARK-2463) Creating multiple StreamingContexts from shell generates duplicate Streaming tabs in UI

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reopened SPARK-2463: --- Assignee: Josh Rosen [~nchammas] Good point. I'll re-open and investigate. > Creating multiple Str

[jira] [Resolved] (SPARK-3555) UI port contention suite flakey

2014-09-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3555. Resolution: Fixed Fix Version/s: 1.1.1 1.2.0 Issue resolved by pul

[jira] [Commented] (SPARK-3261) KMeans clusterer can return duplicate cluster centers

2014-09-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136445#comment-14136445 ] Apache Spark commented on SPARK-3261: - User 'derrickburns' has created a pull request

[jira] [Commented] (SPARK-3424) KMeans Plus Plus is too slow

2014-09-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136446#comment-14136446 ] Apache Spark commented on SPARK-3424: - User 'derrickburns' has created a pull request

[jira] [Commented] (SPARK-3219) K-Means clusterer should support Bregman distance functions

2014-09-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136444#comment-14136444 ] Apache Spark commented on SPARK-3219: - User 'derrickburns' has created a pull request

[jira] [Commented] (SPARK-3218) K-Means clusterer can fail on degenerate data

2014-09-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136443#comment-14136443 ] Apache Spark commented on SPARK-3218: - User 'derrickburns' has created a pull request

[jira] [Updated] (SPARK-2004) QA Automation

2014-09-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2004: --- Assignee: (was: Xiangrui Meng) > QA Automation > - > > Key: SPARK-2004

[jira] [Updated] (SPARK-2005) Investigate linux container-based solution

2014-09-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2005: --- Assignee: (was: Xiangrui Meng) > Investigate linux container-based solution >

[jira] [Updated] (SPARK-3129) Prevent data loss in Spark Streaming

2014-09-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3129: --- Component/s: Streaming > Prevent data loss in Spark Streaming > --

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2014-09-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136455#comment-14136455 ] Yin Huai commented on SPARK-2883: - The column projection issue is actually a bug. Because

[jira] [Commented] (SPARK-2593) Add ability to pass an existing Akka ActorSystem into Spark

2014-09-16 Thread Helena Edelson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136463#comment-14136463 ] Helena Edelson commented on SPARK-2593: --- [~pwendell] I've spent no time thus far bec

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2014-09-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136478#comment-14136478 ] Yin Huai commented on SPARK-2883: - I have created https://issues.apache.org/jira/browse/SP

[jira] [Created] (SPARK-3559) appendReadColumnIDs and appendReadColumnNames introduce unnecessary columns in the lists of needed column ids and column names stored in hiveConf

2014-09-16 Thread Yin Huai (JIRA)
Yin Huai created SPARK-3559: --- Summary: appendReadColumnIDs and appendReadColumnNames introduce unnecessary columns in the lists of needed column ids and column names stored in hiveConf Key: SPARK-3559 URL: https://issu

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2014-09-16 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136481#comment-14136481 ] Zhan Zhang commented on SPARK-2883: --- Sorry, I mean column pruning. Currently what I see

[jira] [Issue Comment Deleted] (SPARK-2883) Spark Support for ORCFile format

2014-09-16 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-2883: -- Comment: was deleted (was: Sorry, I mean column pruning. Currently what I see is that HiveTableScan rea

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2014-09-16 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136490#comment-14136490 ] Zhan Zhang commented on SPARK-2883: --- ORC format is self describing with schema of itself

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2014-09-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136521#comment-14136521 ] Yin Huai commented on SPARK-2883: - Oh, seems types (including field names) of the flattene

[jira] [Updated] (SPARK-1503) Implement Nesterov's accelerated first-order method

2014-09-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1503: - Assignee: (was: Xiangrui Meng) > Implement Nesterov's accelerated first-order method > ---

[jira] [Updated] (SPARK-3357) Internal log messages should be set at DEBUG level instead of INFO

2014-09-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3357: - Assignee: (was: Xiangrui Meng) > Internal log messages should be set at DEBUG level instead of

[jira] [Updated] (SPARK-3258) Python API for streaming MLlib algorithms

2014-09-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3258: - Assignee: (was: Xiangrui Meng) > Python API for streaming MLlib algorithms > -

[jira] [Updated] (SPARK-1486) Support multi-model training in MLlib

2014-09-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1486: - Assignee: Burak Yavuz (was: Xiangrui Meng) > Support multi-model training in MLlib >

[jira] [Resolved] (SPARK-2944) sc.makeRDD doesn't distribute partitions evenly

2014-09-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2944. -- Resolution: Cannot Reproduce Closing this one now because I couldn't find an easy way to reprodu

[jira] [Updated] (SPARK-3066) Support recommendAll in matrix factorization model

2014-09-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3066: - Assignee: (was: Xiangrui Meng) > Support recommendAll in matrix factorization model >

[jira] [Updated] (SPARK-3559) appendReadColumnIDs and appendReadColumnNames introduce unnecessary columns in the lists of needed column ids and column names stored in hiveConf

2014-09-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-3559: Description: Because we are using the same hiveConf and we are currently using ColumnProjectionUtils.append

[jira] [Updated] (SPARK-3559) appendReadColumnIDs and appendReadColumnNames introduce unnecessary columns in the lists of needed column ids and column names stored in hiveConf

2014-09-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-3559: Priority: Blocker (was: Critical) > appendReadColumnIDs and appendReadColumnNames introduce unnecessary col

[jira] [Commented] (SPARK-3534) Avoid running MLlib and Streaming tests when testing SQL PRs

2014-09-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136562#comment-14136562 ] Apache Spark commented on SPARK-3534: - User 'nchammas' has created a pull request for

[jira] [Commented] (SPARK-3218) K-Means clusterer can fail on degenerate data

2014-09-16 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136560#comment-14136560 ] Derrick Burns commented on SPARK-3218: -- I created a pull request as you suggested. ht

[jira] [Commented] (SPARK-1455) Determine which test suites to run based on code changes

2014-09-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136561#comment-14136561 ] Apache Spark commented on SPARK-1455: - User 'nchammas' has created a pull request for

[jira] [Commented] (SPARK-2593) Add ability to pass an existing Akka ActorSystem into Spark

2014-09-16 Thread Tupshin Harper (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136565#comment-14136565 ] Tupshin Harper commented on SPARK-2593: --- [~pwendell] I'd be curious to hear any of t

[jira] [Commented] (SPARK-3219) K-Means clusterer should support Bregman distance functions

2014-09-16 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136564#comment-14136564 ] Derrick Burns commented on SPARK-3219: -- I went ahead and created a pull request. I w

[jira] [Created] (SPARK-3560) In yarn-cluster mode, jars are distributed through multiple mechanisms.

2014-09-16 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-3560: - Summary: In yarn-cluster mode, jars are distributed through multiple mechanisms. Key: SPARK-3560 URL: https://issues.apache.org/jira/browse/SPARK-3560 Project: Spark

[jira] [Updated] (SPARK-3560) In yarn-cluster mode, jars are distributed through multiple mechanisms.

2014-09-16 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-3560: -- Component/s: YARN > In yarn-cluster mode, jars are distributed through multiple mechanisms. > --

[jira] [Closed] (SPARK-2966) Add an approximation algorithm for hierarchical clustering to MLlib

2014-09-16 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Ishikawa closed SPARK-2966. -- Resolution: Duplicate > Add an approximation algorithm for hierarchical clustering to MLlib > --

[jira] [Commented] (SPARK-2966) Add an approximation algorithm for hierarchical clustering to MLlib

2014-09-16 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136583#comment-14136583 ] Yu Ishikawa commented on SPARK-2966: The issue is merged to SPARK-2429. > Add an appr

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-09-16 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136601#comment-14136601 ] Yu Ishikawa commented on SPARK-2429: Hi [~rnowling], I have read the papers which you

[jira] [Commented] (SPARK-3490) Alleviate port collisions during tests

2014-09-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136632#comment-14136632 ] Andrew Or commented on SPARK-3490: -- Backported. Closing this. > Alleviate port collision

[jira] [Resolved] (SPARK-3490) Alleviate port collisions during tests

2014-09-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3490. Resolution: Fixed Fix Version/s: 1.1.1 Issue resolved by pull request 2415 [https://g

[jira] [Commented] (SPARK-3560) In yarn-cluster mode, jars are distributed through multiple mechanisms.

2014-09-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136637#comment-14136637 ] Andrew Or commented on SPARK-3560: -- [~sandyr] So is the fix simply to not set `spark.jars

[jira] [Updated] (SPARK-3560) In yarn-cluster mode, jars are distributed through multiple mechanisms.

2014-09-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3560: - Target Version/s: 1.1.1, 1.2.0 (was: 1.1.1) > In yarn-cluster mode, jars are distributed through multiple

[jira] [Updated] (SPARK-3560) In yarn-cluster mode, jars are distributed through multiple mechanisms.

2014-09-16 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3560: - Affects Version/s: 1.1.0 > In yarn-cluster mode, jars are distributed through multiple mechanisms. > -

[jira] [Updated] (SPARK-3547) Maybe we should not simply make return code 1 equal to CLASS_NOT_FOUND

2014-09-16 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] WangTaoTheTonic updated SPARK-3547: --- Description: It incurred runtime exception when hadoop version is not A.B.* format, which is

[jira] [Commented] (SPARK-3547) Maybe we should not simply make return code 1 equal to CLASS_NOT_FOUND

2014-09-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136663#comment-14136663 ] Apache Spark commented on SPARK-3547: - User 'WangTaoTheTonic' has created a pull reque

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-09-16 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136671#comment-14136671 ] RJ Nowling commented on SPARK-2429: --- Great! I look forward to seeing your implementatio

[jira] [Created] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
Oleg Zhurakousky created SPARK-3561: --- Summary: Native Hadoop/YARN integration for batch/ETL workloads Key: SPARK-3561 URL: https://issues.apache.org/jira/browse/SPARK-3561 Project: Spark Is

[jira] [Updated] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Zhurakousky updated SPARK-3561: Attachment: Spark_3561.pdf Detailed design document > Native Hadoop/YARN integration for ba

[jira] [Commented] (SPARK-1486) Support multi-model training in MLlib

2014-09-16 Thread Anant Daksh Asthana (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136696#comment-14136696 ] Anant Daksh Asthana commented on SPARK-1486: That sounds very true and relevan

[jira] [Updated] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Zhurakousky updated SPARK-3561: Description: Currently Spark provides integration with external resource-managers such as A

[jira] [Updated] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Zhurakousky updated SPARK-3561: Description: Currently Spark provides integration with external resource-managers such as A

[jira] [Updated] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Zhurakousky updated SPARK-3561: Description: Currently Spark provides integration with external resource-managers such as A

[jira] [Updated] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Zhurakousky updated SPARK-3561: Description: Currently Spark provides integration with external resource-managers such as A

[jira] [Updated] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Zhurakousky updated SPARK-3561: Description: Currently Spark provides integration with external resource-managers such as A

[jira] [Updated] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Zhurakousky updated SPARK-3561: Attachment: (was: Spark_3561.pdf) > Native Hadoop/YARN integration for batch/ETL workloa

[jira] [Updated] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oleg Zhurakousky updated SPARK-3561: Attachment: SPARK-3561.pdf > Native Hadoop/YARN integration for batch/ETL workloads > --

[jira] [Commented] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Oleg Zhurakousky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136702#comment-14136702 ] Oleg Zhurakousky commented on SPARK-3561: - PR is available - https://github.com/ap

[jira] [Commented] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136706#comment-14136706 ] Apache Spark commented on SPARK-3561: - User 'olegz' has created a pull request for thi

[jira] [Commented] (SPARK-3489) support rdd.zip(rdd1, rdd2,...) with variable number of rdds as params

2014-09-16 Thread Mohit Jaggi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136738#comment-14136738 ] Mohit Jaggi commented on SPARK-3489: Proposed diff --- MohitMacBook:spark mohit$ git

[jira] [Resolved] (SPARK-744) BlockManagerUI with no RDD: java.lang.UnsupportedOperationException: empty.reduceLeft

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-744. -- Resolution: Incomplete > BlockManagerUI with no RDD: java.lang.UnsupportedOperationException: > empty.re

[jira] [Updated] (SPARK-611) Allow JStack to be run from web UI

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-611: - Target Version/s: 1.2.0 > Allow JStack to be run from web UI > -- > >

[jira] [Updated] (SPARK-611) Allow JStack to be run from web UI

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-611: - Summary: Allow JStack to be run from web UI (was: Expose basic JVM metrics in WebUI.) > Allow JStack to b

[jira] [Updated] (SPARK-2105) SparkUI doesn't remove active stages that failed

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-2105: -- Component/s: Web UI > SparkUI doesn't remove active stages that failed > ---

[jira] [Updated] (SPARK-1622) Expose input split(s) accessed by a task in UI or logs

2014-09-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-1622: -- Component/s: Web UI > Expose input split(s) accessed by a task in UI or logs > -

[jira] [Updated] (SPARK-3489) support rdd.zip(rdd1, rdd2,...) with variable number of rdds as params

2014-09-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3489: --- Fix Version/s: (was: 1.0.3) > support rdd.zip(rdd1, rdd2,...) with variable number of rdds

[jira] [Updated] (SPARK-3489) support rdd.zip(rdd1, rdd2,...) with variable number of rdds as params

2014-09-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3489: --- Target Version/s: 1.2.0 (was: 1.0.2) > support rdd.zip(rdd1, rdd2,...) with variable number o

[jira] [Commented] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136777#comment-14136777 ] Patrick Wendell commented on SPARK-3561: Hey [~ozhurakousky] - could you provide m

[jira] [Comment Edited] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads

2014-09-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136777#comment-14136777 ] Patrick Wendell edited comment on SPARK-3561 at 9/17/14 4:54 AM: ---

[jira] [Created] (SPARK-3562) Periodic cleanup

2014-09-16 Thread xukun (JIRA)
xukun created SPARK-3562: Summary: Periodic cleanup Key: SPARK-3562 URL: https://issues.apache.org/jira/browse/SPARK-3562 Project: Spark Issue Type: New Feature Components: core Affects

[jira] [Updated] (SPARK-3562) Periodic cleanup event logs

2014-09-16 Thread xukun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xukun updated SPARK-3562: - Summary: Periodic cleanup event logs (was: Periodic cleanup) > Periodic cleanup event logs >

[jira] [Created] (SPARK-3563) Shuffle data not always clean

2014-09-16 Thread shenhong (JIRA)
shenhong created SPARK-3563: --- Summary: Shuffle data not always clean Key: SPARK-3563 URL: https://issues.apache.org/jira/browse/SPARK-3563 Project: Spark Issue Type: Bug Components: core
