[jira] [Updated] (SPARK-3580) Add Consistent Method To Get Number of RDD Partitions Across Different Languages

2014-09-17 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3580: --- Labels: starter (was: ) Add Consistent Method To Get Number of RDD Partitions Across

[jira] [Commented] (SPARK-3580) Add Consistent Method To Get Number of RDD Partitions Across Different Languages

2014-09-17 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138460#comment-14138460 ] Patrick Wendell commented on SPARK-3580: Yeah I think it's a good idea to add

[jira] [Updated] (SPARK-3579) Jekyll doc generation is different across environments

2014-09-17 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3579: --- Description: This can result in a lot of false changes when someone alters something with

[jira] [Commented] (SPARK-3578) GraphGenerators.sampleLogNormal sometimes returns too-large result

2014-09-17 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138480#comment-14138480 ] Patrick Wendell commented on SPARK-3578: @ankurdave, could you tag stuff as GraphX

[jira] [Updated] (SPARK-3578) GraphGenerators.sampleLogNormal sometimes returns too-large result

2014-09-17 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3578: --- Component/s: GraphX GraphGenerators.sampleLogNormal sometimes returns too-large result

[jira] [Resolved] (SPARK-3333) Large number of partitions causes OOM

2014-09-17 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-. Resolution: Fixed This was documented in the release upgrade notes, so I think we're all

[jira] [Resolved] (SPARK-3547) Maybe we should not simply make return code 1 equal to CLASS_NOT_FOUND

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3547. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: WangTaoTheTonic Resolved

[jira] [Resolved] (SPARK-3579) Jekyll doc generation is different across environments

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3579. Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 2443

[jira] [Resolved] (SPARK-1477) Add the lifecycle interface

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1477. Resolution: Won't Fix Unless we are planning to interact with these components in a generic

[jira] [Comment Edited] (SPARK-1477) Add the lifecycle interface

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139218#comment-14139218 ] Patrick Wendell edited comment on SPARK-1477 at 9/18/14 5:34 PM:

[jira] [Resolved] (SPARK-3566) .gitignore and .rat-excludes should consider Windows cmd file and Emacs' backup files

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3566. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Kousuke Saruta

[jira] [Resolved] (SPARK-3589) [Minor]Remove redundant code in deploy module

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3589. Resolution: Fixed Fix Version/s: 1.2.0 1.1.1 Assignee:

[jira] [Updated] (SPARK-3587) Spark SQL can't support lead() over() window function

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3587: --- Labels: (was: features) Spark SQL can't support lead() over() window function

[jira] [Updated] (SPARK-3574) Shuffle finish time always reported as -1

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3574: --- Component/s: Spark Core Shuffle finish time always reported as -1

[jira] [Updated] (SPARK-2672) Support compression in wholeFile()

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2672: --- Summary: Support compression in wholeFile() (was: support compressed file in wholeFile())

[jira] [Updated] (SPARK-2761) Merge similar code paths in ExternalSorter and EAOM

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2761: --- Component/s: Spark Core Merge similar code paths in ExternalSorter and EAOM

[jira] [Commented] (SPARK-3573) Dataset

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140017#comment-14140017 ] Patrick Wendell commented on SPARK-3573: [~sandyr] This is a good question I'm not

[jira] [Commented] (SPARK-3270) Spark API for Application Extensions

2014-09-18 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140043#comment-14140043 ] Patrick Wendell commented on SPARK-3270: Hey There, For the particular use case

[jira] [Commented] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140940#comment-14140940 ] Patrick Wendell commented on SPARK-3604: Yeah good catch, we should fix this.

[jira] [Updated] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3604: --- Target Version/s: 1.2.0 unbounded recursion in getNumPartitions triggers stack overflow for

[jira] [Updated] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3604: --- Priority: Critical (was: Blocker) unbounded recursion in getNumPartitions triggers stack

[jira] [Commented] (SPARK-2175) Null values when using App trait.

2014-09-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14140947#comment-14140947 ] Patrick Wendell commented on SPARK-2175: Thanks for reporting this - can someone

[jira] [Created] (SPARK-3615) Kafka test should not hard code Zookeeper port

2014-09-20 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-3615: -- Summary: Kafka test should not hard code Zookeeper port Key: SPARK-3615 URL: https://issues.apache.org/jira/browse/SPARK-3615 Project: Spark Issue Type:

[jira] [Updated] (SPARK-3610) Unable to load app logs for MLLib programs in history server

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3610: --- Priority: Critical (was: Major) Unable to load app logs for MLLib programs in history

[jira] [Updated] (SPARK-3610) History server log name should not be based on user input

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3610: --- Description: Right now we don't have a Original bug report: The default log files for the

[jira] [Updated] (SPARK-3610) History server log name should not be based on user input

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3610: --- Description: Right now we use the user-defined application name when creating the logging

[jira] [Updated] (SPARK-3610) History server log name should not be based on user input

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3610: --- Description: Right now we use the user-defined application name when creating the logging

[jira] [Updated] (SPARK-3610) History server log name should not be based on user input

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3610: --- Fix Version/s: (was: 1.1.0) History server log name should not be based on user input

[jira] [Updated] (SPARK-3610) History server log name should not be based on user input

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3610: --- Component/s: (was: Web UI) Target Version/s: 1.2.0 History server log name

[jira] [Commented] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142257#comment-14142257 ] Patrick Wendell commented on SPARK-3604: After looking at the PR - I think the

[jira] [Comment Edited] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142257#comment-14142257 ] Patrick Wendell edited comment on SPARK-3604 at 9/21/14 12:35 AM:

[jira] [Resolved] (SPARK-3599) Avoid loading and printing properties file content frequently

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3599. Resolution: Fixed Assignee: WangTaoTheTonic Avoid loading and printing properties

[jira] [Commented] (SPARK-1597) Add a version of reduceByKey that takes the Partitioner as a second argument

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142335#comment-14142335 ] Patrick Wendell commented on SPARK-1597: See relevant comment here:

[jira] [Updated] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2630: --- Target Version/s: 1.2.0 (was: 1.1.0) Input data size of CoalescedRDD is incorrect

[jira] [Updated] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2630: --- Assignee: Andrew Ash Input data size of CoalescedRDD is incorrect

[jira] [Updated] (SPARK-2630) Input data size of CoalescedRDD is incorrect

2014-09-20 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2630: --- Priority: Blocker (was: Critical) Input data size of CoalescedRDD is incorrect

[jira] [Updated] (SPARK-3595) Spark should respect configured OutputCommitter when using saveAsHadoopFile

2014-09-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3595: --- Assignee: Ian Hummel Spark should respect configured OutputCommitter when using

[jira] [Resolved] (SPARK-3595) Spark should respect configured OutputCommitter when using saveAsHadoopFile

2014-09-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3595. Resolution: Fixed Fix Version/s: 1.2.0 Target Version/s: 1.2.0 Thanks I've

[jira] [Commented] (SPARK-3622) Provide a custom transformation that can output multiple RDDs

2014-09-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142865#comment-14142865 ] Patrick Wendell commented on SPARK-3622: Do you mind clarifying a little bit how

[jira] [Comment Edited] (SPARK-3622) Provide a custom transformation that can output multiple RDDs

2014-09-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142865#comment-14142865 ] Patrick Wendell edited comment on SPARK-3622 at 9/22/14 3:24 AM:

[jira] [Updated] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-21 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3633: --- Summary: Fetches failure observed after SPARK-2711 (was: PR 1707/commit #4fde28c is

[jira] [Commented] (SPARK-3622) Provide a custom transformation that can output multiple RDDs

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142982#comment-14142982 ] Patrick Wendell commented on SPARK-3622: In Spark most RDD operations are lazy, so

[jira] [Comment Edited] (SPARK-3622) Provide a custom transformation that can output multiple RDDs

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142982#comment-14142982 ] Patrick Wendell edited comment on SPARK-3622 at 9/22/14 7:35 AM:

[jira] [Comment Edited] (SPARK-3622) Provide a custom transformation that can output multiple RDDs

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142982#comment-14142982 ] Patrick Wendell edited comment on SPARK-3622 at 9/22/14 7:35 AM:

[jira] [Created] (SPARK-3648) Provide a script for fetching remote PR's for review

2014-09-22 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-3648: -- Summary: Provide a script for fetching remote PR's for review Key: SPARK-3648 URL: https://issues.apache.org/jira/browse/SPARK-3648 Project: Spark Issue

[jira] [Updated] (SPARK-3648) Provide a script for fetching remote PR's for review

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3648: --- Issue Type: New Feature (was: Bug) Provide a script for fetching remote PR's for review

[jira] [Updated] (SPARK-1720) use LD_LIBRARY_PATH instead of -Djava.library.path

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1720: --- Priority: Critical (was: Major) Target Version/s: 1.2.0 use LD_LIBRARY_PATH

[jira] [Commented] (SPARK-1720) use LD_LIBRARY_PATH instead of -Djava.library.path

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14143930#comment-14143930 ] Patrick Wendell commented on SPARK-1720: Another user reported this issue, so

[jira] [Updated] (SPARK-1860) Standalone Worker cleanup should not clean up running executors

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1860: --- Priority: Blocker (was: Critical) Standalone Worker cleanup should not clean up running

[jira] [Updated] (SPARK-1860) Standalone Worker cleanup should not clean up running executors

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1860: --- Target Version/s: 1.2.0 Standalone Worker cleanup should not clean up running executors

[jira] [Updated] (SPARK-3032) Potential bug when running sort-based shuffle with sorting using TimSort

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3032: --- Priority: Critical (was: Major) Potential bug when running sort-based shuffle with sorting

[jira] [Updated] (SPARK-3032) Potential bug when running sort-based shuffle with sorting using TimSort

2014-09-22 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3032: --- Target Version/s: 1.2.0 Potential bug when running sort-based shuffle with sorting using

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-09-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14144490#comment-14144490 ] Patrick Wendell commented on SPARK-3633: [~nravi] if you are trying to debug this,

[jira] [Resolved] (SPARK-3647) Shaded Guava patch causes access issues with package private classes

2014-09-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3647. Resolution: Fixed Fixed by Marcelo in this patch:

[jira] [Resolved] (SPARK-3612) Executor shouldn't quit if heartbeat message fails to reach the driver

2014-09-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3612. Resolution: Fixed Fix Version/s: 1.2.0 1.1.1 Assignee:

[jira] [Updated] (SPARK-3032) Potential bug when running sort-based shuffle with sorting using TimSort

2014-09-23 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3032: --- Priority: Blocker (was: Critical) Potential bug when running sort-based shuffle with

[jira] [Resolved] (SPARK-3659) Set EC2 version to 1.1.0 in master branch

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3659. Resolution: Fixed Fix Version/s: 1.2.0 https://github.com/apache/spark/pull/2510

[jira] [Updated] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2691: --- Assignee: Timothy Chen (was: Tim Chen) Allow Spark on Mesos to be launched with Docker

[jira] [Updated] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2691: --- Assignee: Tim Chen (was: Timothy Hunter) Allow Spark on Mesos to be launched with Docker

[jira] [Resolved] (SPARK-3604) unbounded recursion in getNumPartitions triggers stack overflow for large UnionRDD

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3604. Resolution: Not a Problem unbounded recursion in getNumPartitions triggers stack overflow

[jira] [Updated] (SPARK-3681) Failed to serialized ArrayType or MapType after accessing them in Python

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3681: --- Component/s: PySpark Failed to serialized ArrayType or MapType after accessing them in

[jira] [Updated] (SPARK-3663) Document SPARK_LOG_DIR and SPARK_PID_DIR

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3663: --- Component/s: Documentation Document SPARK_LOG_DIR and SPARK_PID_DIR

[jira] [Updated] (SPARK-3610) History server log name should not be based on user input

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3610: --- Component/s: Spark Core History server log name should not be based on user input

[jira] [Resolved] (SPARK-3615) Kafka test should not hard code Zookeeper port

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3615. Resolution: Fixed https://github.com/apache/spark/pull/2483 Kafka test should not hard

[jira] [Created] (SPARK-3686) flume.SparkSinkSuite.Success is flaky

2014-09-24 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-3686: -- Summary: flume.SparkSinkSuite.Success is flaky Key: SPARK-3686 URL: https://issues.apache.org/jira/browse/SPARK-3686 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-546) Support full outer join and multiple join in a single shuffle

2014-09-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-546. --- Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Aaron Staple Fixed by:

[jira] [Resolved] (SPARK-2778) Add unit tests for Yarn integration

2014-09-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2778. Resolution: Fixed Fix Version/s: 1.2.0 Fixed by:

[jira] [Commented] (SPARK-3687) Spark hang while processing more than 100 sequence files

2014-09-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14147465#comment-14147465 ] Patrick Wendell commented on SPARK-3687: Can you perform a jstack on the executor

[jira] [Resolved] (SPARK-3576) Provide script for creating the Spark AMI from scratch

2014-09-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3576. Resolution: Fixed This was fixed in spark-ec2 itself Provide script for creating the

[jira] [Updated] (SPARK-3288) All fields in TaskMetrics should be private and use getters/setters

2014-09-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3288: --- Assignee: (was: Andrew Or) All fields in TaskMetrics should be private and use

[jira] [Updated] (SPARK-3288) All fields in TaskMetrics should be private and use getters/setters

2014-09-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3288: --- Labels: starter (was: ) All fields in TaskMetrics should be private and use getters/setters

[jira] [Created] (SPARK-3694) Allow printing object graph of tasks/RDD's with a debug flag

2014-09-25 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-3694: -- Summary: Allow printing object graph of tasks/RDD's with a debug flag Key: SPARK-3694 URL: https://issues.apache.org/jira/browse/SPARK-3694 Project: Spark

[jira] [Updated] (SPARK-3694) Allow printing object graph of tasks/RDD's with a debug flag

2014-09-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3694: --- Description: This would be useful for debugging extra references inside of RDD's Here is an

[jira] [Resolved] (SPARK-3584) sbin/slaves doesn't work when we use password authentication for SSH

2014-09-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3584. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Kousuke Saruta

[jira] [Resolved] (SPARK-3686) flume.SparkSinkSuite.Success is flaky

2014-09-25 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3686. Resolution: Fixed Resolved by: https://github.com/apache/spark/pull/2531

[jira] [Resolved] (SPARK-3695) Enable to show host and port in block fetch failure

2014-09-26 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3695. Resolution: Fixed Fix Version/s: 1.2.0 Enable to show host and port in block fetch

[jira] [Resolved] (SPARK-2655) Change the default logging level to WARN

2014-09-27 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2655. Resolution: Won't Fix Change the default logging level to WARN

[jira] [Created] (SPARK-3709) BroadcastSuite.Unpersisting rg.apache.spark.broadcast.BroadcastSuite.Unpersisting TorrentBroadcast is flaky

2014-09-27 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-3709: -- Summary: BroadcastSuite.Unpersisting rg.apache.spark.broadcast.BroadcastSuite.Unpersisting TorrentBroadcast is flaky Key: SPARK-3709 URL:

[jira] [Created] (SPARK-3710) YARN integration test is flaky

2014-09-27 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-3710: -- Summary: YARN integration test is flaky Key: SPARK-3710 URL: https://issues.apache.org/jira/browse/SPARK-3710 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-3710) YARN integration test is flaky

2014-09-27 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3710: --- Description: This has been regularly failing the master build: Example failure:

[jira] [Commented] (SPARK-3685) Spark's local dir scheme is not configurable

2014-09-28 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151350#comment-14151350 ] Patrick Wendell commented on SPARK-3685: [~andrewor14] changing the use of local

[jira] [Updated] (SPARK-2626) Stop SparkContext in all examples

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2626: --- Labels: starter (was: ) Stop SparkContext in all examples

[jira] [Commented] (SPARK-2805) update akka to version 2.3

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151976#comment-14151976 ] Patrick Wendell commented on SPARK-2805: I'm working on publishing akka today.

[jira] [Commented] (SPARK-3479) Have Jenkins show which tests failed in his GitHub messages

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151980#comment-14151980 ] Patrick Wendell commented on SPARK-3479: I think this is more than minor - it

[jira] [Updated] (SPARK-3479) Have Jenkins show which tests failed in his GitHub messages

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3479: --- Priority: Major (was: Minor) Have Jenkins show which tests failed in his GitHub messages

[jira] [Updated] (SPARK-3479) Have Jenkins show which tests failed in his GitHub messages

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3479: --- Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-2230) Have Jenkins

[jira] [Resolved] (SPARK-2230) Improvements to Jenkins QA Harness

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2230. Resolution: Fixed This was tracking an earlier initiative to clean up this harness. Since

[jira] [Commented] (SPARK-2331) SparkContext.emptyRDD has wrong return type

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14152018#comment-14152018 ] Patrick Wendell commented on SPARK-2331: Yeah we could have made this a wider type

[jira] [Resolved] (SPARK-2331) SparkContext.emptyRDD has wrong return type

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2331. Resolution: Won't Fix SparkContext.emptyRDD has wrong return type

[jira] [Comment Edited] (SPARK-2331) SparkContext.emptyRDD has wrong return type

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14152018#comment-14152018 ] Patrick Wendell edited comment on SPARK-2331 at 9/29/14 6:34 PM:

[jira] [Updated] (SPARK-3694) Allow printing object graph of tasks/RDD's with a debug flag

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3694: --- Labels: starter (was: ) Allow printing object graph of tasks/RDD's with a debug flag

[jira] [Updated] (SPARK-2548) JavaRecoverableWordCount is missing

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2548: --- Labels: starter (was: ) JavaRecoverableWordCount is missing

[jira] [Updated] (SPARK-3504) KMeans optimization: track distances and unmoved cluster centers across iterations

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3504: --- Summary: KMeans optimization: track distances and unmoved cluster centers across iterations

[jira] [Commented] (SPARK-3504) KMeans optimization: track distances and unmoved cluster centers across iterations

2014-09-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14152730#comment-14152730 ] Patrick Wendell commented on SPARK-3504: I just updated the title to make it more

[jira] [Reopened] (SPARK-3007) Add Dynamic Partition support to Spark Sql hive

2014-09-30 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-3007: This was reverted based on causing large numbers of test failures. Add Dynamic Partition

[jira] [Reopened] (SPARK-2778) Add unit tests for Yarn integration

2014-09-30 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-2778: This has been reverted due to failing tests. Add unit tests for Yarn integration

[jira] [Updated] (SPARK-2778) Add unit tests for Yarn integration

2014-09-30 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2778: --- Attachment: yarn-logs.txt I'm attaching logs from the bad test. Add unit tests for Yarn

[jira] [Created] (SPARK-3744) FlumeStreamSuite will fail during port contention

2014-09-30 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-3744: -- Summary: FlumeStreamSuite will fail during port contention Key: SPARK-3744 URL: https://issues.apache.org/jira/browse/SPARK-3744 Project: Spark Issue

[jira] [Resolved] (SPARK-3757) mvn clean doesn't delete some files

2014-10-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3757. Resolution: Fixed Fix Version/s: 1.2.0 Resolved by:

[jira] [Resolved] (SPARK-3756) Include possible MultiException when detecting port collisions

2014-10-01 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3756. Resolution: Fixed Fix Version/s: 1.2.0 1.1.1 Assignee:

<    9   10   11   12   13   14   15   16   17   18   >