[jira] [Commented] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14544987#comment-14544987 ] Josh Rosen commented on SPARK-7660: --- Note that this affects more than just Spark 1.4.0;

[jira] [Commented] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545005#comment-14545005 ] Josh Rosen commented on SPARK-7660: --- I pushed

[jira] [Assigned] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-7660: - Assignee: Josh Rosen Snappy-java buffer-sharing bug leads to data corruption / test failures

[jira] [Commented] (SPARK-7662) Exception of multi-attribute generator anlysis in projection

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545072#comment-14545072 ] Apache Spark commented on SPARK-7662: - User 'chenghao-intel' has created a pull

[jira] [Assigned] (SPARK-7662) Exception of multi-attribute generator anlysis in projection

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7662: --- Assignee: Apache Spark Exception of multi-attribute generator anlysis in projection

[jira] [Assigned] (SPARK-7662) Exception of multi-attribute generator anlysis in projection

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7662: --- Assignee: (was: Apache Spark) Exception of multi-attribute generator anlysis in

[jira] [Created] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-7660: - Summary: Snappy-java buffer-sharing bug leads to data corruption / test failures Key: SPARK-7660 URL: https://issues.apache.org/jira/browse/SPARK-7660 Project: Spark

[jira] [Commented] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-05-15 Thread Alex (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545027#comment-14545027 ] Alex commented on SPARK-2344: - Hi, How are you? I have couple of questions: 1) When are you

[jira] [Commented] (SPARK-6747) Support List as a return type in Hive UDF

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545077#comment-14545077 ] Apache Spark commented on SPARK-6747: - User 'maropu' has created a pull request for

[jira] [Updated] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7660: -- Description: snappy-java contains a bug that can lead to situations where separate SnappyOutputStream

[jira] [Updated] (SPARK-6258) Python MLlib API missing items: Clustering

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6258: - Fix Version/s: 1.4.0 Python MLlib API missing items: Clustering

[jira] [Resolved] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-15 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-7591. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6150

[jira] [Commented] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-15 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545105#comment-14545105 ] Saisai Shao commented on SPARK-7621: Hi [~jerluc], you could submit a related PR on

[jira] [Commented] (SPARK-7269) Incorrect aggregation analysis

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14544979#comment-14544979 ] Apache Spark commented on SPARK-7269: - User 'cloud-fan' has created a pull request for

[jira] [Resolved] (SPARK-6258) Python MLlib API missing items: Clustering

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-6258. -- Resolution: Fixed Issue resolved by pull request 6087

[jira] [Commented] (SPARK-7654) DataFrameReader and DataFrameWriter for input/output API

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545017#comment-14545017 ] Apache Spark commented on SPARK-7654: - User 'rxin' has created a pull request for this

[jira] [Created] (SPARK-7661) Support for dynamic allocation of executors in Kinesis Spark Streaming

2015-05-15 Thread Murtaza Kanchwala (JIRA)
Murtaza Kanchwala created SPARK-7661: Summary: Support for dynamic allocation of executors in Kinesis Spark Streaming Key: SPARK-7661 URL: https://issues.apache.org/jira/browse/SPARK-7661

[jira] [Assigned] (SPARK-7654) DataFrameReader and DataFrameWriter for input/output API

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7654: --- Assignee: Apache Spark (was: Reynold Xin) DataFrameReader and DataFrameWriter for

[jira] [Assigned] (SPARK-7654) DataFrameReader and DataFrameWriter for input/output API

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7654: --- Assignee: Reynold Xin (was: Apache Spark) DataFrameReader and DataFrameWriter for

[jira] [Assigned] (SPARK-7651) PySpark GMM predict, predictSoft should fail on bad input

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7651: --- Assignee: Apache Spark PySpark GMM predict, predictSoft should fail on bad input

[jira] [Assigned] (SPARK-7651) PySpark GMM predict, predictSoft should fail on bad input

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7651: --- Assignee: (was: Apache Spark) PySpark GMM predict, predictSoft should fail on bad input

[jira] [Commented] (SPARK-7651) PySpark GMM predict, predictSoft should fail on bad input

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545126#comment-14545126 ] Apache Spark commented on SPARK-7651: - User 'FlytxtRnD' has created a pull request for

[jira] [Assigned] (SPARK-7586) User guide update for spark.ml Word2Vec

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7586: --- Assignee: Apache Spark (was: Xusen Yin) User guide update for spark.ml Word2Vec

[jira] [Assigned] (SPARK-7586) User guide update for spark.ml Word2Vec

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7586: --- Assignee: Xusen Yin (was: Apache Spark) User guide update for spark.ml Word2Vec

[jira] [Commented] (SPARK-7586) User guide update for spark.ml Word2Vec

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545153#comment-14545153 ] Apache Spark commented on SPARK-7586: - User 'yinxusen' has created a pull request for

[jira] [Updated] (SPARK-7663) [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is zero

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7663: - Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) Yeah it should be an error in any

[jira] [Commented] (SPARK-7566) HiveContext.analyzer cannot be overriden

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545066#comment-14545066 ] Apache Spark commented on SPARK-7566: - User 'smola' has created a pull request for

[jira] [Created] (SPARK-7662) Exception of multi-attribute generator anlysis in projection

2015-05-15 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-7662: Summary: Exception of multi-attribute generator anlysis in projection Key: SPARK-7662 URL: https://issues.apache.org/jira/browse/SPARK-7662 Project: Spark Issue

[jira] [Updated] (SPARK-7663) [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is zero

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7663: - Fix Version/s: (was: 1.4.1) (Don't set Fix Version please:

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545006#comment-14545006 ] Josh Rosen commented on SPARK-4105: --- I've opened SPARK-7660 to track progress on the fix

[jira] [Updated] (SPARK-7654) DataFrameReader and DataFrameWriter for input/output API

2015-05-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7654: --- Summary: DataFrameReader and DataFrameWriter for input/output API (was: Create builder pattern for

[jira] [Updated] (SPARK-7654) DataFrameReader and DataFrameWriter for input/output API

2015-05-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7654: --- Description: We have a proliferation of save options now. It'd make more sense to have a builder

[jira] [Assigned] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7660: --- Assignee: Apache Spark Snappy-java buffer-sharing bug leads to data corruption / test

[jira] [Commented] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545031#comment-14545031 ] Apache Spark commented on SPARK-7660: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7660: --- Assignee: (was: Apache Spark) Snappy-java buffer-sharing bug leads to data corruption /

[jira] [Commented] (SPARK-7660) Snappy-java buffer-sharing bug leads to data corruption / test failures

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545014#comment-14545014 ] Josh Rosen commented on SPARK-7660: --- If we're wary of upgrading to a new Snappy version

[jira] [Created] (SPARK-7663) [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is zero

2015-05-15 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-7663: Summary: [MLLIB] feature.Word2Vec throws empty iterator error when the vocabulary size is zero Key: SPARK-7663 URL: https://issues.apache.org/jira/browse/SPARK-7663 Project:

[jira] [Assigned] (SPARK-7227) Support fillna / dropna in R DataFrame

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7227: --- Assignee: Apache Spark (was: Sun Rui) Support fillna / dropna in R DataFrame

[jira] [Assigned] (SPARK-7227) Support fillna / dropna in R DataFrame

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7227: --- Assignee: Sun Rui (was: Apache Spark) Support fillna / dropna in R DataFrame

[jira] [Commented] (SPARK-7227) Support fillna / dropna in R DataFrame

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545300#comment-14545300 ] Apache Spark commented on SPARK-7227: - User 'sun-rui' has created a pull request for

[jira] [Updated] (SPARK-7657) [YARN] Show driver link in Spark UI

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7657: - Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) [YARN] Show driver link in Spark

[jira] [Commented] (SPARK-6499) pyspark: printSchema command on a dataframe hangs

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545355#comment-14545355 ] Sean Owen commented on SPARK-6499: -- I can't reproduce this. Are you sure it still

[jira] [Updated] (SPARK-6399) Code compiled against 1.3.0 may not run against older Spark versions

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6399: - Issue Type: Improvement (was: Bug) Code compiled against 1.3.0 may not run against older Spark versions

[jira] [Updated] (SPARK-6287) Add support for dynamic allocation in the Mesos coarse-grained scheduler

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6287: - Issue Type: Improvement (was: Bug) Add support for dynamic allocation in the Mesos coarse-grained

[jira] [Updated] (SPARK-7336) Sometimes the status of finished job show on JobHistory UI will be active, and never update.

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7336: - Priority: Minor (was: Major) Sometimes the status of finished job show on JobHistory UI will be active,

[jira] [Resolved] (SPARK-6520) Kyro serialization broken in the shell

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6520. -- Resolution: Won't Fix Yes, I think this is a function of how {{:paste}}d code is evaluated and how

[jira] [Resolved] (SPARK-5711) Sort Shuffle performance issues about using AppendOnlyMap for large data sets

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5711. -- Resolution: Not A Problem I'm not sure this qualifies as a bug. You're just saying that processing a

[jira] [Commented] (SPARK-5412) Cannot bind Master to a specific hostname as per the documentation

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545371#comment-14545371 ] Sean Owen commented on SPARK-5412: -- A-ha. I think the issue is that additional args to

[jira] [Updated] (SPARK-7664) DAG visualization: Fix incorrect link paths of DAG.

2015-05-15 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-7664: -- Summary: DAG visualization: Fix incorrect link paths of DAG. (was: Fix incorrect link paths of

[jira] [Updated] (SPARK-7631) treenode argString should not print children

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7631: - Priority: Minor (was: Major) treenode argString should not print children

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Created] (SPARK-7666) MLlib Python doc parity check

2015-05-15 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-7666: -- Summary: MLlib Python doc parity check Key: SPARK-7666 URL: https://issues.apache.org/jira/browse/SPARK-7666 Project: Spark Issue Type: Documentation

[jira] [Updated] (SPARK-6399) Code compiled against 1.3.0 may not run against older Spark versions

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6399: - Priority: Minor (was: Major) Code compiled against 1.3.0 may not run against older Spark versions

[jira] [Updated] (SPARK-6355) Spark standalone cluster does not support local:/ url for jar file

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6355: - Priority: Minor (was: Major) Spark standalone cluster does not support local:/ url for jar file

[jira] [Commented] (SPARK-6415) Spark Streaming fail-fast: Stop scheduling jobs when a batch fails, and kills the app

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545356#comment-14545356 ] Sean Owen commented on SPARK-6415: -- Sort of related to

[jira] [Updated] (SPARK-6415) Spark Streaming fail-fast: Stop scheduling jobs when a batch fails, and kills the app

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6415: - Issue Type: Improvement (was: Bug) Spark Streaming fail-fast: Stop scheduling jobs when a batch fails,

[jira] [Resolved] (SPARK-6035) Unable to launch spark stream driver in cluster mode

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6035. -- Resolution: Not A Problem This looks like a problem specific to your setup on EC2. Something failed to

[jira] [Updated] (SPARK-6533) Allow using wildcard and other file pattern in Parquet DataSource

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6533: - Labels: (was: backport-needed) Allow using wildcard and other file pattern in Parquet DataSource

[jira] [Resolved] (SPARK-7476) Dynamic partitioning random behaviour

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7476. -- Resolution: Invalid I think this is at best a question for user@. I don't think this relates to

[jira] [Created] (SPARK-7665) MLlib Python API breaking changes check between 1.3 1.4

2015-05-15 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-7665: -- Summary: MLlib Python API breaking changes check between 1.3 1.4 Key: SPARK-7665 URL: https://issues.apache.org/jira/browse/SPARK-7665 Project: Spark Issue

[jira] [Commented] (SPARK-7063) Update lz4 for Java 7 to avoid: when lz4 compression is used, it causes core dump

2015-05-15 Thread Tim Ellison (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545368#comment-14545368 ] Tim Ellison commented on SPARK-7063: I can confirm that this failure is no longer seen

[jira] [Commented] (SPARK-7664) DAG visualization: Fix incorrect link paths of DAG.

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545305#comment-14545305 ] Apache Spark commented on SPARK-7664: - User 'sarutak' has created a pull request for

[jira] [Assigned] (SPARK-7664) DAG visualization: Fix incorrect link paths of DAG.

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7664: --- Assignee: (was: Apache Spark) DAG visualization: Fix incorrect link paths of DAG.

[jira] [Assigned] (SPARK-7664) DAG visualization: Fix incorrect link paths of DAG.

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7664: --- Assignee: Apache Spark DAG visualization: Fix incorrect link paths of DAG.

[jira] [Updated] (SPARK-6973) The total stages on the allJobsPage is wrong

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6973: - Component/s: (was: Spark Core) Web UI The total stages on the allJobsPage is wrong

[jira] [Updated] (SPARK-7603) Crash of thrift server when doing SQL without limit

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7603: - Component/s: (was: Spark Core) SQL Crash of thrift server when doing SQL without

[jira] [Updated] (SPARK-7042) Spark version of akka-actor_2.11 is not compatible with the official akka-actor_2.11 2.3.x

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7042: - Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) I think this is an Akka / Scala

[jira] [Updated] (SPARK-6973) The total stages on the allJobsPage is wrong

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6973: - Priority: Minor (was: Major) The total stages on the allJobsPage is wrong

[jira] [Commented] (SPARK-6056) Unlimit offHeap memory use cause RM killing the container

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545362#comment-14545362 ] Sean Owen commented on SPARK-6056: -- I can't make out whether this is an issue or not. Do

[jira] [Updated] (SPARK-7503) Resources in .sparkStaging directory can't be cleaned up on error

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7503: - Assignee: Kousuke Saruta Resources in .sparkStaging directory can't be cleaned up on error

[jira] [Resolved] (SPARK-7503) Resources in .sparkStaging directory can't be cleaned up on error

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7503. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6026

[jira] [Created] (SPARK-7664) Fix incorrect link paths of DAG.

2015-05-15 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-7664: - Summary: Fix incorrect link paths of DAG. Key: SPARK-7664 URL: https://issues.apache.org/jira/browse/SPARK-7664 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7344) Spark hangs reading and writing to the same S3 bucket

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545323#comment-14545323 ] Sean Owen commented on SPARK-7344: -- yes but the most recent script still runs with Hadoop

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-6527) sc.binaryFiles can not access files on s3

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6527: - Component/s: EC2 Priority: Minor (was: Major) Is there any more detail on this? like stack traces

[jira] [Commented] (SPARK-7042) Spark version of akka-actor_2.11 is not compatible with the official akka-actor_2.11 2.3.x

2015-05-15 Thread Konstantin Shaposhnikov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545375#comment-14545375 ] Konstantin Shaposhnikov commented on SPARK-7042: There is nothing wrong

[jira] [Resolved] (SPARK-5271) PySpark History Web UI issues

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5271. -- Resolution: Not A Problem PySpark History Web UI issues -

[jira] [Resolved] (SPARK-5265) Submitting applications on Standalone cluster controlled by Zookeeper forces to know active master

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5265. -- Resolution: Duplicate I think you described the same issue twice here; please close the old one if

[jira] [Resolved] (SPARK-5241) spark-ec2 spark init scripts do not handle all hadoop (or tachyon?) dependencies correctly

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5241. -- Resolution: Invalid I don't understand the problem being reported here. Reopen if you can suggest a

[jira] [Commented] (SPARK-4808) Spark fails to spill with small number of large objects

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545464#comment-14545464 ] Sean Owen commented on SPARK-4808: -- I think this is considered resolved now for 1.4 after

[jira] [Resolved] (SPARK-4560) Lambda deserialization error

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4560. -- Resolution: Not A Problem Lambda deserialization error

[jira] [Assigned] (SPARK-4556) binary distribution assembly can't run in local mode

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4556: --- Assignee: Apache Spark binary distribution assembly can't run in local mode

[jira] [Resolved] (SPARK-3602) Can't run cassandra_inputformat.py

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3602. -- Resolution: Not A Problem I think this is due to mismatching Hadoop libs, or at least is stale enough

[jira] [Commented] (SPARK-2445) MesosExecutorBackend crashes in fine grained mode

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545504#comment-14545504 ] Sean Owen commented on SPARK-2445: -- [~gbow...@fastmail.co.uk] are you saying that

[jira] [Resolved] (SPARK-1928) DAGScheduler suspended by local task OOM

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1928. -- Resolution: Fixed Fix Version/s: 1.1.0 Assignee: Peng Zhen Resolved long ago by

[jira] [Resolved] (SPARK-2133) FileNotFoundException in BlockObjectWriter

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2133. -- Resolution: Cannot Reproduce FileNotFoundException in BlockObjectWriter

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-05-15 Thread Guillaume E.B. (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545533#comment-14545533 ] Guillaume E.B. commented on SPARK-4105: --- I think I had the bug using another

[jira] [Commented] (SPARK-5220) keepPushingBlocks in BlockGenerator terminated when an exception occurs, which causes the block pushing thread to terminate and blocks receiver

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545456#comment-14545456 ] Sean Owen commented on SPARK-5220: -- [~superxma] is this resolved then?

[jira] [Assigned] (SPARK-5175) bug in updating counters when starting multiple workers/supervisors in actor-based receiver

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5175: --- Assignee: (was: Apache Spark) bug in updating counters when starting multiple

[jira] [Assigned] (SPARK-5174) Missing Document for starting multiple workers/supervisors in actor-based receiver

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5174: --- Assignee: (was: Apache Spark) Missing Document for starting multiple

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Created] (SPARK-7667) MLlib Python API consistency check

2015-05-15 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-7667: -- Summary: MLlib Python API consistency check Key: SPARK-7667 URL: https://issues.apache.org/jira/browse/SPARK-7667 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-4598) Paginate stage page to avoid OOM with 100,000 tasks

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4598: - Issue Type: Improvement (was: Bug) Paginate stage page to avoid OOM with 100,000 tasks

[jira] [Updated] (SPARK-1910) Add onBlockComplete API to receiver

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-1910: - Issue Type: Improvement (was: Bug) Add onBlockComplete API to receiver

[jira] [Updated] (SPARK-1107) Add shutdown hook on executor stop to stop running tasks

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-1107: - Issue Type: Improvement (was: Bug) We have a shutdown hook that stops the SparkContext, which is kind of

[jira] [Resolved] (SPARK-604) reconnect if mesos slaves dies

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-604. - Resolution: Cannot Reproduce Stale at this point, without similar findings recently. reconnect if mesos

[jira] [Commented] (SPARK-5331) Spark workers can't find tachyon master as spark-ec2 doesn't set spark.tachyonStore.url

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545441#comment-14545441 ] Sean Owen commented on SPARK-5331: -- [~florianverhein] is this an issue then or just a

[jira] [Resolved] (SPARK-5246) spark/spark-ec2.py cannot start Spark master in VPC if local DNS name does not resolve

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5246. -- Resolution: Done Assignee: Vladimir Grigor (Really, was fixed by a PR for mesos)

[jira] [Resolved] (SPARK-3942) LogisticRegressionWithLBFGS should not use SquaredL2Updater

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3942. -- Resolution: Won't Fix LogisticRegressionWithLBFGS should not use SquaredL2Updater

[jira] [Resolved] (SPARK-3967) Spark applications fail in yarn-cluster mode when the directories configured in yarn.nodemanager.local-dirs are located on different disks/partitions

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3967. -- Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Christophe Préaud Spark applications
