[jira] [Created] (SPARK-4933) eventLog file not found after merging into a single file

2014-12-23 Thread Zhang, Liye (JIRA)
Zhang, Liye created SPARK-4933: -- Summary: eventLog file not found after merging into a single file Key: SPARK-4933 URL: https://issues.apache.org/jira/browse/SPARK-4933 Project: Spark Issue

[jira] [Updated] (SPARK-4933) eventLog file not found after merging into a single file

2014-12-23 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhang, Liye updated SPARK-4933: --- Description: enent log file not found exception will be thrown after making eventLog into a single

[jira] [Updated] (SPARK-4933) eventLog file not found after merging into a single file

2014-12-23 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhang, Liye updated SPARK-4933: --- Description: enent log file not found exception will be thrown after making eventLog into a single

[jira] [Updated] (SPARK-4933) eventLog file not found after merging into a single file

2014-12-23 Thread Zhang, Liye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhang, Liye updated SPARK-4933: --- Description: enent log file not found exception will be thrown after making eventLog into a single

[jira] [Commented] (SPARK-4933) eventLog file not found after merging into a single file

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14256736#comment-14256736 ] Apache Spark commented on SPARK-4933: - User 'liyezhang556520' has created a pull

[jira] [Created] (SPARK-4934) Connection key is hard to read

2014-12-23 Thread Hong Shen (JIRA)
Hong Shen created SPARK-4934: Summary: Connection key is hard to read Key: SPARK-4934 URL: https://issues.apache.org/jira/browse/SPARK-4934 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4926) Spark manipulate Hbase

2014-12-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14256747#comment-14256747 ] Sean Owen commented on SPARK-4926: -- I think this is a question than an issue report, and

[jira] [Created] (SPARK-4935) When hive.cli.print.header configured, spark-sql aborted if passed in a invalid sql

2014-12-23 Thread wangfei (JIRA)
wangfei created SPARK-4935: -- Summary: When hive.cli.print.header configured, spark-sql aborted if passed in a invalid sql Key: SPARK-4935 URL: https://issues.apache.org/jira/browse/SPARK-4935 Project: Spark

[jira] [Commented] (SPARK-4935) When hive.cli.print.header configured, spark-sql aborted if passed in a invalid sql

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14256811#comment-14256811 ] Apache Spark commented on SPARK-4935: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-4846) When the vocabulary size is large, Word2Vec may yield OutOfMemoryError: Requested array size exceeds VM limit

2014-12-23 Thread Joseph Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14256852#comment-14256852 ] Joseph Tang commented on SPARK-4846: It sounds accomplishable. I'll try this and make

[jira] [Created] (SPARK-4936) Please support Named Vector so as to maintain the record ID in clustering etc.

2014-12-23 Thread mahesh bhole (JIRA)
mahesh bhole created SPARK-4936: --- Summary: Please support Named Vector so as to maintain the record ID in clustering etc. Key: SPARK-4936 URL: https://issues.apache.org/jira/browse/SPARK-4936 Project:

[jira] [Created] (SPARK-4937) Adding optimization to simplify the filter condition

2014-12-23 Thread wangfei (JIRA)
wangfei created SPARK-4937: -- Summary: Adding optimization to simplify the filter condition Key: SPARK-4937 URL: https://issues.apache.org/jira/browse/SPARK-4937 Project: Spark Issue Type:

[jira] [Created] (SPARK-4938) Adding optimization to simplify the filter condition

2014-12-23 Thread wangfei (JIRA)
wangfei created SPARK-4938: -- Summary: Adding optimization to simplify the filter condition Key: SPARK-4938 URL: https://issues.apache.org/jira/browse/SPARK-4938 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4938) Adding optimization to simplify the filter condition

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14256964#comment-14256964 ] Apache Spark commented on SPARK-4938: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-4936) Please support Named Vector so as to maintain the record ID in clustering etc.

2014-12-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14256985#comment-14256985 ] Sean Owen commented on SPARK-4936: -- Are you referring to the NamedVector idea from

[jira] [Commented] (SPARK-4937) Adding optimization to simplify the filter condition

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257037#comment-14257037 ] Apache Spark commented on SPARK-4937: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-4938) Adding optimization to simplify the filter condition

2014-12-23 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257042#comment-14257042 ] wangfei commented on SPARK-4938: Duplicate Adding optimization to simplify the filter

[jira] [Resolved] (SPARK-4938) Adding optimization to simplify the filter condition

2014-12-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4938. -- Resolution: Duplicate Fix Version/s: (was: 1.3.0) Target Version/s: (was: 1.3.0)

[jira] [Commented] (SPARK-4585) Spark dynamic scaling executors use upper limit value as default.

2014-12-23 Thread Brock Noland (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257069#comment-14257069 ] Brock Noland commented on SPARK-4585: - [~chengxiang li] has done some testing of HoS

[jira] [Commented] (SPARK-4820) Spark build encounters File name too long on some encrypted filesystems

2014-12-23 Thread Iljya Kalai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257116#comment-14257116 ] Iljya Kalai commented on SPARK-4820: Thanks for creating this issue, and thanks to

[jira] [Created] (SPARK-4939) Python updateStateByKey example hang in local mode

2014-12-23 Thread Davies Liu (JIRA)
Davies Liu created SPARK-4939: - Summary: Python updateStateByKey example hang in local mode Key: SPARK-4939 URL: https://issues.apache.org/jira/browse/SPARK-4939 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4160) Standalone cluster mode does not upload all needed jars to driver node

2014-12-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257300#comment-14257300 ] Marcelo Vanzin commented on SPARK-4160: --- [~gst] if that's the casa it would be a

[jira] [Updated] (SPARK-4160) Standalone cluster mode does not upload all needed jars to driver node

2014-12-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-4160: -- Issue Type: Improvement (was: Bug) Standalone cluster mode does not upload all needed jars to

[jira] [Comment Edited] (SPARK-4160) Standalone cluster mode does not upload all needed jars to driver node

2014-12-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257300#comment-14257300 ] Marcelo Vanzin edited comment on SPARK-4160 at 12/23/14 6:23 PM:

[jira] [Created] (SPARK-4940) Document or Support more evenly distributing cores for Mesos mode

2014-12-23 Thread Timothy Chen (JIRA)
Timothy Chen created SPARK-4940: --- Summary: Document or Support more evenly distributing cores for Mesos mode Key: SPARK-4940 URL: https://issues.apache.org/jira/browse/SPARK-4940 Project: Spark

[jira] [Commented] (SPARK-4325) Improve spark-ec2 cluster launch times

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257376#comment-14257376 ] Josh Rosen commented on SPARK-4325: --- [~nchammas] - Yeah, I usually try for a one-to-one

[jira] [Commented] (SPARK-4241) spark_ec2.py support China AWS region: cn-north-1

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257374#comment-14257374 ] Josh Rosen commented on SPARK-4241: --- [~nchammas] I linked SPARK-4890 as a blocker for

[jira] [Updated] (SPARK-4931) Fix the messy format about log4j in running-on-yarn.md

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4931: -- Assignee: Shixiong Zhu Fix the messy format about log4j in running-on-yarn.md

[jira] [Resolved] (SPARK-4931) Fix the messy format about log4j in running-on-yarn.md

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4931. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Fix the messy format about

[jira] [Commented] (SPARK-4160) Standalone cluster mode does not upload all needed jars to driver node

2014-12-23 Thread Gurpreet Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257394#comment-14257394 ] Gurpreet Singh commented on SPARK-4160: --- Hi Marcelo, Will open a separate JIRA for

[jira] [Created] (SPARK-4941) Yarn cluster mode does not upload all needed jars to driver node (Spark 1.2.0)

2014-12-23 Thread Gurpreet Singh (JIRA)
Gurpreet Singh created SPARK-4941: - Summary: Yarn cluster mode does not upload all needed jars to driver node (Spark 1.2.0) Key: SPARK-4941 URL: https://issues.apache.org/jira/browse/SPARK-4941

[jira] [Updated] (SPARK-4834) Spark fails to clean up cache / lock files in local dirs

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4834: -- Assignee: Marcelo Vanzin Spark fails to clean up cache / lock files in local dirs

[jira] [Resolved] (SPARK-4834) Spark fails to clean up cache / lock files in local dirs

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4834. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull request

[jira] [Commented] (SPARK-4939) Python updateStateByKey example hang in local mode

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257444#comment-14257444 ] Apache Spark commented on SPARK-4939: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2014-12-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4766: - Description: Currently, in spark.ml, both Transformers and Estimators extend the same

[jira] [Resolved] (SPARK-4932) Add help comments in Analytics

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4932. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Assignee: Takeshi

[jira] [Commented] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2014-12-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257495#comment-14257495 ] Joseph K. Bradley commented on SPARK-4766: -- This will require modifying

[jira] [Assigned] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2014-12-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-4766: Assignee: Joseph K. Bradley ML Estimator Params should subclass Transformer

[jira] [Created] (SPARK-4942) ML Transformers should allow output cols to be turned on,off

2014-12-23 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-4942: Summary: ML Transformers should allow output cols to be turned on,off Key: SPARK-4942 URL: https://issues.apache.org/jira/browse/SPARK-4942 Project: Spark

[jira] [Updated] (SPARK-4942) ML Transformers should allow output cols to be turned on,off

2014-12-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4942: - Description: ML Transformers will eventually output multiple columns (e.g., predicted

[jira] [Updated] (SPARK-4914) Two sets of datanucleus versions left in lib_managed after running dev/run-tests

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4914: -- Assignee: Cheng Lian Two sets of datanucleus versions left in lib_managed after running

[jira] [Resolved] (SPARK-4914) Two sets of datanucleus versions left in lib_managed after running dev/run-tests

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4914. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull request

[jira] [Resolved] (SPARK-4730) Warn against deprecated YARN settings

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4730. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull request

[jira] [Resolved] (SPARK-4933) eventLog file not found after merging into a single file

2014-12-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-4933. --- Resolution: Duplicate eventLog file not found after merging into a single file

[jira] [Updated] (SPARK-4913) Fix incorrect event log path

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4913: -- Affects Version/s: 1.3.0 Fix incorrect event log path

[jira] [Commented] (SPARK-4314) Exception when textFileStream attempts to read deleted _COPYING_ file

2014-12-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257648#comment-14257648 ] Tathagata Das commented on SPARK-4314: -- Yes, doing -put is the wrong way to upload

[jira] [Resolved] (SPARK-4314) Exception when textFileStream attempts to read deleted _COPYING_ file

2014-12-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4314. -- Resolution: Invalid Exception when textFileStream attempts to read deleted _COPYING_ file

[jira] [Updated] (SPARK-4913) Fix incorrect event log path

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4913: -- Assignee: Liang-Chi Hsieh Fix incorrect event log path

[jira] [Resolved] (SPARK-4913) Fix incorrect event log path

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4913. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3755

[jira] [Commented] (SPARK-4802) ReceiverInfo removal at ReceiverTracker upon deregistering receiver

2014-12-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257668#comment-14257668 ] Tathagata Das commented on SPARK-4802: -- SPARK-2892 is not a duplicate of this though

[jira] [Resolved] (SPARK-4802) ReceiverInfo removal at ReceiverTracker upon deregistering receiver

2014-12-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4802. -- Resolution: Fixed Fix Version/s: 1.2.1 1.1.2 1.3.0

[jira] [Resolved] (SPARK-4671) Streaming block need not to replicate 2 copies when WAL is enabled

2014-12-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4671. -- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Streaming block need

[jira] [Resolved] (SPARK-4606) SparkSubmitDriverBootstrapper does not propagate EOF to child JVM

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4606. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 1.1.2 Issue

[jira] [Updated] (SPARK-4606) SparkSubmitDriverBootstrapper does not propagate EOF to child JVM

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4606: -- Assignee: Marcelo Vanzin SparkSubmitDriverBootstrapper does not propagate EOF to child JVM

[jira] [Updated] (SPARK-4606) SparkSubmitDriverBootstrapper does not propagate EOF to child JVM

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4606: -- Affects Version/s: 1.1.1 SparkSubmitDriverBootstrapper does not propagate EOF to child JVM

[jira] [Commented] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257728#comment-14257728 ] Tathagata Das commented on SPARK-4817: -- I agree with [~srowen] point. 1. Updating

[jira] [Comment Edited] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257728#comment-14257728 ] Tathagata Das edited comment on SPARK-4817 at 12/24/14 12:36 AM:

[jira] [Created] (SPARK-4943) Parsing error for query with table name having dot

2014-12-23 Thread Alex Liu (JIRA)
Alex Liu created SPARK-4943: --- Summary: Parsing error for query with table name having dot Key: SPARK-4943 URL: https://issues.apache.org/jira/browse/SPARK-4943 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-4944) Table Not Found exception in Create Table Like registered RDD table

2014-12-23 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-4944: Summary: Table Not Found exception in Create Table Like registered RDD table Key: SPARK-4944 URL: https://issues.apache.org/jira/browse/SPARK-4944 Project: Spark

[jira] [Resolved] (SPARK-4860) Improve performance of sample() and takeSample() on SchemaRDD

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4860. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3764

[jira] [Updated] (SPARK-4860) Improve performance of sample() and takeSample() on SchemaRDD

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4860: -- Assignee: Ben Cook Improve performance of sample() and takeSample() on SchemaRDD

[jira] [Commented] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2014-12-23 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257855#comment-14257855 ] Stephen Haberman commented on SPARK-4877: - FWIW two reviewers have okay'd this PR;

[jira] [Commented] (SPARK-4606) SparkSubmitDriverBootstrapper does not propagate EOF to child JVM

2014-12-23 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257860#comment-14257860 ] Stephen Haberman commented on SPARK-4606: - [~vanzin] since you're poking around in

[jira] [Commented] (SPARK-4704) SparkSubmitDriverBootstrap doesn't flush output

2014-12-23 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257859#comment-14257859 ] Stephen Haberman commented on SPARK-4704: - That that PR 3655 is for a separate

[jira] [Comment Edited] (SPARK-4704) SparkSubmitDriverBootstrap doesn't flush output

2014-12-23 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257859#comment-14257859 ] Stephen Haberman edited comment on SPARK-4704 at 12/24/14 2:43 AM:

[jira] [Issue Comment Deleted] (SPARK-4704) SparkSubmitDriverBootstrap doesn't flush output

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4704: -- Comment: was deleted (was: User 'harishreedharan' has created a pull request for this issue:

[jira] [Commented] (SPARK-4704) SparkSubmitDriverBootstrap doesn't flush output

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257861#comment-14257861 ] Josh Rosen commented on SPARK-4704: --- I've removed the link / comment to the unrelated

[jira] [Commented] (SPARK-4606) SparkSubmitDriverBootstrapper does not propagate EOF to child JVM

2014-12-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257875#comment-14257875 ] Marcelo Vanzin commented on SPARK-4606: --- @stephen hmm, I'm working on some other

[jira] [Commented] (SPARK-4606) SparkSubmitDriverBootstrapper does not propagate EOF to child JVM

2014-12-23 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257877#comment-14257877 ] Stephen Haberman commented on SPARK-4606: - Cool, that sounds great; thanks,

[jira] [Created] (SPARK-4945) Add overwrite option support for SchemaRDD.saveAsParquetFile

2014-12-23 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-4945: Summary: Add overwrite option support for SchemaRDD.saveAsParquetFile Key: SPARK-4945 URL: https://issues.apache.org/jira/browse/SPARK-4945 Project: Spark Issue

[jira] [Commented] (SPARK-4945) Add overwrite option support for SchemaRDD.saveAsParquetFile

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257885#comment-14257885 ] Apache Spark commented on SPARK-4945: - User 'chenghao-intel' has created a pull

[jira] [Updated] (SPARK-4881) Use SparkConf#getBoolean instead of get().toBoolean

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4881: -- Assignee: Kousuke Saruta Use SparkConf#getBoolean instead of get().toBoolean

[jira] [Resolved] (SPARK-4881) Use SparkConf#getBoolean instead of get().toBoolean

2014-12-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4881. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3733

[jira] [Commented] (SPARK-3619) Upgrade to Mesos 0.21 to work around MESOS-1688

2014-12-23 Thread Jongyoul Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257935#comment-14257935 ] Jongyoul Lee commented on SPARK-3619: - [~tnachen] I'm trying to test mesos 0.21 in my

[jira] [Created] (SPARK-4947) Use EC2 status checks to know when to test SSH availability

2014-12-23 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-4947: --- Summary: Use EC2 status checks to know when to test SSH availability Key: SPARK-4947 URL: https://issues.apache.org/jira/browse/SPARK-4947 Project: Spark

[jira] [Resolved] (SPARK-4947) Use EC2 status checks to know when to test SSH availability

2014-12-23 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-4947. - Resolution: Fixed Resolved in [#3195|https://github.com/apache/spark/pull/3195]. Use

[jira] [Commented] (SPARK-4325) Improve spark-ec2 cluster launch times

2014-12-23 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257939#comment-14257939 ] Nicholas Chammas commented on SPARK-4325: - OK, I created a

[jira] [Commented] (SPARK-4921) Performance issue caused by TaskSetManager returning PROCESS_LOCAL for NO_PREF tasks

2014-12-23 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257940#comment-14257940 ] Sandy Ryza commented on SPARK-4921: --- Is there a barebones Spark program that I could use

[jira] [Created] (SPARK-4948) Use pssh instead of bash-isms and remove unnecessary operations

2014-12-23 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-4948: --- Summary: Use pssh instead of bash-isms and remove unnecessary operations Key: SPARK-4948 URL: https://issues.apache.org/jira/browse/SPARK-4948 Project: Spark

[jira] [Commented] (SPARK-4936) Please support Named Vector so as to maintain the record ID in clustering etc.

2014-12-23 Thread mahesh bhole (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257944#comment-14257944 ] mahesh bhole commented on SPARK-4936: - Thanks sean..I missed that part.. There is

[jira] [Created] (SPARK-4949) shutdownCallback in SparkDeploySchedulerBackend should be enclosed by synchronized block.

2014-12-23 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4949: - Summary: shutdownCallback in SparkDeploySchedulerBackend should be enclosed by synchronized block. Key: SPARK-4949 URL: https://issues.apache.org/jira/browse/SPARK-4949

[jira] [Commented] (SPARK-4949) shutdownCallback in SparkDeploySchedulerBackend should be enclosed by synchronized block.

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257949#comment-14257949 ] Apache Spark commented on SPARK-4949: - User 'sarutak' has created a pull request for

[jira] [Created] (SPARK-4950) Delete obsolete mapReduceTripelets used in Pregel

2014-12-23 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-4950: --- Summary: Delete obsolete mapReduceTripelets used in Pregel Key: SPARK-4950 URL: https://issues.apache.org/jira/browse/SPARK-4950 Project: Spark Issue

[jira] [Commented] (SPARK-4950) Delete obsolete mapReduceTripelets used in Pregel

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257960#comment-14257960 ] Apache Spark commented on SPARK-4950: - User 'maropu' has created a pull request for

[jira] [Updated] (SPARK-4950) Delete obsolete mapReduceTripelets used in Pregel

2014-12-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-4950: Environment: (was: Any reason not to replace the api along with SPARK-3936?) Delete

[jira] [Commented] (SPARK-4950) Delete obsolete mapReduceTripelets used in Pregel

2014-12-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257961#comment-14257961 ] Takeshi Yamamuro commented on SPARK-4950: - Any reason not to replace the api along

[jira] [Created] (SPARK-4951) A busy executor may be killed when dynamicAllocation is enabled

2014-12-23 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-4951: --- Summary: A busy executor may be killed when dynamicAllocation is enabled Key: SPARK-4951 URL: https://issues.apache.org/jira/browse/SPARK-4951 Project: Spark

[jira] [Commented] (SPARK-4951) A busy executor may be killed when dynamicAllocation is enabled

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257986#comment-14257986 ] Apache Spark commented on SPARK-4951: - User 'zsxwing' has created a pull request for

[jira] [Closed] (SPARK-4936) Please support Named Vector so as to maintain the record ID in clustering etc.

2014-12-23 Thread mahesh bhole (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] mahesh bhole closed SPARK-4936. --- Resolution: Fixed RDD of (Identifier,Vector) is laready available. e.g. JavaPairRDD for Java

[jira] [Comment Edited] (SPARK-4936) Please support Named Vector so as to maintain the record ID in clustering etc.

2014-12-23 Thread mahesh bhole (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14257995#comment-14257995 ] mahesh bhole edited comment on SPARK-4936 at 12/24/14 5:43 AM:

[jira] [Commented] (SPARK-4937) Adding optimization to simplify the filter condition

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14258001#comment-14258001 ] Apache Spark commented on SPARK-4937: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-4946) Using AkkaUtils.askWithReply in MapOutputTracker.askTracker to reduce the chance of the communicating problem

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14258028#comment-14258028 ] Apache Spark commented on SPARK-4946: - User 'YanTangZhai' has created a pull request

[jira] [Created] (SPARK-4952) In some cases, spark on yarn failed to start

2014-12-23 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-4952: -- Summary: In some cases, spark on yarn failed to start Key: SPARK-4952 URL: https://issues.apache.org/jira/browse/SPARK-4952 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4723) To abort the stages which have attempted some times

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14258050#comment-14258050 ] Apache Spark commented on SPARK-4723: - User 'YanTangZhai' has created a pull request

[jira] [Created] (SPARK-4953) Fix the description of building Spark with YARN

2014-12-23 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4953: - Summary: Fix the description of building Spark with YARN Key: SPARK-4953 URL: https://issues.apache.org/jira/browse/SPARK-4953 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4953) Fix the description of building Spark with YARN

2014-12-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14258057#comment-14258057 ] Apache Spark commented on SPARK-4953: - User 'sarutak' has created a pull request for