[jira] [Commented] (SPARK-4715) ShuffleMemoryManager.tryToAcquire may return a negative value

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232738#comment-14232738 ] Apache Spark commented on SPARK-4715: - User 'zsxwing' has created a pull request for

[jira] [Created] (SPARK-4720) Remainder should also return null if the divider is 0.

2014-12-03 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-4720: Summary: Remainder should also return null if the divider is 0. Key: SPARK-4720 URL: https://issues.apache.org/jira/browse/SPARK-4720 Project: Spark Issue

[jira] [Commented] (SPARK-4397) Reorganize 'implicit's to improve the API convenience

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232840#comment-14232840 ] Apache Spark commented on SPARK-4397: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-4694) Long-run user thread(such as HiveThriftServer2) causes the 'process leak' in yarn-client mode

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232778#comment-14232778 ] Apache Spark commented on SPARK-4694: - User 'SaintBacchus' has created a pull request

[jira] [Resolved] (SPARK-3391) Support attaching more than 1 EBS volumes

2014-12-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-3391. Resolution: Fixed Fix Version/s: 1.2.0 This was merged. Support attaching more than 1 EBS

[jira] [Commented] (SPARK-3553) Spark Streaming app streams files that have already been streamed in an endless loop

2014-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232802#comment-14232802 ] Micael Capitão commented on SPARK-3553: --- I confirm the weird behaviour running in

[jira] [Commented] (SPARK-4718) spark-ec2 script creates empty spark folder

2014-12-03 Thread Ignacio Blasco Lopez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232795#comment-14232795 ] Ignacio Blasco Lopez commented on SPARK-4718: - Tested with

[jira] [Commented] (SPARK-4714) Checking block is null or not after having gotten info.lock in remove block method

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232717#comment-14232717 ] Apache Spark commented on SPARK-4714: - User 'suyanNone' has created a pull request for

[jira] [Issue Comment Deleted] (SPARK-3553) Spark Streaming app streams files that have already been streamed in an endless loop

2014-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micael Capitão updated SPARK-3553: -- Comment: was deleted (was: I confirm the weird behaviour running in HDFS too. I have the Spark

[jira] [Commented] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-03 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232734#comment-14232734 ] Ankur Dave commented on SPARK-4672: --- [~jerrylead] Thanks for investigating this bug and

[jira] [Updated] (SPARK-2456) Scheduler refactoring

2014-12-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2456: --- Assignee: (was: Reynold Xin) Scheduler refactoring - Key:

[jira] [Created] (SPARK-4718) spark-ec2 script creates empty spark folder

2014-12-03 Thread Ignacio Blasco Lopez (JIRA)
Ignacio Blasco Lopez created SPARK-4718: --- Summary: spark-ec2 script creates empty spark folder Key: SPARK-4718 URL: https://issues.apache.org/jira/browse/SPARK-4718 Project: Spark

[jira] [Commented] (SPARK-4720) Remainder should also return null if the divider is 0.

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232848#comment-14232848 ] Apache Spark commented on SPARK-4720: - User 'ueshin' has created a pull request for

[jira] [Commented] (SPARK-4719) Consolidate various narrow dep RDD classes with MapPartitionsRDD

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232804#comment-14232804 ] Apache Spark commented on SPARK-4719: - User 'rxin' has created a pull request for this

[jira] [Created] (SPARK-4716) Avoid shuffle when all-to-all operation has single input and output partition

2014-12-03 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-4716: - Summary: Avoid shuffle when all-to-all operation has single input and output partition Key: SPARK-4716 URL: https://issues.apache.org/jira/browse/SPARK-4716 Project: Spark

[jira] [Commented] (SPARK-4717) Optimize BLAS library to avoid de-reference multiple times in loop

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232783#comment-14232783 ] Apache Spark commented on SPARK-4717: - User 'dbtsai' has created a pull request for

[jira] [Resolved] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-03 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-4672. --- Resolution: Fixed Issue resolved by pull request 3545 [https://github.com/apache/spark/pull/3545]

[jira] [Updated] (SPARK-2253) Disable partial aggregation automatically when reduction factor is low

2014-12-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2253: --- Fix Version/s: 1.3.0 Disable partial aggregation automatically when reduction factor is low

[jira] [Commented] (SPARK-3638) Commons HTTP client dependency conflict in extras/kinesis-asl module

2014-12-03 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232719#comment-14232719 ] Aniket Bhatnagar commented on SPARK-3638: - Yes. You may want to open another JIRA

[jira] [Created] (SPARK-4717) Optimize BLAS library to avoid de-reference multiple times in loop

2014-12-03 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4717: -- Summary: Optimize BLAS library to avoid de-reference multiple times in loop Key: SPARK-4717 URL: https://issues.apache.org/jira/browse/SPARK-4717 Project: Spark Issue

[jira] [Commented] (SPARK-4694) Long-run user thread(such as HiveThriftServer2) causes the 'process leak' in yarn-client mode

2014-12-03 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232724#comment-14232724 ] SaintBacchus commented on SPARK-4694: - Thanks for reply. [~vanzin] the problem is very

[jira] [Commented] (SPARK-3553) Spark Streaming app streams files that have already been streamed in an endless loop

2014-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232801#comment-14232801 ] Micael Capitão commented on SPARK-3553: --- I confirm the weird behaviour running in

[jira] [Updated] (SPARK-2253) Disable partial aggregation automatically when reduction factor is low

2014-12-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2253: --- Assignee: (was: Reynold Xin) Disable partial aggregation automatically when reduction factor is

[jira] [Commented] (SPARK-4085) Job will fail if a shuffle file that's read locally gets deleted

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232839#comment-14232839 ] Apache Spark commented on SPARK-4085: - User 'rxin' has created a pull request for this

[jira] [Created] (SPARK-4715) ShuffleMemoryManager.tryToAcquire may return a negative value

2014-12-03 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-4715: --- Summary: ShuffleMemoryManager.tryToAcquire may return a negative value Key: SPARK-4715 URL: https://issues.apache.org/jira/browse/SPARK-4715 Project: Spark

[jira] [Commented] (SPARK-4672) Cut off the super long serialization chain in GraphX to avoid the StackOverflow error

2014-12-03 Thread Lijie Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232845#comment-14232845 ] Lijie Xu commented on SPARK-4672: - Thank you [~ankurdave]. Yes, the StackOverflow error

[jira] [Created] (SPARK-4719) Consolidate various narrow dep RDD classes with MapPartitionsRDD

2014-12-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-4719: -- Summary: Consolidate various narrow dep RDD classes with MapPartitionsRDD Key: SPARK-4719 URL: https://issues.apache.org/jira/browse/SPARK-4719 Project: Spark

[jira] [Resolved] (SPARK-4710) Fix MLlib compilation warnings

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4710. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3568

[jira] [Updated] (SPARK-4710) Fix MLlib compilation warnings

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4710: - Assignee: Joseph K. Bradley Fix MLlib compilation warnings --

[jira] [Updated] (SPARK-4708) Make k-mean runs two/three times faster with dense/sparse sample

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4708: - Assignee: DB Tsai Make k-mean runs two/three times faster with dense/sparse sample

[jira] [Resolved] (SPARK-4708) Make k-mean runs two/three times faster with dense/sparse sample

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4708. -- Resolution: Implemented Fix Version/s: 1.2.0 Target Version/s: 1.2.0 Make

[jira] [Created] (SPARK-4721) Improve first thread to put block failed

2014-12-03 Thread SuYan (JIRA)
SuYan created SPARK-4721: Summary: Improve first thread to put block failed Key: SPARK-4721 URL: https://issues.apache.org/jira/browse/SPARK-4721 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4721) Improve first thread to put block failed

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232889#comment-14232889 ] Apache Spark commented on SPARK-4721: - User 'suyanNone' has created a pull request for

[jira] [Created] (SPARK-4722) StreamingLinearRegression should return a DStream of weights when calling trainOn

2014-12-03 Thread Arthur Andres (JIRA)
Arthur Andres created SPARK-4722: Summary: StreamingLinearRegression should return a DStream of weights when calling trainOn Key: SPARK-4722 URL: https://issues.apache.org/jira/browse/SPARK-4722

[jira] [Commented] (SPARK-4722) StreamingLinearRegression should return a DStream of weights when calling trainOn

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232920#comment-14232920 ] Xiangrui Meng commented on SPARK-4722: -- [~Arthur][ You can use

[jira] [Commented] (SPARK-4722) StreamingLinearRegression should return a DStream of weights when calling trainOn

2014-12-03 Thread Arthur Andres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14232929#comment-14232929 ] Arthur Andres commented on SPARK-4722: -- I understand your point about it being heavy

[jira] [Created] (SPARK-4723) To abort the stages which have attempted some times

2014-12-03 Thread YanTang Zhai (JIRA)
YanTang Zhai created SPARK-4723: --- Summary: To abort the stages which have attempted some times Key: SPARK-4723 URL: https://issues.apache.org/jira/browse/SPARK-4723 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4710) Fix MLlib compilation warnings

2014-12-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233007#comment-14233007 ] Sean Owen commented on SPARK-4710: -- (PS, I have a JIRA/PR to clean up most all of the

[jira] [Created] (SPARK-4724) JavaNetworkWordCount.java has a wrong import

2014-12-03 Thread JIRA
Emre Sevinç created SPARK-4724: -- Summary: JavaNetworkWordCount.java has a wrong import Key: SPARK-4724 URL: https://issues.apache.org/jira/browse/SPARK-4724 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233031#comment-14233031 ] Yana Kadiyska commented on SPARK-4702: -- I'm investigating the possibility that this

[jira] [Created] (SPARK-4725) Re-think custom shuffle serializers for vertex messages

2014-12-03 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-4725: --- Summary: Re-think custom shuffle serializers for vertex messages Key: SPARK-4725 URL: https://issues.apache.org/jira/browse/SPARK-4725 Project: Spark

[jira] [Commented] (SPARK-4724) JavaNetworkWordCount.java has a wrong import

2014-12-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233034#comment-14233034 ] Sean Owen commented on SPARK-4724: -- No, this is correct. {{Durations}} is an object, with

[jira] [Resolved] (SPARK-4717) Optimize BLAS library to avoid de-reference multiple times in loop

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4717. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3577

[jira] [Updated] (SPARK-4717) Optimize BLAS library to avoid de-reference multiple times in loop

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4717: - Assignee: DB Tsai Optimize BLAS library to avoid de-reference multiple times in loop

[jira] [Commented] (SPARK-4724) JavaNetworkWordCount.java has a wrong import

2014-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233053#comment-14233053 ] Emre Sevinç commented on SPARK-4724: Then how do I import

[jira] [Commented] (SPARK-4001) Add Apriori algorithm to Spark MLlib

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233067#comment-14233067 ] Xiangrui Meng commented on SPARK-4001: -- [~jackylk] Could you share some performance

[jira] [Commented] (SPARK-4710) Fix MLlib compilation warnings

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233071#comment-14233071 ] Xiangrui Meng commented on SPARK-4710: -- [~srowen] Sorry I didn't see your PR. Since

[jira] [Commented] (SPARK-4710) Fix MLlib compilation warnings

2014-12-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233075#comment-14233075 ] Sean Owen commented on SPARK-4710: -- [~mengxr] It looks like it didn't overlap -- maybe

[jira] [Commented] (SPARK-4724) JavaNetworkWordCount.java has a wrong import

2014-12-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233078#comment-14233078 ] Sean Owen commented on SPARK-4724: -- I believe the class is new in 1.2. You are looking at

[jira] [Commented] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233081#comment-14233081 ] Yana Kadiyska commented on SPARK-4702: -- Unfortunately I still see this error after

[jira] [Commented] (SPARK-4724) JavaNetworkWordCount.java has a wrong import

2014-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233100#comment-14233100 ] Emre Sevinç commented on SPARK-4724: OK, now I see. What confused me was the word

[jira] [Created] (SPARK-4726) NotSerializableException thrown on SystemDefaultHttpClient with stack not related to my functions

2014-12-03 Thread Dmitriy Makarenko (JIRA)
Dmitriy Makarenko created SPARK-4726: Summary: NotSerializableException thrown on SystemDefaultHttpClient with stack not related to my functions Key: SPARK-4726 URL:

[jira] [Resolved] (SPARK-4724) JavaNetworkWordCount.java has a wrong import

2014-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emre Sevinç resolved SPARK-4724. Resolution: Not a Problem Not a Problem. It was a misunderstanding on my side.

[jira] [Closed] (SPARK-4724) JavaNetworkWordCount.java has a wrong import

2014-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emre Sevinç closed SPARK-4724. -- JavaNetworkWordCount.java has a wrong import

[jira] [Commented] (SPARK-4156) Add expectation maximization for Gaussian mixture models to MLLib clustering

2014-12-03 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233107#comment-14233107 ] Travis Galoppo commented on SPARK-4156: --- I have modified the cluster initialization

[jira] [Commented] (SPARK-4690) AppendOnlyMap seems not using Quadratic probing as the JavaDoc

2014-12-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233124#comment-14233124 ] Sean Owen commented on SPARK-4690: -- No, it is using quadratic probing. It adds {{delta}}

[jira] [Comment Edited] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233081#comment-14233081 ] Yana Kadiyska edited comment on SPARK-4702 at 12/3/14 3:53 PM:

[jira] [Created] (SPARK-4727) Add dimensional RDDs (time series, spatial)

2014-12-03 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-4727: - Summary: Add dimensional RDDs (time series, spatial) Key: SPARK-4727 URL: https://issues.apache.org/jira/browse/SPARK-4727 Project: Spark Issue Type:

[jira] [Created] (SPARK-4728) Add exponential, log normal, and gamma distributions to data generator to MLlib

2014-12-03 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-4728: - Summary: Add exponential, log normal, and gamma distributions to data generator to MLlib Key: SPARK-4728 URL: https://issues.apache.org/jira/browse/SPARK-4728 Project:

[jira] [Created] (SPARK-4729) Add time series subsampling to MLlib

2014-12-03 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-4729: - Summary: Add time series subsampling to MLlib Key: SPARK-4729 URL: https://issues.apache.org/jira/browse/SPARK-4729 Project: Spark Issue Type: New Feature

[jira] [Comment Edited] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233081#comment-14233081 ] Yana Kadiyska edited comment on SPARK-4702 at 12/3/14 6:09 PM:

[jira] [Assigned] (SPARK-4552) query for empty parquet table in spark sql hive get IllegalArgumentException

2014-12-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-4552: --- Assignee: Michael Armbrust query for empty parquet table in spark sql hive get

[jira] [Updated] (SPARK-4552) query for empty parquet table in spark sql hive get IllegalArgumentException

2014-12-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4552: Priority: Blocker (was: Major) query for empty parquet table in spark sql hive get

[jira] [Commented] (SPARK-4552) query for empty parquet table in spark sql hive get IllegalArgumentException

2014-12-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233314#comment-14233314 ] Michael Armbrust commented on SPARK-4552: - It turns out this manifests also when

[jira] [Resolved] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4702. - Resolution: Duplicate Querying non-existent partition produces exception in v1.2.0-rc1

[jira] [Commented] (SPARK-4694) Long-run user thread(such as HiveThriftServer2) causes the 'process leak' in yarn-client mode

2014-12-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233321#comment-14233321 ] Marcelo Vanzin commented on SPARK-4694: --- To answer your question, you can call

[jira] [Commented] (SPARK-4690) AppendOnlyMap seems not using Quadratic probing as the JavaDoc

2014-12-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1429#comment-1429 ] Matei Zaharia commented on SPARK-4690: -- Yup, that's the definition of it.

[jira] [Closed] (SPARK-4690) AppendOnlyMap seems not using Quadratic probing as the JavaDoc

2014-12-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia closed SPARK-4690. Resolution: Invalid AppendOnlyMap seems not using Quadratic probing as the JavaDoc

[jira] [Commented] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233344#comment-14233344 ] Michael Armbrust commented on SPARK-4702: - Thanks for reporting. As a workaround

[jira] [Commented] (SPARK-4697) System properties should override environment variables

2014-12-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233381#comment-14233381 ] Andrew Or commented on SPARK-4697: -- Hey did you search for a duplicate or related JIRA in

[jira] [Commented] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2014-12-03 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233385#comment-14233385 ] Sandy Ryza commented on SPARK-4687: --- [~pwendell], do you think this is a reasonable API

[jira] [Commented] (SPARK-4552) query for empty parquet table in spark sql hive get IllegalArgumentException

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233393#comment-14233393 ] Apache Spark commented on SPARK-4552: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233443#comment-14233443 ] Yana Kadiyska commented on SPARK-4702: -- Michael, just wanted to point out that the

[jira] [Reopened] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-3926: -- I am reopening as it is not actually serializable without a no-arg constructor. PR coming shortly, that

[jira] [Commented] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233453#comment-14233453 ] Apache Spark commented on SPARK-3926: - User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233457#comment-14233457 ] Michael Armbrust commented on SPARK-4702: - Yana, I'm a little confused. Were both

[jira] [Closed] (SPARK-4701) Typo in sbt/sbt

2014-12-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4701. Resolution: Fixed Fix Version/s: 1.1.2 1.2.0 Assignee:

[jira] [Updated] (SPARK-4691) code optimization for judgement

2014-12-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4691: - Affects Version/s: 1.1.0 code optimization for judgement ---

[jira] [Resolved] (SPARK-2143) Display Spark version on Driver web page

2014-12-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2143. -- Resolution: Fixed Fix Version/s: 1.2.0 Target Version/s: 1.2.0 (This went in for

[jira] [Updated] (SPARK-4715) ShuffleMemoryManager.tryToAcquire may return a negative value

2014-12-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4715: - Affects Version/s: 1.1.0 Assignee: Shixiong Zhu ShuffleMemoryManager.tryToAcquire may

[jira] [Closed] (SPARK-4715) ShuffleMemoryManager.tryToAcquire may return a negative value

2014-12-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4715. Resolution: Fixed Fix Version/s: 1.1.2 1.2.0 Target Version/s: 1.2.0,

[jira] [Commented] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233499#comment-14233499 ] Michael Armbrust commented on SPARK-4702: - Also, have you tested with:

[jira] [Commented] (SPARK-4702) Querying non-existent partition produces exception in v1.2.0-rc1

2014-12-03 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233514#comment-14233514 ] Yana Kadiyska commented on SPARK-4702: -- Michael, I do not have a 1.1. In October I

[jira] [Commented] (SPARK-4575) Documentation for the pipeline features

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233548#comment-14233548 ] Apache Spark commented on SPARK-4575: - User 'jkbradley' has created a pull request for

[jira] [Commented] (SPARK-3431) Parallelize execution of tests

2014-12-03 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233568#comment-14233568 ] Nicholas Chammas commented on SPARK-3431: - [~joshrosen] I tried [that patch you

[jira] [Created] (SPARK-4730) Warn against deprecated YARN settings

2014-12-03 Thread Andrew Or (JIRA)
Andrew Or created SPARK-4730: Summary: Warn against deprecated YARN settings Key: SPARK-4730 URL: https://issues.apache.org/jira/browse/SPARK-4730 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4730) Warn against deprecated YARN settings

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233596#comment-14233596 ] Apache Spark commented on SPARK-4730: - User 'andrewor14' has created a pull request

[jira] [Resolved] (SPARK-4552) query for empty parquet table in spark sql hive get IllegalArgumentException

2014-12-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4552. - Resolution: Fixed Issue resolved by pull request 3586

[jira] [Created] (SPARK-4731) Spark 1.1.1 launches broken EC2 clusters

2014-12-03 Thread Jey Kottalam (JIRA)
Jey Kottalam created SPARK-4731: --- Summary: Spark 1.1.1 launches broken EC2 clusters Key: SPARK-4731 URL: https://issues.apache.org/jira/browse/SPARK-4731 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-4731) Spark 1.1.1 launches broken EC2 clusters

2014-12-03 Thread Jey Kottalam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jey Kottalam updated SPARK-4731: Description: EC2 clusters launched using Spark 1.1.1's `spark-ec2` script with the `-v 1.1.1` flag

[jira] [Resolved] (SPARK-4498) Standalone Master can fail to recognize completed/failed applications

2014-12-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4498. --- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3550

[jira] [Commented] (SPARK-2188) Support sbt/sbt for Windows

2014-12-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233656#comment-14233656 ] Apache Spark commented on SPARK-2188: - User 'tsudukim' has created a pull request for

[jira] [Commented] (SPARK-2188) Support sbt/sbt for Windows

2014-12-03 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233670#comment-14233670 ] Masayoshi TSUZUKI commented on SPARK-2188: -- I implemented the equivalent scripts

[jira] [Created] (SPARK-4732) All application progress on the standalone scheduler can be halted by one systematically faulty node

2014-12-03 Thread Harry Brundage (JIRA)
Harry Brundage created SPARK-4732: - Summary: All application progress on the standalone scheduler can be halted by one systematically faulty node Key: SPARK-4732 URL:

[jira] [Updated] (SPARK-4732) All application progress on the standalone scheduler can be halted by one systematically faulty node

2014-12-03 Thread Harry Brundage (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harry Brundage updated SPARK-4732: -- Description: We've experienced several cluster wide outages caused by unexpected system wide

[jira] [Updated] (SPARK-874) Have a --wait flag in ./sbin/stop-all.sh that polls until Worker's are finished

2014-12-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-874: -- Fix Version/s: (was: 1.2.0) Have a --wait flag in ./sbin/stop-all.sh that polls until

[jira] [Updated] (SPARK-4732) All application progress on the standalone scheduler can be halted by one systematically faulty node

2014-12-03 Thread Harry Brundage (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harry Brundage updated SPARK-4732: -- Description: We've experienced several cluster wide outages caused by unexpected system wide

[jira] [Resolved] (SPARK-4085) Job will fail if a shuffle file that's read locally gets deleted

2014-12-03 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4085. Resolution: Fixed Fix Version/s: 1.2.0 Job will fail if a shuffle file that's read

[jira] [Resolved] (SPARK-4711) MLlib optimization: docs should suggest how to choose optimizer

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4711. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3569

[jira] [Updated] (SPARK-4711) MLlib optimization: docs should suggest how to choose optimizer

2014-12-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4711: - Assignee: Joseph K. Bradley MLlib optimization: docs should suggest how to choose optimizer

  1   2   >