[jira] [Commented] (SPARK-5649) Throw exception when can not apply datatype cast

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319724#comment-14319724 ] Michael Armbrust commented on SPARK-5649: -

[jira] [Resolved] (SPARK-5649) Throw exception when can not apply datatype cast

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5649. - Resolution: Fixed Fix Version/s: 1.3.0 Assignee: wangfei Throw exception

[jira] [Created] (SPARK-5795) api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

2015-02-13 Thread Littlestar (JIRA)
Littlestar created SPARK-5795: - Summary: api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java Key: SPARK-5795 URL: https://issues.apache.org/jira/browse/SPARK-5795 Project: Spark

[jira] [Commented] (SPARK-5795) api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

2015-02-13 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319769#comment-14319769 ] Littlestar commented on SPARK-5795: --- org.apache.spark.api.java.JavaPairRDD<K, V>

[jira] [Commented] (SPARK-3785) Support off-loading computations to a GPU

2015-02-13 Thread Sam Halliday (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319777#comment-14319777 ] Sam Halliday commented on SPARK-3785: - Hi all, just joining the thread :-) I'm the

[jira] [Updated] (SPARK-5795) api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5795: - Priority: Minor (was: Major) When you say doesn't compile, you should show the compilation error.

[jira] [Updated] (SPARK-5728) MQTTStreamSuite leaves behind ActiveMQ database files

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5728: - Fix Version/s: 1.2.2 MQTTStreamSuite leaves behind ActiveMQ database files

[jira] [Resolved] (SPARK-4832) some other processes might take the daemon pid

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4832. -- Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Issue resolved by pull request

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319839#comment-14319839 ] Sean Owen commented on SPARK-5726: -- Go ahead and change it; my guess is that Xiangrui is

[jira] [Updated] (SPARK-4832) some other processes might take the daemon pid

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4832: - Assignee: Tao Wang some other processes might take the daemon pid

[jira] [Updated] (SPARK-4631) Add real unit test for MQTT

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4631: - Target Version/s: (was: 1.3.0) Fix Version/s: 1.2.2 Add real unit test for MQTT

[jira] [Commented] (SPARK-5081) Shuffle write increases

2015-02-13 Thread Dr. Christian Betz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319948#comment-14319948 ] Dr. Christian Betz commented on SPARK-5081: --- From SPARK-5715 I see a *factor

[jira] [Resolved] (SPARK-5285) Removed GroupExpression in catalyst

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5285. - Resolution: Won't Fix Removed GroupExpression in catalyst

[jira] [Resolved] (SPARK-5518) Error messages for plans with invalid AttributeReferences

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5518. - Resolution: Fixed Fix Version/s: 1.3.0 Error messages for plans with invalid

[jira] [Commented] (SPARK-5518) Error messages for plans with invalid AttributeReferences

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319718#comment-14319718 ] Michael Armbrust commented on SPARK-5518: -

[jira] [Comment Edited] (SPARK-5265) Submitting applications on Standalone cluster controlled by Zookeeper forces to know active master

2015-02-13 Thread Wojciech Pituła (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14319965#comment-14319965 ] Wojciech Pituła edited comment on SPARK-5265 at 2/13/15 11:24 AM:

[jira] [Updated] (SPARK-5795) api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

2015-02-13 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Littlestar updated SPARK-5795: -- Attachment: TestStreamCompile.java my testcase on java 1.7 and spark 1.3 trunk. Thanks.

[jira] [Commented] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320038#comment-14320038 ] Peter Rudenko commented on SPARK-4766: -- Very important feature that could make pretty

[jira] [Updated] (SPARK-4267) Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4267: - Target Version/s: (was: 1.3.0) Fix Version/s: 1.2.2 Failing to launch jobs on Spark on YARN

[jira] [Commented] (SPARK-5795) api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

2015-02-13 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320109#comment-14320109 ] Littlestar commented on SPARK-5795: --- Does it same problem as SPARK-5297, thanks.

[jira] [Updated] (SPARK-5252) Streaming StatefulNetworkWordCount example hangs

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5252: - Component/s: PySpark Examples Looks like you have an environment problem: {code}

[jira] [Resolved] (SPARK-5756) Analyzer should not throw scala.NotImplementedError for illegitimate sql

2015-02-13 Thread wangfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangfei resolved SPARK-5756. Resolution: Fixed Analyzer should not throw scala.NotImplementedError for illegitimate sql

[jira] [Commented] (SPARK-5795) api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

2015-02-13 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320083#comment-14320083 ] Littlestar commented on SPARK-5795: --- error info... The method

[jira] [Commented] (SPARK-5799) Compute aggregation function on specified numeric columns

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320352#comment-14320352 ] Apache Spark commented on SPARK-5799: - User 'viirya' has created a pull request for

[jira] [Created] (SPARK-5799) Compute aggregation function on specified numeric columns

2015-02-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5799: -- Summary: Compute aggregation function on specified numeric columns Key: SPARK-5799 URL: https://issues.apache.org/jira/browse/SPARK-5799 Project: Spark

[jira] [Created] (SPARK-5798) Spark shell issue

2015-02-13 Thread DeepakVohra (JIRA)
DeepakVohra created SPARK-5798: -- Summary: Spark shell issue Key: SPARK-5798 URL: https://issues.apache.org/jira/browse/SPARK-5798 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-13 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320309#comment-14320309 ] Mark Khaitman commented on SPARK-5782: -- Would it make sense to instead make the

[jira] [Updated] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-13 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Khaitman updated SPARK-5782: - Priority: Critical (was: Major) Python Worker / Pyspark Daemon Memory Issue

[jira] [Commented] (SPARK-5726) Hadamard Vector Product Transformer

2015-02-13 Thread Octavian Geagla (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320514#comment-14320514 ] Octavian Geagla commented on SPARK-5726: Ok, I've made the change on the PR.

[jira] [Resolved] (SPARK-5345) Fix unstable test case in FsHistoryProviderSuite

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5345. --- Resolution: Fixed It looks like this has been fixed by SPARK-5600, so I'm going to resolve this for

[jira] [Closed] (SPARK-5735) Replace uses of EasyMock with Mockito

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5735. Resolution: Fixed Fix Version/s: 1.3.0 Replace uses of EasyMock with Mockito

[jira] [Created] (SPARK-5802) Cache scaled data in GLM

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5802: Summary: Cache scaled data in GLM Key: SPARK-5802 URL: https://issues.apache.org/jira/browse/SPARK-5802 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320504#comment-14320504 ] Marcelo Vanzin commented on SPARK-5770: --- bq. but the classloader still load the old

[jira] [Updated] (SPARK-5785) Pyspark does not support narrow dependencies

2015-02-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-5785: Description: joins ( cogroups etc.) are always considered to have wide dependencies in pyspark,

[jira] [Updated] (SPARK-5801) Shuffle creates too many nested directories

2015-02-13 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-5801: -- Component/s: Shuffle Shuffle creates too many nested directories

[jira] [Commented] (SPARK-4903) RDD remains cached after DROP TABLE

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320439#comment-14320439 ] Yin Huai commented on SPARK-4903: - I believe that it has been resolved in 1.3 ([see

[jira] [Closed] (SPARK-5732) Add an option to print the spark version in spark script

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5732. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: uncleGen Add an option to print the spark

[jira] [Issue Comment Deleted] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-02-13 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Khaitman updated SPARK-5782: - Comment: was deleted (was: Would it make sense to instead make the _next_limit return the MIN of

[jira] [Updated] (SPARK-5529) Executor is still hold while BlockManager has been removed

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5529: - Component/s: YARN Executor is still hold while BlockManager has been removed

[jira] [Updated] (SPARK-5529) BlockManager heartbeat expiration does not kill executor

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5529: - Assignee: Hong Shen BlockManager heartbeat expiration does not kill executor

[jira] [Commented] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept OR filters

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320499#comment-14320499 ] Michael Armbrust commented on SPARK-5296: - Oh, good point... We should pass down

[jira] [Commented] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320510#comment-14320510 ] Sean Owen commented on SPARK-5770: -- Yeah I think that's the point, that overwriting an

[jira] [Resolved] (SPARK-5626) Spurious test failures due to NullPointerException in EasyMock test code

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5626. --- Resolution: Fixed Spurious test failures due to NullPointerException in EasyMock test code

[jira] [Commented] (SPARK-5803) Use ArrayBuilder instead of ArrayBuffer for primitive types

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320528#comment-14320528 ] Apache Spark commented on SPARK-5803: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-5798) Spark shell issue

2015-02-13 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320533#comment-14320533 ] DeepakVohra commented on SPARK-5798: Thanks Sean for testing. Not all Spark/Scala

[jira] [Resolved] (SPARK-5503) Example code for Power Iteration Clustering

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5503. -- Resolution: Fixed Fix Version/s: 1.3.0 Example code for Power Iteration Clustering

[jira] [Created] (SPARK-5801) Shuffle creates too many nested directories

2015-02-13 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-5801: - Summary: Shuffle creates too many nested directories Key: SPARK-5801 URL: https://issues.apache.org/jira/browse/SPARK-5801 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5529) BlockManager heartbeat expiration does not kill executor

2015-02-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5529: - Summary: BlockManager heartbeat expiration does not kill executor (was: Executor is still hold while

[jira] [Commented] (SPARK-5626) Spurious test failures due to NullPointerException in EasyMock test code

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320519#comment-14320519 ] Josh Rosen commented on SPARK-5626: --- This should hopefully be fixed now that I've merged

[jira] [Created] (SPARK-5803) Use ArrayBuilder instead of ArrayBuffer for primitive types

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5803: Summary: Use ArrayBuilder instead of ArrayBuffer for primitive types Key: SPARK-5803 URL: https://issues.apache.org/jira/browse/SPARK-5803 Project: Spark

[jira] [Commented] (SPARK-5805) Fix the type error in the final example given in MLlib - Clustering documentation

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320680#comment-14320680 ] Apache Spark commented on SPARK-5805: - User 'emres' has created a pull request for

[jira] [Updated] (SPARK-5805) Fix the type error in the final example given in MLlib - Clustering documentation

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5805: - Assignee: Emre Sevinç Fix the type error in the final example given in MLlib - Clustering

[jira] [Commented] (SPARK-5746) INSERT OVERWRITE throws FileNotFoundException when the source and destination point to the same table.

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320562#comment-14320562 ] Yin Huai commented on SPARK-5746: - For now, we will throw an error when we find this case.

[jira] [Created] (SPARK-5806) Organize sections in mllib-clustering.md

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5806: Summary: Organize sections in mllib-clustering.md Key: SPARK-5806 URL: https://issues.apache.org/jira/browse/SPARK-5806 Project: Spark Issue Type:

[jira] [Created] (SPARK-5804) Explicitly manage cache in Crossvalidation k-fold loop

2015-02-13 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-5804: Summary: Explicitly manage cache in Crossvalidation k-fold loop Key: SPARK-5804 URL: https://issues.apache.org/jira/browse/SPARK-5804 Project: Spark Issue

[jira] [Commented] (SPARK-5227) InputOutputMetricsSuite input metrics when reading text file with multiple splits test fails in branch-1.2 SBT Jenkins build w/hadoop1.0 and hadoop2.0 profiles

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320653#comment-14320653 ] Josh Rosen commented on SPARK-5227: --- I think this might be caused by HADOOP-8490: the

[jira] [Commented] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320552#comment-14320552 ] Marcelo Vanzin commented on SPARK-5770: --- It might be possible to fix the behavior,

[jira] [Commented] (SPARK-5804) Explicitly manage cache in Crossvalidation k-fold loop

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320607#comment-14320607 ] Apache Spark commented on SPARK-5804: - User 'petro-rudenko' has created a pull request

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-02-13 Thread Chris Love (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320874#comment-14320874 ] Chris Love commented on SPARK-3821: --- I notice that the packer built ami comes with

[jira] [Commented] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320897#comment-14320897 ] Davies Liu commented on SPARK-5779: --- Yes, I will close it. Python broadcast does not

[jira] [Commented] (SPARK-5730) Group methods in the generated doc for spark.ml algorithms.

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320908#comment-14320908 ] Apache Spark commented on SPARK-5730: - User 'mengxr' has created a pull request for

[jira] [Created] (SPARK-5812) Potential flaky test JavaAPISuite.glom

2015-02-13 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-5812: Summary: Potential flaky test JavaAPISuite.glom Key: SPARK-5812 URL: https://issues.apache.org/jira/browse/SPARK-5812 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5805) Fix the type error in the final example given in MLlib - Clustering documentation

2015-02-13 Thread Emre Sevinç (JIRA)
Emre Sevinç created SPARK-5805: -- Summary: Fix the type error in the final example given in MLlib - Clustering documentation Key: SPARK-5805 URL: https://issues.apache.org/jira/browse/SPARK-5805 Project:

[jira] [Updated] (SPARK-5806) Organize sections in mllib-clustering.md

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5806: - Description: We separate code examples from algorithm descriptions. It would be better if we put

[jira] [Updated] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5731: --- Priority: Blocker (was: Major) Flaky Test:

[jira] [Closed] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-5779. - Resolution: Duplicate Python broadcast does not work with Kryo serializer

[jira] [Commented] (SPARK-5227) InputOutputMetricsSuite input metrics when reading text file with multiple splits test fails in branch-1.2 SBT Jenkins build w/hadoop1.0 and hadoop2.0 profiles

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320903#comment-14320903 ] Apache Spark commented on SPARK-5227: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-02-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320905#comment-14320905 ] Nicholas Chammas commented on SPARK-3821: - If you want Java 8 alongside 7, you can

[jira] [Commented] (SPARK-5679) Flaky tests in InputOutputMetricsSuite: input metrics with interleaved reads and input metrics with mixed read method

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320904#comment-14320904 ] Apache Spark commented on SPARK-5679: - User 'JoshRosen' has created a pull request for

[jira] [Updated] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5779: -- Affects Version/s: (was: 1.2.1) (was: 1.3.0) 1.2.0

[jira] [Updated] (SPARK-5812) Potential flaky test JavaAPISuite.glom

2015-02-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-5812: - Labels: flaky-test (was: ) Potential flaky test JavaAPISuite.glom

[jira] [Commented] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320935#comment-14320935 ] Xiangrui Meng commented on SPARK-5016: -- I think we should compute the inverse in

[jira] [Resolved] (SPARK-5806) Organize sections in mllib-clustering.md

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5806. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4598

[jira] [Commented] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320739#comment-14320739 ] Patrick Wendell commented on SPARK-5731: [~c...@koeninger.org] [~tdas] FYI we've

[jira] [Updated] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5731: --- Labels: flaky-test (was: ) Flaky Test:

[jira] [Updated] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5807: - Description: Right now in CrossValidator for each fold combination and ParamGrid hyperparameter

[jira] [Created] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-5807: Summary: Parallel grid search Key: SPARK-5807 URL: https://issues.apache.org/jira/browse/SPARK-5807 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320754#comment-14320754 ] Tathagata Das commented on SPARK-5731: -- This is very weird. the stream is receiving

[jira] [Updated] (SPARK-5730) Group methods in the generated doc for spark.ml algorithms.

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5730: - Assignee: Xiangrui Meng Group methods in the generated doc for spark.ml algorithms.

[jira] [Created] (SPARK-5810) Maven Coordinate Inclusion failing in pySpark

2015-02-13 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-5810: -- Summary: Maven Coordinate Inclusion failing in pySpark Key: SPARK-5810 URL: https://issues.apache.org/jira/browse/SPARK-5810 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320749#comment-14320749 ] Tathagata Das commented on SPARK-5731: -- Let me take a pass at it. Flaky Test:

[jira] [Commented] (SPARK-5798) Spark shell issue

2015-02-13 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320763#comment-14320763 ] DeepakVohra commented on SPARK-5798: Re-tested on local OS Oracle Linux 6.5 and did

[jira] [Updated] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5807: - Description: Right now in CrossValidator for each fold combination and ParamGrid hyperparameter

[jira] [Commented] (SPARK-5731) Flaky Test: org.apache.spark.streaming.kafka.DirectKafkaStreamSuite.basic stream receiving with multiple topics and smallest starting offset

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320796#comment-14320796 ] Apache Spark commented on SPARK-5731: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-5779) Python broadcast does not work with Kryo serializer

2015-02-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320803#comment-14320803 ] Josh Rosen commented on SPARK-5779: --- I thought we fixed this in SPARK-4882:

[jira] [Commented] (SPARK-4865) Include temporary tables in SHOW TABLES

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320804#comment-14320804 ] Yin Huai commented on SPARK-4865: - I will start to work on it based on SPARK-3299.

[jira] [Updated] (SPARK-4865) Include temporary tables in SHOW TABLES

2015-02-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4865: Priority: Blocker (was: Critical) Include temporary tables in SHOW TABLES

[jira] [Created] (SPARK-5809) OutOfMemoryError in logDebug in RandomForest.scala

2015-02-13 Thread Devesh Parekh (JIRA)
Devesh Parekh created SPARK-5809: Summary: OutOfMemoryError in logDebug in RandomForest.scala Key: SPARK-5809 URL: https://issues.apache.org/jira/browse/SPARK-5809 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-5789) Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.

2015-02-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5789. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4582

[jira] [Created] (SPARK-5811) Documentation for --packages and --repositories on Spark Shell

2015-02-13 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-5811: -- Summary: Documentation for --packages and --repositories on Spark Shell Key: SPARK-5811 URL: https://issues.apache.org/jira/browse/SPARK-5811 Project: Spark

[jira] [Commented] (SPARK-5363) Spark 1.2 freeze without error notification

2015-02-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320986#comment-14320986 ] Apache Spark commented on SPARK-5363: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-02-13 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320995#comment-14320995 ] Florian Verhein commented on SPARK-3821: RE: Java, that reminds me... We should

[jira] [Resolved] (SPARK-5730) Group methods in the generated doc for spark.ml algorithms.

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5730. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4600

[jira] [Commented] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-13 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14321099#comment-14321099 ] Travis Galoppo commented on SPARK-5016: --- Hmm. I'm having trouble conceptualizing how

[jira] [Resolved] (SPARK-5803) Use ArrayBuilder instead of ArrayBuffer for primitive types

2015-02-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5803. -- Resolution: Fixed Fix Version/s: 1.3.0 Use ArrayBuilder instead of ArrayBuffer for

[jira] [Commented] (SPARK-5363) Spark 1.2 freeze without error notification

2015-02-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320987#comment-14320987 ] Davies Liu commented on SPARK-5363: --- [~TJKlein] Could you try the patch in

[jira] [Created] (SPARK-5813) Spark-ec2: Switch to OracleJDK

2015-02-13 Thread Florian Verhein (JIRA)
Florian Verhein created SPARK-5813: -- Summary: Spark-ec2: Switch to OracleJDK Key: SPARK-5813 URL: https://issues.apache.org/jira/browse/SPARK-5813 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-5814) Remove JBLAS from runtime dependencies

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5814: Summary: Remove JBLAS from runtime dependencies Key: SPARK-5814 URL: https://issues.apache.org/jira/browse/SPARK-5814 Project: Spark Issue Type: Dependency

[jira] [Created] (SPARK-5815) Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS

2015-02-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5815: Summary: Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS Key: SPARK-5815 URL: https://issues.apache.org/jira/browse/SPARK-5815 Project: Spark

[jira] [Updated] (SPARK-5124) Standardize internal RPC interface

2015-02-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-5124: Attachment: Pluggable RPC - draft 2.pdf Compared to the first version, this doc adds
