[jira] [Updated] (SPARK-4406) SVD should check for k 1

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4406: - Assignee: Manoj Kumar SVD should check for k 1 --

[jira] [Resolved] (SPARK-4406) SVD should check for k 1

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4406. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3945

[jira] [Commented] (SPARK-4924) Factor out code to launch Spark applications into a separate library

2015-01-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272332#comment-14272332 ] Marcelo Vanzin commented on SPARK-4924: --- Hi, I'm not sure I understand your use

[jira] [Commented] (SPARK-2004) QA Automation

2015-01-09 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272373#comment-14272373 ] Nicholas Chammas commented on SPARK-2004: - I recently had a [related

[jira] [Commented] (SPARK-5188) make-distribution.sh should support curl, not only wget to get Tachyon

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272167#comment-14272167 ] Apache Spark commented on SPARK-5188: - User 'sarutak' has created a pull request for

[jira] [Created] (SPARK-5188) make-distribution.sh should support curl, not only wget to get Tachyon

2015-01-09 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-5188: - Summary: make-distribution.sh should support curl, not only wget to get Tachyon Key: SPARK-5188 URL: https://issues.apache.org/jira/browse/SPARK-5188 Project:

[jira] [Updated] (SPARK-5104) Distributed Representations of Sentences and Documents

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5104: - Target Version/s: (was: 1.3.0) Distributed Representations of Sentences and Documents

[jira] [Commented] (SPARK-4879) Missing output partitions after job completes with speculative execution

2015-01-09 Thread Zach Fry (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272291#comment-14272291 ] Zach Fry commented on SPARK-4879: - Hey Josh, I was able to reproduce the missing file

[jira] [Closed] (SPARK-4990) Search SPARK_CONF_DIR first when --properties-file is not specified

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4990. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: WangTaoTheTonic Target

[jira] [Updated] (SPARK-4990) Search SPARK_CONF_DIR first when --properties-file is not specified

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4990: - Affects Version/s: 1.0.0 Search SPARK_CONF_DIR first when --properties-file is not specified

[jira] [Commented] (SPARK-5010) native openblas library doesn't work: undefined symbol: cblas_dscal

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272280#comment-14272280 ] Xiangrui Meng commented on SPARK-5010: -- Which linux distribution are you using? On

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-09 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272184#comment-14272184 ] Pedro Rodriguez commented on SPARK-1405: Sounds good Joseph. Have some good news.

[jira] [Commented] (SPARK-5104) Distributed Representations of Sentences and Documents

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272276#comment-14272276 ] Xiangrui Meng commented on SPARK-5104: -- [~gq] Just want to note that we need design

[jira] [Commented] (SPARK-5056) Implementing Clara k-medoids clustering algorithm for large datasets

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272277#comment-14272277 ] Xiangrui Meng commented on SPARK-5056: -- [~tmilinovic] We had some discussion in

[jira] [Resolved] (SPARK-5141) CaseInsensitiveMap throws java.io.NotSerializableException

2015-01-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5141. Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Assignee: Gankun Luo

[jira] [Updated] (SPARK-5141) CaseInsensitiveMap throws java.io.NotSerializableException

2015-01-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5141: --- Fix Version/s: (was: 1.2.1) CaseInsensitiveMap throws java.io.NotSerializableException

[jira] [Commented] (SPARK-4924) Factor out code to launch Spark applications into a separate library

2015-01-09 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272338#comment-14272338 ] Nicholas Chammas commented on SPARK-4924: - Okie doke. I ask because I've been

[jira] [Commented] (SPARK-4924) Factor out code to launch Spark applications into a separate library

2015-01-09 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272315#comment-14272315 ] Nicholas Chammas commented on SPARK-4924: - [~vanzin] - How does this proposal

[jira] [Commented] (SPARK-4737) Prevent serialization errors from ever crashing the DAG scheduler

2015-01-09 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272214#comment-14272214 ] Patrick Wendell commented on SPARK-4737: It's great to see this go in. Thanks

[jira] [Commented] (SPARK-4879) Missing output partitions after job completes with speculative execution

2015-01-09 Thread Zach Fry (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272292#comment-14272292 ] Zach Fry commented on SPARK-4879: - For clarity, here is the scala code I used in the REPL:

[jira] [Created] (SPARK-5187) CACHE TABLE AS SELECT fails with Hive UDFs

2015-01-09 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-5187: --- Summary: CACHE TABLE AS SELECT fails with Hive UDFs Key: SPARK-5187 URL: https://issues.apache.org/jira/browse/SPARK-5187 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5187) CACHE TABLE AS SELECT fails with Hive UDFs

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272103#comment-14272103 ] Apache Spark commented on SPARK-5187: - User 'marmbrus' has created a pull request for

[jira] [Created] (SPARK-5177) Add environment variables in dev/run-tests to enable Hive 0.12 and Scala 2.11 Jenkins builder

2015-01-09 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-5177: - Summary: Add environment variables in dev/run-tests to enable Hive 0.12 and Scala 2.11 Jenkins builder Key: SPARK-5177 URL: https://issues.apache.org/jira/browse/SPARK-5177

[jira] [Commented] (SPARK-3619) Upgrade to Mesos 0.21 to work around MESOS-1688

2015-01-09 Thread Jongyoul Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270730#comment-14270730 ] Jongyoul Lee commented on SPARK-3619: - [~tnachen] [~matei] I am finished some tests

[jira] [Updated] (SPARK-3019) Pluggable block transfer (data plane communication) interface

2015-01-09 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-3019: Attachment: Pluggable RPC - draft 1.pdf Pluggable block transfer (data plane communication)

[jira] [Closed] (SPARK-5170) fetch the correct max attempts

2015-01-09 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] WangTaoTheTonic closed SPARK-5170. -- Resolution: Duplicate Duplicated submit as the server block issue. So close it. fetch the

[jira] [Created] (SPARK-5170) fetch the correct max attempts

2015-01-09 Thread WangTaoTheTonic (JIRA)
WangTaoTheTonic created SPARK-5170: -- Summary: fetch the correct max attempts Key: SPARK-5170 URL: https://issues.apache.org/jira/browse/SPARK-5170 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5169) fetch the correct max attempts

2015-01-09 Thread WangTaoTheTonic (JIRA)
WangTaoTheTonic created SPARK-5169: -- Summary: fetch the correct max attempts Key: SPARK-5169 URL: https://issues.apache.org/jira/browse/SPARK-5169 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5124) Standardize internal RPC interface

2015-01-09 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-5124: Attachment: Pluggable RPC - draft 1.pdf Standardize internal RPC interface

[jira] [Commented] (SPARK-5169) fetch the correct max attempts

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270743#comment-14270743 ] Apache Spark commented on SPARK-5169: - User 'WangTaoTheTonic' has created a pull

[jira] [Comment Edited] (SPARK-5124) Standardize internal RPC interface

2015-01-09 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270771#comment-14270771 ] Reynold Xin edited comment on SPARK-5124 at 1/9/15 8:49 AM:

[jira] [Updated] (SPARK-5124) Standardize internal RPC interface

2015-01-09 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-5124: Attachment: (was: Pluggable RPC - draft 1.pdf) Standardize internal RPC interface

[jira] [Comment Edited] (SPARK-5022) Change VectorUDT to object

2015-01-09 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270817#comment-14270817 ] Manoj Kumar edited comment on SPARK-5022 at 1/9/15 9:50 AM:

[jira] [Commented] (SPARK-5022) Change VectorUDT to object

2015-01-09 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270817#comment-14270817 ] Manoj Kumar commented on SPARK-5022: @josephkb I want to have a go at this one. Should

[jira] [Comment Edited] (SPARK-5119) java.lang.ArrayIndexOutOfBoundsException on trying to train decision tree model

2015-01-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270825#comment-14270825 ] Sean Owen edited comment on SPARK-5119 at 1/9/15 10:00 AM: --- Yes,

[jira] [Commented] (SPARK-5119) java.lang.ArrayIndexOutOfBoundsException on trying to train decision tree model

2015-01-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270825#comment-14270825 ] Sean Owen commented on SPARK-5119: -- Yes, the input must contain categories that are

[jira] [Updated] (SPARK-5172) spark-examples-***.jar shades a wrong Hadoop distribution

2015-01-09 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-5172: Description: Steps to check it: 1. Download spark-1.2.0-bin-hadoop2.4.tgz from

[jira] [Created] (SPARK-5171) Standalone cluster: masters scheduling independently

2015-01-09 Thread Roberto Vaquerizo Rodriguez (JIRA)
Roberto Vaquerizo Rodriguez created SPARK-5171: -- Summary: Standalone cluster: masters scheduling independently Key: SPARK-5171 URL: https://issues.apache.org/jira/browse/SPARK-5171

[jira] [Updated] (SPARK-5124) Standardize internal RPC interface

2015-01-09 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-5124: Attachment: Pluggable RPC - draft 1.pdf Standardize internal RPC interface

[jira] [Commented] (SPARK-5124) Standardize internal RPC interface

2015-01-09 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14270809#comment-14270809 ] Shixiong Zhu commented on SPARK-5124: - {quote} 1. For DAGScheduler, we are probably OK

[jira] [Updated] (SPARK-3490) Alleviate port collisions during tests

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3490: - Fix Version/s: 0.9.3 Alleviate port collisions during tests --

[jira] [Updated] (SPARK-5145) Add BLAS.dsyr and use it in GaussianMixtureEM

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5145: - Assignee: Liang-Chi Hsieh Add BLAS.dsyr and use it in GaussianMixtureEM

[jira] [Resolved] (SPARK-5145) Add BLAS.dsyr and use it in GaussianMixtureEM

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5145. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3949

[jira] [Commented] (SPARK-5022) Change VectorUDT to object

2015-01-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271696#comment-14271696 ] Joseph K. Bradley commented on SPARK-5022: -- There are lots of items out there,

[jira] [Updated] (SPARK-5041) hive-exec jar should be generated with JDK 6

2015-01-09 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated SPARK-5041: -- Labels: maven (was: ) hive-exec jar should be generated with JDK 6

[jira] [Commented] (SPARK-5022) Change VectorUDT to object

2015-01-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271573#comment-14271573 ] Joseph K. Bradley commented on SPARK-5022: -- A major refactor of SQL data types is

[jira] [Resolved] (SPARK-1143) ClusterSchedulerSuite (soon to be TaskSchedulerImplSuite) does not actually test the ClusterScheduler/TaskSchedulerImpl

2015-01-09 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-1143. Resolution: Fixed Assignee: Kay Ousterhout (was: Nan Zhu) ClusterSchedulerSuite

[jira] [Commented] (SPARK-5022) Change VectorUDT to object

2015-01-09 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271608#comment-14271608 ] Manoj Kumar commented on SPARK-5022: Hi, Thanks for the reply. I have worked on

[jira] [Updated] (SPARK-5015) GaussianMixtureEM should take random seed parameter

2015-01-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5015: - Priority: Critical (was: Minor) GaussianMixtureEM should take random seed parameter

[jira] [Commented] (SPARK-5015) GaussianMixtureEM should take random seed parameter

2015-01-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271656#comment-14271656 ] Joseph K. Bradley commented on SPARK-5015: -- I'm upping this to critical since it

[jira] [Resolved] (SPARK-3619) Upgrade to Mesos 0.21 to work around MESOS-1688

2015-01-09 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-3619. -- Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Jongyoul Lee (was: Timothy

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-09 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271700#comment-14271700 ] Joseph K. Bradley commented on SPARK-1405: -- That's great to hear that online

[jira] [Commented] (SPARK-5152) Let metrics.properties file take an hdfs:// path

2015-01-09 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271625#comment-14271625 ] Ryan Williams commented on SPARK-5152: -- so I've been fumbling my way around the

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-09 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271717#comment-14271717 ] Pedro Rodriguez commented on SPARK-1405: Second on nice design doc and proposal. I

[jira] [Updated] (SPARK-2910) Test with Python 2.6 on Jenkins

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2910: - Fix Version/s: 0.9.2 Test with Python 2.6 on Jenkins ---

[jira] [Commented] (SPARK-5177) Add environment variables in dev/run-tests to enable Hive 0.12 and Scala 2.11 Jenkins builder

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271490#comment-14271490 ] Apache Spark commented on SPARK-5177: - User 'liancheng' has created a pull request for

[jira] [Updated] (SPARK-2101) Python unit tests fail on Python 2.6 because of lack of unittest.skipIf()

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2101: - Fix Version/s: 0.9.3 Python unit tests fail on Python 2.6 because of lack of unittest.skipIf()

[jira] [Updated] (SPARK-2954) PySpark MLlib serialization tests fail on Python 2.6

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2954: - Fix Version/s: 0.9.3 PySpark MLlib serialization tests fail on Python 2.6

[jira] [Updated] (SPARK-2948) PySpark doesn't work on Python 2.6

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2948: - Fix Version/s: 0.9.3 PySpark doesn't work on Python 2.6 --

[jira] [Resolved] (SPARK-5163) Load properties from configuration file for example spark-defaults.conf when creating SparkConf object

2015-01-09 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5163. Resolution: Won't Fix I'd prefer not to accept this patch for now - the spark-defaults.conf

[jira] [Updated] (SPARK-5073) spark.storage.memoryMapThreshold has two default values

2015-01-09 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5073: --- Summary: spark.storage.memoryMapThreshold has two default values (was:

[jira] [Commented] (SPARK-5095) Support launching multiple mesos executors in coarse grained mesos mode

2015-01-09 Thread Gerard Maas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271508#comment-14271508 ] Gerard Maas commented on SPARK-5095: If I understand correctly, setting

[jira] [Resolved] (SPARK-5136) Improve documentation around setting up Spark IntelliJ project

2015-01-09 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5136. Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Improve

[jira] [Updated] (SPARK-5073) spark.storage.memoryMapThreshold has two default value

2015-01-09 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5073: --- Summary: spark.storage.memoryMapThreshold has two default value (was:

[jira] [Commented] (SPARK-5053) Test maintenance branches on Jenkins using SBT

2015-01-09 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271793#comment-14271793 ] Josh Rosen commented on SPARK-5053: --- It doesn't look like the disable web UI in tests

[jira] [Updated] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4006: - Target Version/s: 1.2.0, 1.1.1, 0.9.3, 1.0.3 (was: 1.1.1, 1.2.0, 1.0.3) Spark Driver crashes whenever

[jira] [Updated] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4006: - Fix Version/s: 0.9.3 Spark Driver crashes whenever an Executor is registered twice

[jira] [Created] (SPARK-5184) Improve the performance of metadata operations

2015-01-09 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5184: --- Summary: Improve the performance of metadata operations Key: SPARK-5184 URL: https://issues.apache.org/jira/browse/SPARK-5184 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-5015) GaussianMixtureEM should take random seed parameter

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271682#comment-14271682 ] Apache Spark commented on SPARK-5015: - User 'jkbradley' has created a pull request for

[jira] [Commented] (SPARK-5075) Memory Leak when repartitioning SchemaRDD or running queries in general

2015-01-09 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271706#comment-14271706 ] Brad Willard commented on SPARK-5075: - I wanted to add that this is greatly

[jira] [Commented] (SPARK-4879) Missing output partitions after job completes with speculative execution

2015-01-09 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271788#comment-14271788 ] Josh Rosen commented on SPARK-4879: --- Hey Zach, From my last round of attempts (maybe a

[jira] [Commented] (SPARK-5053) Test maintenance branches on Jenkins using SBT

2015-01-09 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271805#comment-14271805 ] Josh Rosen commented on SPARK-5053: --- Ah, looks like Andrew has a PR to backport the

[jira] [Created] (SPARK-5178) Integrate Python unit tests into Jenkins

2015-01-09 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-5178: --- Summary: Integrate Python unit tests into Jenkins Key: SPARK-5178 URL: https://issues.apache.org/jira/browse/SPARK-5178 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5162) Python yarn-cluster mode

2015-01-09 Thread Dana Klassen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271822#comment-14271822 ] Dana Klassen commented on SPARK-5162: - this is duplicate of

[jira] [Created] (SPARK-5185) pyspark --jars does not add classes to driver class path

2015-01-09 Thread Uri Laserson (JIRA)
Uri Laserson created SPARK-5185: --- Summary: pyspark --jars does not add classes to driver class path Key: SPARK-5185 URL: https://issues.apache.org/jira/browse/SPARK-5185 Project: Spark Issue

[jira] [Closed] (SPARK-4737) Prevent serialization errors from ever crashing the DAG scheduler

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4737. Resolution: Fixed Fix Version/s: 1.3.0 Prevent serialization errors from ever crashing the DAG

[jira] [Commented] (SPARK-4574) Adding support for defining schema in foreign DDL commands.

2015-01-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271984#comment-14271984 ] Yin Huai commented on SPARK-4574: - Let me try to summarize [~scwf]'s PR (with my updates).

[jira] [Updated] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient

2015-01-09 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Derrick Burns updated SPARK-5186: - Description: The implementation of Vector.equals and Vector.hashCode are correct but slow for

[jira] [Commented] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient

2015-01-09 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271989#comment-14271989 ] Derrick Burns commented on SPARK-5186: -- My mistake! I mis-read the implementation of

[jira] [Updated] (SPARK-5179) Spark UI history job duration is wrong

2015-01-09 Thread Olivier Toupin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olivier Toupin updated SPARK-5179: -- Description: In the Web UI, the jobs duration times are wrong when using reviewing the job

[jira] [Commented] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient

2015-01-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272000#comment-14272000 ] Sean Owen commented on SPARK-5186: -- Agree, this could easily be specialized for much

[jira] [Commented] (SPARK-4912) Persistent data source tables

2015-01-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272016#comment-14272016 ] Yin Huai commented on SPARK-4912: - h3. Persistence of metadata Right now all tables

[jira] [Commented] (SPARK-5186) Vector.equals is broken

2015-01-09 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271979#comment-14271979 ] Sean Owen commented on SPARK-5186: -- It looks like equals and hashCode are based on the

[jira] [Updated] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient

2015-01-09 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Derrick Burns updated SPARK-5186: - Summary: Vector.equals and Vector.hashCode are very inefficient (was: Vector.equals is broken)

[jira] [Commented] (SPARK-5182) Partitioning support for tables created by the data source API

2015-01-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272002#comment-14272002 ] Yin Huai commented on SPARK-5182: - Here is the doc from [~marmbrus]. Partitioning data by

[jira] [Commented] (SPARK-4912) Persistent data source tables

2015-01-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272018#comment-14272018 ] Yin Huai commented on SPARK-4912: - h3. Persistence of data Right now the data sources API

[jira] [Commented] (SPARK-5184) Improve the performance of metadata operations

2015-01-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272026#comment-14272026 ] Yin Huai commented on SPARK-5184: - Here is the doc from [~marmbrus]. Metadata operations

[jira] [Commented] (SPARK-4983) Tag EC2 instances in the same call that launches them

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272038#comment-14272038 ] Apache Spark commented on SPARK-4983: - User 'GenTang' has created a pull request for

[jira] [Resolved] (SPARK-5015) GaussianMixtureEM should take random seed parameter

2015-01-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5015. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3981

[jira] [Commented] (SPARK-5174) Missing Document for starting multiple workers/supervisors in actor-based receiver

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271868#comment-14271868 ] Apache Spark commented on SPARK-5174: - User 'CodingCat' has created a pull request for

[jira] [Commented] (SPARK-5175) bug in updating counters when starting multiple workers/supervisors in actor-based receiver

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271869#comment-14271869 ] Apache Spark commented on SPARK-5175: - User 'CodingCat' has created a pull request for

[jira] [Created] (SPARK-5179) Spark UI history job duration is wrong

2015-01-09 Thread Olivier Toupin (JIRA)
Olivier Toupin created SPARK-5179: - Summary: Spark UI history job duration is wrong Key: SPARK-5179 URL: https://issues.apache.org/jira/browse/SPARK-5179 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5181) inaccurate log when WAL is disabled

2015-01-09 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271887#comment-14271887 ] Apache Spark commented on SPARK-5181: - User 'CodingCat' has created a pull request for

[jira] [Closed] (SPARK-1953) yarn client mode Application Master memory size is same as driver memory size

2015-01-09 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-1953. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: WangTaoTheTonic Target

[jira] [Commented] (SPARK-5178) Integrate Python unit tests into Jenkins

2015-01-09 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271858#comment-14271858 ] Nicholas Chammas commented on SPARK-5178: - Relevant links: *

[jira] [Created] (SPARK-5180) Data source API improvement

2015-01-09 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5180: --- Summary: Data source API improvement Key: SPARK-5180 URL: https://issues.apache.org/jira/browse/SPARK-5180 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-4574) Adding support for defining schema in foreign DDL commands.

2015-01-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4574: Issue Type: Sub-task (was: New Feature) Parent: SPARK-5180 Adding support for defining schema in

[jira] [Updated] (SPARK-4912) Persistent data source tables

2015-01-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4912: Issue Type: Sub-task (was: Improvement) Parent: SPARK-5180 Persistent data source tables

[jira] [Updated] (SPARK-4574) Adding support for defining schema in foreign DDL commands.

2015-01-09 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-4574: Priority: Blocker (was: Major) Adding support for defining schema in foreign DDL commands.

[jira] [Created] (SPARK-5183) Document data source API

2015-01-09 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5183: --- Summary: Document data source API Key: SPARK-5183 URL: https://issues.apache.org/jira/browse/SPARK-5183 Project: Spark Issue Type: Sub-task Components:

  1   2   >