[jira] [Commented] (SPARK-6199) Support CTE

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14350047#comment-14350047 ] Apache Spark commented on SPARK-6199: - User 'haiyangsea' has created a pull request fo

[jira] [Created] (SPARK-6199) Support CTE

2015-03-05 Thread haiyang (JIRA)
haiyang created SPARK-6199: -- Summary: Support CTE Key: SPARK-6199 URL: https://issues.apache.org/jira/browse/SPARK-6199 Project: Spark Issue Type: Improvement Components: SQL R

[jira] [Commented] (SPARK-6196) Add MAPR 4.0.2 support to the build

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14350034#comment-14350034 ] Apache Spark commented on SPARK-6196: - User 'trystanleftwich' has created a pull reque

[jira] [Closed] (SPARK-3275) Socket receiver can not recover when the socket server restarted

2015-03-05 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jack Hu closed SPARK-3275. -- Resolution: Fixed Fix Version/s: 1.1.0 Checked in 1.1.0; did not find a similar issue. > Socket receiver c

[jira] [Commented] (SPARK-6197) handle json parse exception for eventlog file not finished writing

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349984#comment-14349984 ] Apache Spark commented on SPARK-6197: - User 'liyezhang556520' has created a pull reque

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-05 Thread Yi Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349929#comment-14349929 ] Yi Zhou commented on SPARK-5791: [~yhuai] Currently all of input tables are ORC file forma

[jira] [Commented] (SPARK-6198) Support "select current_database()"

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349923#comment-14349923 ] Apache Spark commented on SPARK-6198: - User 'DoingDone9' has created a pull request fo

[jira] [Updated] (SPARK-6198) Support "select current_database()"

2015-03-05 Thread DoingDone9 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DoingDone9 updated SPARK-6198: -- Description: The method (evaluate) has changed in UDFCurrentDB; it just throws an exception. But hiveUdfs

[jira] [Commented] (SPARK-4588) Add API for feature attributes

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349907#comment-14349907 ] Apache Spark commented on SPARK-4588: - User 'mengxr' has created a pull request for th

[jira] [Created] (SPARK-6198) Support "select current_database()"

2015-03-05 Thread DoingDone9 (JIRA)
DoingDone9 created SPARK-6198: - Summary: Support "select current_database()" Key: SPARK-6198 URL: https://issues.apache.org/jira/browse/SPARK-6198 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-6197) handle json parse exception for eventlog file not finished writing

2015-03-05 Thread Zhang, Liye (JIRA)
Zhang, Liye created SPARK-6197: -- Summary: handle json parse exception for eventlog file not finished writing Key: SPARK-6197 URL: https://issues.apache.org/jira/browse/SPARK-6197 Project: Spark

[jira] [Created] (SPARK-6196) Add MAPR 4.0.2 support to the build

2015-03-05 Thread Trystan Leftwich (JIRA)
Trystan Leftwich created SPARK-6196: --- Summary: Add MAPR 4.0.2 support to the build Key: SPARK-6196 URL: https://issues.apache.org/jira/browse/SPARK-6196 Project: Spark Issue Type: Improveme

[jira] [Updated] (SPARK-6141) Upgrade Breeze to 0.11 to fix convergence bug

2015-03-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6141: --- Fix Version/s: (was: 1.3.1) 1.3.0 > Upgrade Breeze to 0.11 to fix conve

[jira] [Commented] (SPARK-5124) Standardize internal RPC interface

2015-03-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349816#comment-14349816 ] Reynold Xin commented on SPARK-5124: RpcCallContext sounds good. > Standardize intern

[jira] [Commented] (SPARK-6113) Stabilize DecisionTree and ensembles APIs

2015-03-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349811#comment-14349811 ] Joseph K. Bradley commented on SPARK-6113: -- Pinging [~MechCoder] since you've bee

[jira] [Commented] (SPARK-5124) Standardize internal RPC interface

2015-03-05 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349808#comment-14349808 ] Shixiong Zhu commented on SPARK-5124: - 1. For the local Endpoint, let's put it in a fu

[jira] [Commented] (SPARK-6194) collect() in PySpark will cause memory leak in JVM

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349803#comment-14349803 ] Apache Spark commented on SPARK-6194: - User 'davies' has created a pull request for th

[jira] [Created] (SPARK-6195) Specialized in-memory column type for decimal

2015-03-05 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-6195: - Summary: Specialized in-memory column type for decimal Key: SPARK-6195 URL: https://issues.apache.org/jira/browse/SPARK-6195 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4925) Publish Spark SQL hive-thriftserver maven artifact

2015-03-05 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349760#comment-14349760 ] Peng Cheng commented on SPARK-4925: --- Me too, though the patch is applied in 1.2.1. We ju

[jira] [Updated] (SPARK-6194) collect() in PySpark will cause memory leak in JVM

2015-03-05 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-6194: -- Description: It could be reproduced by: {code} for i in range(40): sc.parallelize(range(5000), 10)

[jira] [Created] (SPARK-6194) collect() in PySpark will cause memory leak in JVM

2015-03-05 Thread Davies Liu (JIRA)
Davies Liu created SPARK-6194: - Summary: collect() in PySpark will cause memory leak in JVM Key: SPARK-6194 URL: https://issues.apache.org/jira/browse/SPARK-6194 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6193) Speed up how spark-ec2 searches for clusters

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349713#comment-14349713 ] Apache Spark commented on SPARK-6193: - User 'nchammas' has created a pull request for

[jira] [Commented] (SPARK-5124) Standardize internal RPC interface

2015-03-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349689#comment-14349689 ] Marcelo Vanzin commented on SPARK-5124: --- re: a local Endpoint, if it's really necess

[jira] [Created] (SPARK-6193) Speed up how spark-ec2 searches for clusters

2015-03-05 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-6193: --- Summary: Speed up how spark-ec2 searches for clusters Key: SPARK-6193 URL: https://issues.apache.org/jira/browse/SPARK-6193 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6095) Support model save/load in Python's linear models

2015-03-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6095: - Target Version/s: 1.4.0 > Support model save/load in Python's linear models >

[jira] [Updated] (SPARK-6095) Support model save/load in Python's linear models

2015-03-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6095: - Assignee: Yanbo Liang > Support model save/load in Python's linear models > --

[jira] [Commented] (SPARK-6142) 10-12% Performance regression with "finalize"

2015-03-05 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349646#comment-14349646 ] Nishkam Ravi commented on SPARK-6142: - [~zsxwing]Yes, the 10-12% regression was ve

[jira] [Commented] (SPARK-3369) Java mapPartitions Iterator->Iterable is inconsistent with Scala's Iterator->Iterator

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349593#comment-14349593 ] Sean Owen commented on SPARK-3369: -- I sympathize with this. The problem is that changing

[jira] [Commented] (SPARK-3369) Java mapPartitions Iterator->Iterable is inconsistent with Scala's Iterator->Iterator

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349594#comment-14349594 ] Sean Owen commented on SPARK-3369: -- I sympathize with this. The problem is that changing

[jira] [Commented] (SPARK-3369) Java mapPartitions Iterator->Iterable is inconsistent with Scala's Iterator->Iterator

2015-03-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349577#comment-14349577 ] Nicholas Chammas commented on SPARK-3369: - {quote} How about breaking backward com

[jira] [Comment Edited] (SPARK-3369) Java mapPartitions Iterator->Iterable is inconsistent with Scala's Iterator->Iterator

2015-03-05 Thread Lukas Nalezenec (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122611#comment-14122611 ] Lukas Nalezenec edited comment on SPARK-3369 at 3/5/15 10:50 PM: ---

[jira] [Resolved] (SPARK-6145) ORDER BY fails to resolve nested fields

2015-03-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6145. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4918 [https:/

[jira] [Resolved] (SPARK-6163) jsonFile should be backed by the data source API

2015-03-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6163. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4896 [https:/

[jira] [Updated] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-03-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6192: - Labels: gsoc gsoc2015 mentor (was: gsoc gsoc2015) > Enhance MLlib's Python API (GSoC 2015) >

[jira] [Created] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-03-05 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6192: Summary: Enhance MLlib's Python API (GSoC 2015) Key: SPARK-6192 URL: https://issues.apache.org/jira/browse/SPARK-6192 Project: Spark Issue Type: Umbrella

[jira] [Commented] (SPARK-4311) ContainerLauncher setting up executor -- invalid Xms settings (-Xms0m -Xmx0m)

2015-03-05 Thread Rafael Alfaro Flores (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349468#comment-14349468 ] Rafael Alfaro Flores commented on SPARK-4311: - [~kadiyalakc] Were you able to

[jira] [Updated] (SPARK-6191) Generalize spark-ec2's ability to download libraries from PyPI

2015-03-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-6191: Description: Right now we have a method to specifically download boto. Let's generalize it

[jira] [Commented] (SPARK-6191) Generalize spark-ec2's ability to download libraries from PyPI

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349434#comment-14349434 ] Apache Spark commented on SPARK-6191: - User 'nchammas' has created a pull request for

[jira] [Commented] (SPARK-6145) ORDER BY fails to resolve nested fields

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349421#comment-14349421 ] Apache Spark commented on SPARK-6145: - User 'marmbrus' has created a pull request for

[jira] [Resolved] (SPARK-6175) Executor log links are using internal addresses in EC2; display `:0` when ephemeral ports are used

2015-03-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6175. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4903 [https://github.com/

[jira] [Updated] (SPARK-6191) Generalize spark-ec2's ability to download libraries from PyPI

2015-03-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-6191: Description: Right now we have a method to specifically download boto. Let's generalize it

[jira] [Created] (SPARK-6191) Generalize spark-ec2's ability to download libraries from PyPI

2015-03-05 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-6191: --- Summary: Generalize spark-ec2's ability to download libraries from PyPI Key: SPARK-6191 URL: https://issues.apache.org/jira/browse/SPARK-6191 Project: Spark

[jira] [Updated] (SPARK-6191) Generalize spark-ec2's ability to download libraries from PyPI

2015-03-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-6191: Description: Right now we have a method to specifically download boto. Let's generalize it

[jira] [Resolved] (SPARK-6090) Add BinaryClassificationMetrics in PySpark/MLlib

2015-03-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6090. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4863 [https://githu

[jira] [Comment Edited] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349347#comment-14349347 ] Manoj Kumar edited comment on SPARK-5981 at 3/5/15 7:38 PM: Th

[jira] [Commented] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349347#comment-14349347 ] Manoj Kumar commented on SPARK-5981: Thanks a lot for your patient explanation, I'm al

[jira] [Updated] (SPARK-6175) Executor log links are using internal addresses in EC2; display `:0` when ephemeral ports are used

2015-03-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6175: --- Priority: Blocker (was: Major) > Executor log links are using internal addresses in EC2; disp

[jira] [Resolved] (SPARK-6182) spark-parent pom needs to be published for both 2.10 and 2.11

2015-03-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-6182. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Sean Owen > spark-parent po

[jira] [Commented] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349217#comment-14349217 ] Joseph K. Bradley commented on SPARK-5981: -- I'd recommend exploring the code path

[jira] [Commented] (SPARK-5396) Syntax error in spark scripts on windows.

2015-03-05 Thread Vladimir Protsenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349203#comment-14349203 ] Vladimir Protsenko commented on SPARK-5396: --- It says "Syntax error in command.".

[jira] [Commented] (SPARK-2311) Added additional GLMs (Poisson and Gamma) into MLlib

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349192#comment-14349192 ] Sean Owen commented on SPARK-2311: -- I think this one may be obsolete as a suggested chang

[jira] [Commented] (SPARK-6190) create LargeByteBuffer abstraction for eliminating 2GB limit on blocks

2015-03-05 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349191#comment-14349191 ] Imran Rashid commented on SPARK-6190: - Hi [~rxin], I've attached a design doc here. I

[jira] [Commented] (SPARK-4879) Missing output partitions after job completes with speculative execution

2015-03-05 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349188#comment-14349188 ] Matt Cheah commented on SPARK-4879: --- Can you perhaps jstack or profile the driver and th

[jira] [Updated] (SPARK-6190) create LargeByteBuffer abstraction for eliminating 2GB limit on blocks

2015-03-05 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-6190: Attachment: LargeByteBuffer.pdf design doc > create LargeByteBuffer abstraction for eliminating 2GB

[jira] [Created] (SPARK-6190) create LargeByteBuffer abstraction for eliminating 2GB limit on blocks

2015-03-05 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-6190: --- Summary: create LargeByteBuffer abstraction for eliminating 2GB limit on blocks Key: SPARK-6190 URL: https://issues.apache.org/jira/browse/SPARK-6190 Project: Spark

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349184#comment-14349184 ] Sean Owen commented on SPARK-5838: -- I would link it to the continuation JIRA maybe, and r

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-05 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349183#comment-14349183 ] Yin Huai commented on SPARK-5791: - Also, how large are the results of the "name" subquery? > [

[jira] [Commented] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349179#comment-14349179 ] Manoj Kumar commented on SPARK-5981: Sorry for being slow. But do you mean when predic

[jira] [Comment Edited] (SPARK-5981) pyspark ML models should support predict/transform on vector within map

2015-03-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349179#comment-14349179 ] Manoj Kumar edited comment on SPARK-5981 at 3/5/15 6:04 PM: So

[jira] [Created] (SPARK-6189) Pandas to DataFrame conversion should check field names for periods

2015-03-05 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6189: Summary: Pandas to DataFrame conversion should check field names for periods Key: SPARK-6189 URL: https://issues.apache.org/jira/browse/SPARK-6189 Project: Sp

[jira] [Commented] (SPARK-5791) [Spark SQL] show poor performance when multiple table do join operation

2015-03-05 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349139#comment-14349139 ] Yin Huai commented on SPARK-5791: - [~jameszhouyi] Thank you for the updated physical plan.

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349131#comment-14349131 ] Theodore Vasiloudis commented on SPARK-5838: I had misunderstood the purpose o

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349130#comment-14349130 ] Sean Owen commented on SPARK-5838: -- I don't expect that processes re-read the environment

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349124#comment-14349124 ] Theodore Vasiloudis commented on SPARK-5838: I guess what remains here is the

[jira] [Commented] (SPARK-6188) Instance types can be mislabeled when re-starting cluster with default arguments

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349115#comment-14349115 ] Apache Spark commented on SPARK-6188: - User 'thvasilo' has created a pull request for

[jira] [Resolved] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1867. -- Resolution: Not a Problem OK, I think there are two problems with the same manifestation here, but I th

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349099#comment-14349099 ] Sean Owen commented on SPARK-5838: -- Is there anything here then that's not covered by SPA

[jira] [Created] (SPARK-6188) Instance types can be mislabeled when re-starting cluster with default arguments

2015-03-05 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created SPARK-6188: -- Summary: Instance types can be mislabeled when re-starting cluster with default arguments Key: SPARK-6188 URL: https://issues.apache.org/jira/browse/SPARK-6188

[jira] [Commented] (SPARK-4560) Lambda deserialization error

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349095#comment-14349095 ] Sean Owen commented on SPARK-4560: -- Some poking around on the internet suggests that it m

[jira] [Assigned] (SPARK-6175) Executor log links are using internal addresses in EC2

2015-03-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-6175: - Assignee: Josh Rosen > Executor log links are using internal addresses in EC2 > -

[jira] [Updated] (SPARK-6175) Executor log links are using internal addresses in EC2; display `:0` when ephemeral ports are used

2015-03-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6175: -- Summary: Executor log links are using internal addresses in EC2; display `:0` when ephemeral ports are u

[jira] [Updated] (SPARK-6187) Report full executor exceptions to the driver

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6187: - Priority: Minor (was: Major) It already does this: {{sc.parallelize(Array(1,2,3)).map { i => throw new I

[jira] [Commented] (SPARK-4925) Publish Spark SQL hive-thriftserver maven artifact

2015-03-05 Thread Ernesto Alejandro Menéndez Castillo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349059#comment-14349059 ] Ernesto Alejandro Menéndez Castillo commented on SPARK-4925: I

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349052#comment-14349052 ] Sean Owen commented on SPARK-5838: -- I don't know a lot about this, but I don't know if yo

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349004#comment-14349004 ] Theodore Vasiloudis commented on SPARK-5838: So this seems to be the case: Wh

[jira] [Updated] (SPARK-6187) Report full executor exceptions to the driver

2015-03-05 Thread Piotr Kołaczkowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piotr Kołaczkowski updated SPARK-6187: -- Description: If the task fails for some reason, the driver seems to report only the top

[jira] [Commented] (SPARK-6169) Shuffle based join

2015-03-05 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348995#comment-14348995 ] Mridul Muralidharan commented on SPARK-6169: I have not done too much research

[jira] [Created] (SPARK-6187) Report full executor exceptions to the driver

2015-03-05 Thread Piotr Kołaczkowski (JIRA)
Piotr Kołaczkowski created SPARK-6187: - Summary: Report full executor exceptions to the driver Key: SPARK-6187 URL: https://issues.apache.org/jira/browse/SPARK-6187 Project: Spark Issue T

[jira] [Comment Edited] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348798#comment-14348798 ] Theodore Vasiloudis edited comment on SPARK-5838 at 3/5/15 3:41 PM:

[jira] [Commented] (SPARK-4879) Missing output partitions after job completes with speculative execution

2015-03-05 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348987#comment-14348987 ] Mridul Muralidharan commented on SPARK-4879: [~joshrosen] The former - the cal

[jira] [Comment Edited] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348798#comment-14348798 ] Theodore Vasiloudis edited comment on SPARK-5838 at 3/5/15 3:41 PM:

[jira] [Closed] (SPARK-6158) Move private method boost in GradientBoostedTrees from Object to Class

2015-03-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar closed SPARK-6158. -- Resolution: Not a Problem > Move private method boost in GradientBoostedTrees from Object to Class > ---

[jira] [Commented] (SPARK-6158) Move private method boost in GradientBoostedTrees from Object to Class

2015-03-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348828#comment-14348828 ] Manoj Kumar commented on SPARK-6158: I'm sorry. I'm closing this. I think of the most

[jira] [Comment Edited] (SPARK-6158) Move private method boost in GradientBoostedTrees from Object to Class

2015-03-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348828#comment-14348828 ] Manoj Kumar edited comment on SPARK-6158 at 3/5/15 2:56 PM: I'

[jira] [Commented] (SPARK-6067) Spark sql hive dynamic partitions job will fail if task fails

2015-03-05 Thread Jason Hubbard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348826#comment-14348826 ] Jason Hubbard commented on SPARK-6067: -- This occurred because of an OOM, but we have

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-03-05 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348798#comment-14348798 ] Theodore Vasiloudis commented on SPARK-5838: OK so I think I have found a way

[jira] [Commented] (SPARK-5404) Statistic of Logical Plan is too aggresive

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348793#comment-14348793 ] Apache Spark commented on SPARK-5404: - User 'chenghao-intel' has created a pull reques

[jira] [Commented] (SPARK-6182) spark-parent pom needs to be published for both 2.10 and 2.11

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348767#comment-14348767 ] Apache Spark commented on SPARK-6182: - User 'srowen' has created a pull request for th

[jira] [Commented] (SPARK-6182) spark-parent pom needs to be published for both 2.10 and 2.11

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348766#comment-14348766 ] Apache Spark commented on SPARK-6182: - User 'srowen' has created a pull request for th

[jira] [Commented] (SPARK-6182) spark-parent pom needs to be published for both 2.10 and 2.11

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348752#comment-14348752 ] Sean Owen commented on SPARK-6182: -- I think there are two ways to resolve this: 1) Repla

[jira] [Updated] (SPARK-6177) LDA should check partitions size of the input

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6177: - Component/s: Examples Priority: Minor (was: Major) > LDA should check partitions size of the input

[jira] [Updated] (SPARK-6153) intellij import from maven cannot debug sparksqlclidriver

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6153: - Fix Version/s: (was: 1.3.0) 1.4.0 Fix version = 1.4.0 for now as it was reverted fo

[jira] [Updated] (SPARK-6068) KMeans Parallel test may fail

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6068: - Component/s: Tests Priority: Minor (was: Major) > KMeans Parallel test may fail >

[jira] [Updated] (SPARK-6147) Move JDBC data source integration tests to the Spark integration tests project

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6147: - Priority: Minor (was: Major) Target Version/s: (was: 1.3.0) Issue Type: Improvemen

[jira] [Resolved] (SPARK-4833) spark-shell.cmd not starting again after a while in windows 8.1

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4833. -- Resolution: Cannot Reproduce > spark-shell.cmd not starting again after a while in windows 8.1 > ---

[jira] [Updated] (SPARK-6133) SparkContext#stop is not idempotent

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6133: - Labels: backport-needed (was: ) > SparkContext#stop is not idempotent > -

[jira] [Commented] (SPARK-5183) Document data source API

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348667#comment-14348667 ] Sean Owen commented on SPARK-5183: -- This is still listed as blocking 1.3. Is it resolved?

[jira] [Commented] (SPARK-5310) Update SQL programming guide for 1.3

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348669#comment-14348669 ] Sean Owen commented on SPARK-5310: -- Same here, the two PRs are resolved and this is still

[jira] [Updated] (SPARK-6169) Shuffle based join

2015-03-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6169: - Component/s: Shuffle Description: Leverage improved spark shuffle to do the join more efficiently - th

[jira] [Commented] (SPARK-6115) Description for SparkSQL Jobs doesn't show up correctly until after the job finishes

2015-03-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14348636#comment-14348636 ] Apache Spark commented on SPARK-6115: - User 'Leolh' has created a pull request for thi
