[jira] [Commented] (SPARK-4497) HiveThriftServer2 does not exit properly on failure

2015-12-15 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058324#comment-15058324 ] Yana Kadiyska commented on SPARK-4497: -- [~jeffzhang] I have moved on to 1.2 so I cannot comment on

[jira] [Commented] (SPARK-11255) R Test build should run on R 3.1.1

2015-12-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058459#comment-15058459 ] Josh Rosen commented on SPARK-11255: Just to be clear, my proposal was to bump from 3.1.1 to 3.1.2.

[jira] [Commented] (SPARK-12010) Spark JDBC requires support for column-name-free INSERT syntax

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058409#comment-15058409 ] Apache Spark commented on SPARK-12010: -- User 'CK50' has created a pull request for this issue:

[jira] [Resolved] (SPARK-4497) HiveThriftServer2 does not exit properly on failure

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4497. -- Resolution: Cannot Reproduce > HiveThriftServer2 does not exit properly on failure >

[jira] [Commented] (SPARK-12263) IllegalStateException: Memory can't be 0 for SPARK_WORKER_MEMORY without unit

2015-12-15 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058429#comment-15058429 ] Neelesh Srinivas Salian commented on SPARK-12263: - I would like to work on this. Please

[jira] [Commented] (SPARK-12061) Persist for Map/filter with Lambda Functions don't always read from Cache

2015-12-15 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058419#comment-15058419 ] Xiao Li commented on SPARK-12061: - Start working on it. Thanks! > Persist for Map/filter with Lambda

[jira] [Updated] (SPARK-12336) Outer join using multiple columns results in wrong nullability

2015-12-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12336: Assignee: Cheng Lian (was: Apache Spark) > Outer join using multiple columns results in wrong

[jira] [Assigned] (SPARK-12317) Support configurate value with unit(e.g. kb/mb/gb) in SQL

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12317: Assignee: Apache Spark > Support configurate value with unit(e.g. kb/mb/gb) in SQL >

[jira] [Assigned] (SPARK-12317) Support configurate value with unit(e.g. kb/mb/gb) in SQL

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12317: Assignee: (was: Apache Spark) > Support configurate value with unit(e.g. kb/mb/gb) in

[jira] [Updated] (SPARK-11255) R Test build should run on R 3.1.2

2015-12-15 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp updated SPARK-11255: Summary: R Test build should run on R 3.1.2 (was: R Test build should run on R 3.1.1) > R Test

[jira] [Assigned] (SPARK-9886) Validate usages of Runtime.getRuntime.addShutdownHook

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9886: --- Assignee: Apache Spark > Validate usages of Runtime.getRuntime.addShutdownHook >

[jira] [Assigned] (SPARK-12317) Support configurate value with unit(e.g. kb/mb/gb) in SQL

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12317: Assignee: (was: Apache Spark) > Support configurate value with unit(e.g. kb/mb/gb) in

[jira] [Commented] (SPARK-12325) Inappropriate error messages in DataFrame StatFunctions

2015-12-15 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058522#comment-15058522 ] Narine Kokhlikyan commented on SPARK-12325: --- Thank you for your generous kindness, [~srowen]. I

[jira] [Created] (SPARK-12344) Remove env-based configurations

2015-12-15 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-12344: -- Summary: Remove env-based configurations Key: SPARK-12344 URL: https://issues.apache.org/jira/browse/SPARK-12344 Project: Spark Issue Type: Sub-task

[jira] [Reopened] (SPARK-11255) R Test build should run on R 3.1.1

2015-12-15 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp reopened SPARK-11255: - testing 3.1.2 on our staging instance right now. > R Test build should run on R 3.1.1 >

[jira] [Commented] (SPARK-12331) R^2 for regression through the origin

2015-12-15 Thread Imran Younus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058439#comment-15058439 ] Imran Younus commented on SPARK-12331: -- Sure, I can take care of this. > R^2 for regression through

[jira] [Commented] (SPARK-12327) lint-r checks fail with commented code

2015-12-15 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058464#comment-15058464 ] Shivaram Venkataraman commented on SPARK-12327: --- Do you have a link to the PR ? These are

[jira] [Created] (SPARK-12345) Mesos cluster mode is broken

2015-12-15 Thread Andrew Or (JIRA)
Andrew Or created SPARK-12345: - Summary: Mesos cluster mode is broken Key: SPARK-12345 URL: https://issues.apache.org/jira/browse/SPARK-12345 Project: Spark Issue Type: Bug Components:

[jira] [Resolved] (SPARK-11808) Remove Bagel

2015-12-15 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-11808. Resolution: Duplicate > Remove Bagel > > > Key: SPARK-11808 >

[jira] [Commented] (SPARK-11255) R Test build should run on R 3.1.2

2015-12-15 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058613#comment-15058613 ] shane knapp commented on SPARK-11255: - on staging, i ran the following shell commands: {quote} for x

[jira] [Commented] (SPARK-12317) Support configurate value with unit(e.g. kb/mb/gb) in SQL

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058629#comment-15058629 ] Apache Spark commented on SPARK-12317: -- User 'kevinyu98' has created a pull request for this issue:

[jira] [Updated] (SPARK-10250) Scala PairRDDFunctions.groupByKey() should be fault-tolerant of single large groups

2015-12-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10250: -- Summary: Scala PairRDDFunctions.groupByKey() should be fault-tolerant of single large groups (was:

[jira] [Created] (SPARK-12343) Remove YARN Client / ClientArguments

2015-12-15 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-12343: -- Summary: Remove YARN Client / ClientArguments Key: SPARK-12343 URL: https://issues.apache.org/jira/browse/SPARK-12343 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11885) UDAF may nondeterministically generate wrong results

2015-12-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058514#comment-15058514 ] Yin Huai commented on SPARK-11885: -- Thank you for the update! > UDAF may nondeterministically generate

[jira] [Commented] (SPARK-10347) Investigate the usage of normalizePath()

2015-12-15 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058520#comment-15058520 ] Shivaram Venkataraman commented on SPARK-10347: --- I think it's a good idea to fix and also a

[jira] [Commented] (SPARK-11255) R Test build should run on R 3.1.1

2015-12-15 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058456#comment-15058456 ] Shivaram Venkataraman commented on SPARK-11255: --- Yeah we could bump up the R version as a

[jira] [Commented] (SPARK-10312) Enhance SerDe to handle atomic vector

2015-12-15 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058478#comment-15058478 ] Shivaram Venkataraman commented on SPARK-10312: --- Since the SerDe is an internal API I think

[jira] [Assigned] (SPARK-12317) Support configurate value with unit(e.g. kb/mb/gb) in SQL

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12317: Assignee: Apache Spark > Support configurate value with unit(e.g. kb/mb/gb) in SQL >

[jira] [Created] (SPARK-12334) Support read from multiple input paths for orc file in DataFrameReader.orc

2015-12-15 Thread Jeff Zhang (JIRA)
Jeff Zhang created SPARK-12334: -- Summary: Support read from multiple input paths for orc file in DataFrameReader.orc Key: SPARK-12334 URL: https://issues.apache.org/jira/browse/SPARK-12334 Project:

[jira] [Commented] (SPARK-11255) R Test build should run on R 3.1.1

2015-12-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15057580#comment-15057580 ] Josh Rosen commented on SPARK-11255: Hey, just curious: is 3.1.1 a hard requirement or would 3.1.2

[jira] [Assigned] (SPARK-8519) Blockify distance computation in k-means

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8519: --- Assignee: Apache Spark > Blockify distance computation in k-means >

[jira] [Assigned] (SPARK-8519) Blockify distance computation in k-means

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8519: --- Assignee: (was: Apache Spark) > Blockify distance computation in k-means >

[jira] [Assigned] (SPARK-12270) JDBC Where clause comparison doesn't work for DB2 char(n)

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12270: Assignee: Apache Spark > JDBC Where clause comparison doesn't work for DB2 char(n) >

[jira] [Updated] (SPARK-9042) Spark SQL incompatibility if security is enforced on the Hive warehouse

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9042: - Summary: Spark SQL incompatibility if security is enforced on the Hive warehouse (was: Spark SQL

[jira] [Commented] (SPARK-8519) Blockify distance computation in k-means

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15057720#comment-15057720 ] Apache Spark commented on SPARK-8519: - User 'yanboliang' has created a pull request for this issue:

[jira] [Commented] (SPARK-11255) R Test build should run on R 3.1.1

2015-12-15 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15057603#comment-15057603 ] Sun Rui commented on SPARK-11255: - in http://spark.apache.org/docs/latest/, it is claimed that Spark runs

[jira] [Commented] (SPARK-11885) UDAF may nondeterministically generate wrong results

2015-12-15 Thread Milad Bourhani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15057641#comment-15057641 ] Milad Bourhani commented on SPARK-11885: Hi, I can no longer replicate the bug on branch 1.5 :)

[jira] [Assigned] (SPARK-12334) Support read from multiple input paths for orc file in DataFrameReader.orc

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12334: Assignee: Apache Spark > Support read from multiple input paths for orc file in

[jira] [Assigned] (SPARK-12334) Support read from multiple input paths for orc file in DataFrameReader.orc

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12334: Assignee: (was: Apache Spark) > Support read from multiple input paths for orc file

[jira] [Comment Edited] (SPARK-6521) Bypass network shuffle read if both endpoints are local

2015-12-15 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15033016#comment-15033016 ] Takeshi Yamamuro edited comment on SPARK-6521 at 12/15/15 9:23 AM: ---

[jira] [Updated] (SPARK-12334) Support read from multiple input paths for orc file in DataFrameReader.orc

2015-12-15 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-12334: --- Component/s: PySpark > Support read from multiple input paths for orc file in DataFrameReader.orc >

[jira] [Resolved] (SPARK-12332) Typo in ResetSystemProperties.scala's comments

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12332. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10303

[jira] [Updated] (SPARK-12332) Typo in ResetSystemProperties.scala's comments

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12332: -- Assignee: holdenk Issue Type: Improvement (was: Bug) > Typo in ResetSystemProperties.scala's

[jira] [Created] (SPARK-12339) NullPointerException on stage kill from web UI

2015-12-15 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-12339: --- Summary: NullPointerException on stage kill from web UI Key: SPARK-12339 URL: https://issues.apache.org/jira/browse/SPARK-12339 Project: Spark Issue

[jira] [Assigned] (SPARK-12336) Outer join using multiple columns results in wrong nullability

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12336: Assignee: Apache Spark (was: Cheng Lian) > Outer join using multiple columns results in

[jira] [Resolved] (SPARK-12219) Spark 1.5.2 code does not build on Scala 2.11.7 with SBT assembly

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12219. --- Resolution: Not A Problem master builds fine with SBT / Scala 2.11. Since building vs 2.11 is the

[jira] [Commented] (SPARK-12331) R^2 for regression through the origin

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15057863#comment-15057863 ] Sean Owen commented on SPARK-12331: --- I'd support this change, if you're willing to do the legwork and

[jira] [Created] (SPARK-12336) Outer join using multiple columns results in wrong nullability

2015-12-15 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-12336: -- Summary: Outer join using multiple columns results in wrong nullability Key: SPARK-12336 URL: https://issues.apache.org/jira/browse/SPARK-12336 Project: Spark

[jira] [Assigned] (SPARK-12337) Implement dropDuplicates() method of DataFrame in SparkR

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12337: Assignee: (was: Apache Spark) > Implement dropDuplicates() method of DataFrame in

[jira] [Assigned] (SPARK-12337) Implement dropDuplicates() method of DataFrame in SparkR

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12337: Assignee: Apache Spark > Implement dropDuplicates() method of DataFrame in SparkR >

[jira] [Updated] (SPARK-12263) IllegalStateException: Memory can't be 0 for SPARK_WORKER_MEMORY without unit

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12263: -- Labels: starter (was: ) > IllegalStateException: Memory can't be 0 for SPARK_WORKER_MEMORY without

[jira] [Created] (SPARK-12335) CentralMomentAgg should be nullable

2015-12-15 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-12335: -- Summary: CentralMomentAgg should be nullable Key: SPARK-12335 URL: https://issues.apache.org/jira/browse/SPARK-12335 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-8519) Blockify distance computation in k-means

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8519: --- Assignee: Apache Spark > Blockify distance computation in k-means >

[jira] [Assigned] (SPARK-12335) CentralMomentAgg should be nullable

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12335: Assignee: Apache Spark (was: Cheng Lian) > CentralMomentAgg should be nullable >

[jira] [Resolved] (SPARK-10157) Add ability to specify s3 bootstrap script to spark-ec2

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10157. --- Resolution: Won't Fix > Add ability to specify s3 bootstrap script to spark-ec2 >

[jira] [Assigned] (SPARK-12336) Outer join using multiple columns results in wrong nullability

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12336: Assignee: Apache Spark (was: Cheng Lian) > Outer join using multiple columns results in

[jira] [Assigned] (SPARK-12336) Outer join using multiple columns results in wrong nullability

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12336: Assignee: Cheng Lian (was: Apache Spark) > Outer join using multiple columns results in

[jira] [Created] (SPARK-12338) Support dropping duplicated rows on selected columns in DataFrame in R style

2015-12-15 Thread Sun Rui (JIRA)
Sun Rui created SPARK-12338: --- Summary: Support dropping duplicated rows on selected columns in DataFrame in R style Key: SPARK-12338 URL: https://issues.apache.org/jira/browse/SPARK-12338 Project: Spark

[jira] [Resolved] (SPARK-12325) Inappropriate error messages in DataFrame StatFunctions

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12325. --- Resolution: Invalid [~Narine] I'm going to push back on this, since it's inappropriate to open a

[jira] [Commented] (SPARK-12332) Typo in ResetSystemProperties.scala's comments

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15057826#comment-15057826 ] Sean Owen commented on SPARK-12332: --- (I don't think this is worth your time for a JIRA, just a PR) >

[jira] [Commented] (SPARK-12332) Typo in ResetSystemProperties.scala's comments

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15057835#comment-15057835 ] Apache Spark commented on SPARK-12332: -- User 'holdenk' has created a pull request for this issue:

[jira] [Updated] (SPARK-12334) Support read from multiple input paths for orc file in DataFrameReader.orc

2015-12-15 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-12334: --- Affects Version/s: 1.6.0 Target Version/s: 1.6.1 > Support read from multiple input paths for

[jira] [Created] (SPARK-12337) Implement dropDuplicates() method of DataFrame in SparkR

2015-12-15 Thread Sun Rui (JIRA)
Sun Rui created SPARK-12337: --- Summary: Implement dropDuplicates() method of DataFrame in SparkR Key: SPARK-12337 URL: https://issues.apache.org/jira/browse/SPARK-12337 Project: Spark Issue Type:

[jira] [Updated] (SPARK-12335) CentralMomentAgg should be nullable

2015-12-15 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-12335: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-12323 > CentralMomentAgg should be nullable >

[jira] [Updated] (SPARK-12336) Outer join using multiple columns results in wrong nullability

2015-12-15 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-12336: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-12323 > Outer join using multiple columns

[jira] [Resolved] (SPARK-11293) Spillable collections leak shuffle memory

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11293. --- Resolution: Fixed > Spillable collections leak shuffle memory >

[jira] [Commented] (SPARK-12330) Mesos coarse executor does not cleanup blockmgr properly on termination if data is stored on disk

2015-12-15 Thread Charles Allen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058819#comment-15058819 ] Charles Allen commented on SPARK-12330: --- Looks like the mesos coarse scheduler underwent a lot of

[jira] [Assigned] (SPARK-12345) Mesos cluster mode is broken

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12345: Assignee: Apache Spark (was: Iulian Dragos) > Mesos cluster mode is broken >

[jira] [Assigned] (SPARK-8745) Remove GenerateProjection

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8745: --- Assignee: Apache Spark (was: Davies Liu) > Remove GenerateProjection >

[jira] [Commented] (SPARK-12330) Mesos coarse executor does not cleanup blockmgr properly on termination if data is stored on disk

2015-12-15 Thread Charles Allen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058729#comment-15058729 ] Charles Allen commented on SPARK-12330: --- This is because the CoarseMesosSchedulerBackend does not

[jira] [Updated] (SPARK-11293) Spillable collections leak shuffle memory

2015-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11293: -- Target Version/s: 1.6.0 (was: 1.5.3, 1.6.0) Since the back-port for 1.5 was cancelled, I think this

[jira] [Commented] (SPARK-10915) Add support for UDAFs in Python

2015-12-15 Thread Tristan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058987#comment-15058987 ] Tristan commented on SPARK-10915: - Would the analogy to UDAF support in Python be lambdas, as mentioned

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2015-12-15 Thread Dean Wampler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058856#comment-15058856 ] Dean Wampler commented on SPARK-12177: -- Since the new Kafka 0.9 consumer API supports SSL, would

[jira] [Commented] (SPARK-12345) Mesos cluster mode is broken

2015-12-15 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15058873#comment-15058873 ] Iulian Dragos commented on SPARK-12345: --- [~skonto] pointed out this commit:

[jira] [Assigned] (SPARK-8745) Remove GenerateProjection

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8745: --- Assignee: Davies Liu (was: Apache Spark) > Remove GenerateProjection >

[jira] [Updated] (SPARK-12345) Mesos cluster mode is broken

2015-12-15 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iulian Dragos updated SPARK-12345: -- Description: The same setup worked in 1.5.2 but is now failing for 1.6.0-RC2. The driver is

[jira] [Created] (SPARK-12346) GLM summary crashes with NoSuchElementException if attributes are missing names

2015-12-15 Thread Eric Liang (JIRA)
Eric Liang created SPARK-12346: -- Summary: GLM summary crashes with NoSuchElementException if attributes are missing names Key: SPARK-12346 URL: https://issues.apache.org/jira/browse/SPARK-12346 Project:

[jira] [Assigned] (SPARK-8745) Remove GenerateProjection

2015-12-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-8745: - Assignee: Davies Liu > Remove GenerateProjection > - > >

[jira] [Updated] (SPARK-9690) Add random seed Param to PySpark CrossValidator

2015-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9690: - Target Version/s: 2.0.0 (was: ) > Add random seed Param to PySpark CrossValidator >

[jira] [Comment Edited] (SPARK-10915) Add support for UDAFs in Python

2015-12-15 Thread Justin Uang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15059055#comment-15059055 ] Justin Uang edited comment on SPARK-10915 at 12/15/15 11:07 PM: An

[jira] [Updated] (SPARK-12309) Use sqlContext from MLlibTestSparkContext for spark.ml test suites

2015-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12309: -- Shepherd: Joseph K. Bradley Assignee: Yanbo Liang Target

[jira] [Updated] (SPARK-12324) The documentation sidebar does not collapse properly

2015-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12324: -- Assignee: Timothy Hunter (was: Apache Spark) > The documentation sidebar does not

[jira] [Issue Comment Deleted] (SPARK-8471) Implement Discrete Cosine Transform feature transformer

2015-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8471: - Comment: was deleted (was: User 'feynmanliang' has created a pull request for this issue:

[jira] [Updated] (SPARK-12348) PySpark _inferSchema crashes with incorrect exception on an empty RDD

2015-12-15 Thread Hurshal Patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hurshal Patel updated SPARK-12348: -- Description: {code} >>> rdd = sc.emptyRDD() >>> df = sqlContext.createDataFrame(rdd) Traceback

[jira] [Commented] (SPARK-12304) Make Spark Streaming web UI display more friendly Receiver graphs

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15059136#comment-15059136 ] Apache Spark commented on SPARK-12304: -- User 'proflin' has created a pull request for this issue:

[jira] [Resolved] (SPARK-12271) Improve error message for Dataset.as[] when the schema is incompatible.

2015-12-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-12271. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10260

[jira] [Created] (SPARK-12347) Write script to run all MLlib examples for testing

2015-12-15 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-12347: - Summary: Write script to run all MLlib examples for testing Key: SPARK-12347 URL: https://issues.apache.org/jira/browse/SPARK-12347 Project: Spark

[jira] [Updated] (SPARK-12348) PySpark _inferSchema crashes with incorrect exception on an empty RDD

2015-12-15 Thread Hurshal Patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hurshal Patel updated SPARK-12348: -- Description: {code} >>> rdd = sc.emptyRDD() >>> df = sqlContext.createDataFrame(rdd) Traceback

[jira] [Assigned] (SPARK-12330) Mesos coarse executor does not cleanup blockmgr properly on termination if data is stored on disk

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12330: Assignee: Apache Spark > Mesos coarse executor does not cleanup blockmgr properly on

[jira] [Updated] (SPARK-12281) Fixed potential exceptions when exiting a local cluster.

2015-12-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-12281: - Fix Version/s: (was: 1.6.1) (was: 2.0.0) 1.6.0 >

[jira] [Updated] (SPARK-12267) Standalone master keeps references to disassociated workers until they sent no heartbeats

2015-12-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-12267: - Fix Version/s: (was: 1.6.1) (was: 2.0.0) 1.6.0 >

[jira] [Commented] (SPARK-12330) Mesos coarse executor does not cleanup blockmgr properly on termination if data is stored on disk

2015-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15059169#comment-15059169 ] Apache Spark commented on SPARK-12330: -- User 'drcrallen' has created a pull request for this issue:

[jira] [Commented] (SPARK-10915) Add support for UDAFs in Python

2015-12-15 Thread Justin Uang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15059055#comment-15059055 ] Justin Uang commented on SPARK-10915: - An abstract base class would be fine, or something like

[jira] [Created] (SPARK-12348) PySpark _inferSchema crashes with incorrect exception on an empty RDD

2015-12-15 Thread Hurshal Patel (JIRA)
Hurshal Patel created SPARK-12348: - Summary: PySpark _inferSchema crashes with incorrect exception on an empty RDD Key: SPARK-12348 URL: https://issues.apache.org/jira/browse/SPARK-12348 Project:

[jira] [Updated] (SPARK-12345) Mesos cluster mode is broken

2015-12-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-12345: -- Target Version/s: 1.6.1 (was: 1.6.0) > Mesos cluster mode is broken > >

[jira] [Commented] (SPARK-12348) PySpark _inferSchema crashes with incorrect exception on an empty RDD

2015-12-15 Thread Hurshal Patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15059119#comment-15059119 ] Hurshal Patel commented on SPARK-12348: --- whoops, i think this was intentional but there is still

[jira] [Updated] (SPARK-12349) Make spark.ml PCAModel load backwards compatible

2015-12-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12349: -- Priority: Major (was: Critical) > Make spark.ml PCAModel load backwards compatible >

[jira] [Created] (SPARK-12349) Make spark.ml PCAModel load backwards compatible

2015-12-15 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-12349: - Summary: Make spark.ml PCAModel load backwards compatible Key: SPARK-12349 URL: https://issues.apache.org/jira/browse/SPARK-12349 Project: Spark

[jira] [Created] (SPARK-12350) VectorAssembler#transform() initially throws an exception

2015-12-15 Thread Jakob Odersky (JIRA)
Jakob Odersky created SPARK-12350: - Summary: VectorAssembler#transform() initially throws an exception Key: SPARK-12350 URL: https://issues.apache.org/jira/browse/SPARK-12350 Project: Spark

[jira] [Resolved] (SPARK-12236) JDBC filter tests all pass if filters are not really pushed down

2015-12-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-12236. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10221
