[jira] [Commented] (SPARK-10437) Support aggregation expressions in Order By

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730542#comment-14730542 ] Apache Spark commented on SPARK-10437: -- User 'viirya' has created a pull request for this issue:

[jira] [Updated] (SPARK-10445) Extend maven version range (enforcer)

2015-09-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10445: -- Priority: Minor (was: Major) > Extend maven version range (enforcer) >

[jira] [Assigned] (SPARK-10446) Support to specify join type when calling join with usingColumns

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10446: Assignee: Apache Spark > Support to specify join type when calling join with usingColumns

[jira] [Resolved] (SPARK-10445) Extend maven version range (enforcer)

2015-09-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10445. --- Resolution: Won't Fix See https://issues.apache.org/jira/browse/SPARK-9521 -- we need 3.3+ but the

[jira] [Created] (SPARK-10446) Support to specify join type when calling join with usingColumns

2015-09-04 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-10446: --- Summary: Support to specify join type when calling join with usingColumns Key: SPARK-10446 URL: https://issues.apache.org/jira/browse/SPARK-10446 Project:

[jira] [Commented] (SPARK-10442) select cast('false' as boolean) returns true

2015-09-04 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730657#comment-14730657 ] Cheng Lian commented on SPARK-10442: The reason is that all non-empty strings are converted to

[jira] [Updated] (SPARK-10310) [Spark SQL] All result records will be popluated into ONE line during the script transform due to missing the correct line/filed delimeter

2015-09-04 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10310: --- Description: There is real case using python stream script in Spark SQL query. We found that all

[jira] [Commented] (SPARK-10446) Support to specify join type when calling join with usingColumns

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730600#comment-14730600 ] Apache Spark commented on SPARK-10446: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10446) Support to specify join type when calling join with usingColumns

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10446: Assignee: (was: Apache Spark) > Support to specify join type when calling join with

[jira] [Commented] (SPARK-9235) PYSPARK_DRIVER_PYTHON env variable is not set on the YARN Node manager acting as driver in yarn-cluster mode

2015-09-04 Thread Aaron Glahe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730680#comment-14730680 ] Aaron Glahe commented on SPARK-9235: You set it in the spark-env.sh, e.g, since we use condo as our

[jira] [Assigned] (SPARK-10437) Support aggregation expressions in Order By

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10437: Assignee: (was: Apache Spark) > Support aggregation expressions in Order By >

[jira] [Assigned] (SPARK-10437) Support aggregation expressions in Order By

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10437: Assignee: Apache Spark > Support aggregation expressions in Order By >

[jira] [Updated] (SPARK-10298) PySpark can't JSON serialize a DataFrame with DecimalType columns.

2015-09-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10298: -- Assignee: Michael Armbrust > PySpark can't JSON serialize a DataFrame with DecimalType columns. >

[jira] [Updated] (SPARK-10159) Hive 1.3.x GenericUDFDate NPE issue

2015-09-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10159: -- Assignee: Michael Armbrust > Hive 1.3.x GenericUDFDate NPE issue > ---

[jira] [Created] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-04 Thread Justin Uang (JIRA)
Justin Uang created SPARK-10447: --- Summary: Upgrade pyspark to use py4j 0.9 Key: SPARK-10447 URL: https://issues.apache.org/jira/browse/SPARK-10447 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4940) Support more evenly distributing cores for Mesos mode

2015-09-04 Thread Martin Tapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730911#comment-14730911 ] Martin Tapp commented on SPARK-4940: My principal use case is to cram as much as possible on the same

[jira] [Commented] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14730990#comment-14730990 ] Sean Owen commented on SPARK-10447: --- I bet there are some upsides to updating, but the question is: do

[jira] [Created] (SPARK-10448) Parquet schema merging should NOT merge UDT

2015-09-04 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-10448: -- Summary: Parquet schema merging should NOT merge UDT Key: SPARK-10448 URL: https://issues.apache.org/jira/browse/SPARK-10448 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-10450) Minor SQL style, format, typo, readability fixes

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10450: Assignee: Andrew Or (was: Apache Spark) > Minor SQL style, format, typo, readability

[jira] [Commented] (SPARK-10450) Minor SQL style, format, typo, readability fixes

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731211#comment-14731211 ] Apache Spark commented on SPARK-10450: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10451) Prevent unnecessary serializations in InMemoryColumnarTableScan

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10451: Assignee: Apache Spark > Prevent unnecessary serializations in InMemoryColumnarTableScan

[jira] [Commented] (SPARK-10451) Prevent unnecessary serializations in InMemoryColumnarTableScan

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731283#comment-14731283 ] Apache Spark commented on SPARK-10451: -- User 'saucam' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10451) Prevent unnecessary serializations in InMemoryColumnarTableScan

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10451: Assignee: (was: Apache Spark) > Prevent unnecessary serializations in

[jira] [Commented] (SPARK-8951) support CJK characters in collect()

2015-09-04 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731151#comment-14731151 ] Shivaram Venkataraman commented on SPARK-8951: -- Ah I should have retested this before merging

[jira] [Created] (SPARK-10449) StructType.merge shouldn't merge DecimalTypes with different precisions and/or scales

2015-09-04 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-10449: -- Summary: StructType.merge shouldn't merge DecimalTypes with different precisions and/or scales Key: SPARK-10449 URL: https://issues.apache.org/jira/browse/SPARK-10449

[jira] [Commented] (SPARK-9666) ML 1.5 QA: model save/load audit

2015-09-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731299#comment-14731299 ] Joseph K. Bradley commented on SPARK-9666: -- Thanks for checking. Shall I mark this complete? >

[jira] [Commented] (SPARK-8951) support CJK characters in collect()

2015-09-04 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731155#comment-14731155 ] Shivaram Venkataraman commented on SPARK-8951: -- Sent

[jira] [Created] (SPARK-10450) Minor SQL style, format, typo, readability fixes

2015-09-04 Thread Andrew Or (JIRA)
Andrew Or created SPARK-10450: - Summary: Minor SQL style, format, typo, readability fixes Key: SPARK-10450 URL: https://issues.apache.org/jira/browse/SPARK-10450 Project: Spark Issue Type:

[jira] [Created] (SPARK-10451) Prevent unnecessary serializations in InMemoryColumnarTableScan

2015-09-04 Thread Yash Datta (JIRA)
Yash Datta created SPARK-10451: -- Summary: Prevent unnecessary serializations in InMemoryColumnarTableScan Key: SPARK-10451 URL: https://issues.apache.org/jira/browse/SPARK-10451 Project: Spark

[jira] [Commented] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-04 Thread Justin Uang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731164#comment-14731164 ] Justin Uang commented on SPARK-10447: - Agreed, I'm pretty sure that this will break some APIs and

[jira] [Assigned] (SPARK-10450) Minor SQL style, format, typo, readability fixes

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10450: Assignee: Apache Spark (was: Andrew Or) > Minor SQL style, format, typo, readability

[jira] [Created] (SPARK-10452) Pyspark worker security issue

2015-09-04 Thread Michael Procopio (JIRA)
Michael Procopio created SPARK-10452: Summary: Pyspark worker security issue Key: SPARK-10452 URL: https://issues.apache.org/jira/browse/SPARK-10452 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-10453) There's now way to use spark.dynmicAllocation.enabled with pyspark

2015-09-04 Thread Michael Procopio (JIRA)
Michael Procopio created SPARK-10453: Summary: There's now way to use spark.dynmicAllocation.enabled with pyspark Key: SPARK-10453 URL: https://issues.apache.org/jira/browse/SPARK-10453 Project:

[jira] [Resolved] (SPARK-10453) There's now way to use spark.dynmicAllocation.enabled with pyspark

2015-09-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-10453. Resolution: Not A Problem >From http://spark.apache.org/docs/latest/running-on-yarn.html:

[jira] [Commented] (SPARK-10454) Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late fetch failures don't cause multiple concurrent attempts for the same map stage

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731380#comment-14731380 ] Apache Spark commented on SPARK-10454: -- User 'robbinspg' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10454) Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late fetch failures don't cause multiple concurrent attempts for the same map stage

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10454: Assignee: (was: Apache Spark) > Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late

[jira] [Assigned] (SPARK-10454) Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late fetch failures don't cause multiple concurrent attempts for the same map stage

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10454: Assignee: Apache Spark > Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late fetch

[jira] [Updated] (SPARK-10456) upgrade java 7 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp updated SPARK-10456: Description: our java 7 installation is really old (from last september). update this to the

[jira] [Resolved] (SPARK-10452) Pyspark worker security issue

2015-09-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-10452. Resolution: Not A Problem If you need your workers to run as you user, you need to

[jira] [Commented] (SPARK-10454) Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late fetch failures don't cause multiple concurrent attempts for the same map stage

2015-09-04 Thread Pete Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731377#comment-14731377 ] Pete Robbins commented on SPARK-10454: -- This is another case of not waiting for events to drain form

[jira] [Created] (SPARK-10455) install java 8 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
shane knapp created SPARK-10455: --- Summary: install java 8 on amplab jenkins workers Key: SPARK-10455 URL: https://issues.apache.org/jira/browse/SPARK-10455 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-10456) upgrade java 7 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731436#comment-14731436 ] shane knapp commented on SPARK-10456: - looks like we'll be installing 7u79 (we're at 7u51 currently).

[jira] [Commented] (SPARK-9963) ML RandomForest cleanup: replace predictNodeIndex with predictImpl

2015-09-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731462#comment-14731462 ] Joseph K. Bradley commented on SPARK-9963: -- Sorry for the slow response! (I've been traveling.)

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-09-04 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731415#comment-14731415 ] Imran Rashid commented on SPARK-4105: - [~mvherweg] Do you know if the error occurred after there was

[jira] [Created] (SPARK-10456) upgrade java 7 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
shane knapp created SPARK-10456: --- Summary: upgrade java 7 on amplab jenkins workers Key: SPARK-10456 URL: https://issues.apache.org/jira/browse/SPARK-10456 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-10456) upgrade java 7 on amplab jenkins workers

2015-09-04 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10456: --- Assignee: shane knapp > upgrade java 7 on amplab jenkins workers >

[jira] [Updated] (SPARK-10455) install java 8 on amplab jenkins workers

2015-09-04 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10455: --- Assignee: shane knapp > install java 8 on amplab jenkins workers >

[jira] [Commented] (SPARK-10455) install java 8 on amplab jenkins workers

2015-09-04 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731440#comment-14731440 ] Josh Rosen commented on SPARK-10455: Yep, I think we want the 64-bit version. > install java 8 on

[jira] [Created] (SPARK-10454) Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late fetch failures don't cause multiple concurrent attempts for the same map stage

2015-09-04 Thread Pete Robbins (JIRA)
Pete Robbins created SPARK-10454: Summary: Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late fetch failures don't cause multiple concurrent attempts for the same map stage Key: SPARK-10454 URL:

[jira] [Commented] (SPARK-10439) Catalyst should check for overflow / underflow of date and timestamp values

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731392#comment-14731392 ] Apache Spark commented on SPARK-10439: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10439) Catalyst should check for overflow / underflow of date and timestamp values

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10439: Assignee: (was: Apache Spark) > Catalyst should check for overflow / underflow of

[jira] [Assigned] (SPARK-10439) Catalyst should check for overflow / underflow of date and timestamp values

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10439: Assignee: Apache Spark > Catalyst should check for overflow / underflow of date and

[jira] [Commented] (SPARK-10433) Gradient boosted trees

2015-09-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731398#comment-14731398 ] Joseph K. Bradley commented on SPARK-10433: --- Has this been reported on 1.5? I've seen reports

[jira] [Commented] (SPARK-10455) install java 8 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731428#comment-14731428 ] shane knapp commented on SPARK-10455: - looks like i'll be installing java 8u60. > install java 8 on

[jira] [Commented] (SPARK-9963) ML RandomForest cleanup: replace predictNodeIndex with predictImpl

2015-09-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731468#comment-14731468 ] Joseph K. Bradley commented on SPARK-9963: -- Yep, that first case in the if-else is for the

[jira] [Commented] (SPARK-10414) DenseMatrix gives different hashcode even though equals returns true

2015-09-04 Thread Vinod KC (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731757#comment-14731757 ] Vinod KC commented on SPARK-10414: -- Thanks Got the JIRA id

[jira] [Commented] (SPARK-9961) ML prediction abstractions should have defaultEvaluator fields

2015-09-04 Thread George Dittmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731810#comment-14731810 ] George Dittmar commented on SPARK-9961: --- Can you expand on what you mean by Evaluator? Just looking

[jira] [Comment Edited] (SPARK-9961) ML prediction abstractions should have defaultEvaluator fields

2015-09-04 Thread George Dittmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731810#comment-14731810 ] George Dittmar edited comment on SPARK-9961 at 9/5/15 5:23 AM: --- Can you

[jira] [Created] (SPARK-10459) PythonUDF could process UnsafeRow

2015-09-04 Thread Davies Liu (JIRA)
Davies Liu created SPARK-10459: -- Summary: PythonUDF could process UnsafeRow Key: SPARK-10459 URL: https://issues.apache.org/jira/browse/SPARK-10459 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-8632) Poor Python UDF performance because of RDD caching

2015-09-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-8632: - Assignee: Davies Liu > Poor Python UDF performance because of RDD caching >

[jira] [Commented] (SPARK-8951) support CJK characters in collect()

2015-09-04 Thread Jihong MA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731140#comment-14731140 ] Jihong MA commented on SPARK-8951: -- This commit cause R style check failure.

[jira] [Commented] (SPARK-9925) Set SQLConf.SHUFFLE_PARTITIONS.key correctly for tests

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731170#comment-14731170 ] Apache Spark commented on SPARK-9925: - User 'davies' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-10452) Pyspark worker security issue

2015-09-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731332#comment-14731332 ] Marcelo Vanzin edited comment on SPARK-10452 at 9/4/15 9:43 PM: If you

[jira] [Resolved] (SPARK-10455) install java 8 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp resolved SPARK-10455. - Resolution: Done > install java 8 on amplab jenkins workers >

[jira] [Closed] (SPARK-10455) install java 8 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp closed SPARK-10455. --- FIN! > install java 8 on amplab jenkins workers > > >

[jira] [Created] (SPARK-10457) Unable to connect to MySQL with the DataFrame API

2015-09-04 Thread Mariano Simone (JIRA)
Mariano Simone created SPARK-10457: -- Summary: Unable to connect to MySQL with the DataFrame API Key: SPARK-10457 URL: https://issues.apache.org/jira/browse/SPARK-10457 Project: Spark Issue

[jira] [Updated] (SPARK-10457) Unable to connect to MySQL with the DataFrame API

2015-09-04 Thread Mariano Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mariano Simone updated SPARK-10457: --- Description: I'm getting this error everytime I try to create a dataframe using jdbc:

[jira] [Closed] (SPARK-10457) Unable to connect to MySQL with the DataFrame API

2015-09-04 Thread Mariano Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mariano Simone closed SPARK-10457. -- Resolution: Fixed Found the solution. spark.executor.extraClassPath needed configuration. >

[jira] [Updated] (SPARK-10311) In cluster mode, AppId and AttemptId should be update when ApplicationMaster is new

2015-09-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-10311: -- Affects Version/s: 1.5.0 1.4.1 > In cluster mode, AppId and AttemptId

[jira] [Updated] (SPARK-10311) In cluster mode, AppId and AttemptId should be update when ApplicationMaster is new

2015-09-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-10311: -- Target Version/s: 1.6.0, 1.5.1 > In cluster mode, AppId and AttemptId should be update when

[jira] [Commented] (SPARK-10433) Gradient boosted trees

2015-09-04 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731533#comment-14731533 ] DB Tsai commented on SPARK-10433: - [~sowen] I can confirm that this should be fixed in 1.5 > Gradient

[jira] [Updated] (SPARK-10420) Implementing Reactive Streams based Spark Streaming Receiver

2015-09-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-10420: -- Target Version/s: 1.6.0 (was: ) > Implementing Reactive Streams based Spark Streaming

[jira] [Commented] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-04 Thread Justin Uang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731592#comment-14731592 ] Justin Uang commented on SPARK-10447: - Sure, I wouldn't mind doing the code review. Can you add me?

[jira] [Commented] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-04 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731591#comment-14731591 ] holdenk commented on SPARK-10447: - I can give this a shot if no one else is interested in doing this

[jira] [Commented] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-04 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731597#comment-14731597 ] holdenk commented on SPARK-10447: - Sure, I'll ping you when I've got the PR ready (probably sometime this

[jira] [Commented] (SPARK-10447) Upgrade pyspark to use py4j 0.9

2015-09-04 Thread Justin Uang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731598#comment-14731598 ] Justin Uang commented on SPARK-10447: - Sound good > Upgrade pyspark to use py4j 0.9 >

[jira] [Commented] (SPARK-10397) Make Python's SparkContext self-descriptive on "print sc"

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731618#comment-14731618 ] Apache Spark commented on SPARK-10397: -- User 'alexrovner' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10397) Make Python's SparkContext self-descriptive on "print sc"

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10397: Assignee: (was: Apache Spark) > Make Python's SparkContext self-descriptive on "print

[jira] [Assigned] (SPARK-10397) Make Python's SparkContext self-descriptive on "print sc"

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10397: Assignee: Apache Spark > Make Python's SparkContext self-descriptive on "print sc" >

[jira] [Commented] (SPARK-10397) Make Python's SparkContext self-descriptive on "print sc"

2015-09-04 Thread Alex Rovner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731619#comment-14731619 ] Alex Rovner commented on SPARK-10397: - Pull: https://github.com/apache/spark/pull/8608 {noformat}

[jira] [Created] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-04 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-10458: -- Summary: Would like to know if a given Spark Context is stopped or currently stopping Key: SPARK-10458 URL: https://issues.apache.org/jira/browse/SPARK-10458 Project:

[jira] [Resolved] (SPARK-10402) Add scaladoc for default values of params in ML

2015-09-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-10402. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue

[jira] [Resolved] (SPARK-9925) Set SQLConf.SHUFFLE_PARTITIONS.key correctly for tests

2015-09-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-9925. -- Resolution: Fixed Fix Version/s: 1.6.0 > Set SQLConf.SHUFFLE_PARTITIONS.key correctly for tests

[jira] [Commented] (SPARK-10414) DenseMatrix gives different hashcode even though equals returns true

2015-09-04 Thread Vinod KC (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731745#comment-14731745 ] Vinod KC commented on SPARK-10414: -- [~josephkb] Could you please share me that existing JIRA id to

[jira] [Commented] (SPARK-7257) Find nearest neighbor satisfying predicate

2015-09-04 Thread Luvsandondov Lkhamsuren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731744#comment-14731744 ] Luvsandondov Lkhamsuren commented on SPARK-7257: This sounds very interesting! If I

[jira] [Commented] (SPARK-10199) Avoid using reflections for parquet model save

2015-09-04 Thread Vinod KC (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731753#comment-14731753 ] Vinod KC commented on SPARK-10199: -- [~mengxr] Thanks for the suggestion. Shall I close the PR? > Avoid

[jira] [Updated] (SPARK-10402) Add scaladoc for default values of params in ML

2015-09-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10402: -- Shepherd: Joseph K. Bradley Assignee: holdenk Target Version/s:

[jira] [Commented] (SPARK-10456) upgrade java 7 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731480#comment-14731480 ] shane knapp commented on SPARK-10456: - ok, 79 is installed but i will wait until downtime to switch

[jira] [Commented] (SPARK-10455) install java 8 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731478#comment-14731478 ] shane knapp commented on SPARK-10455: - it's installed in: /usr/java/jdk1.8.0_60 i'll email the dev@

[jira] [Resolved] (SPARK-10450) Minor SQL style, format, typo, readability fixes

2015-09-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10450. --- Resolution: Fixed Fix Version/s: 1.6.0 > Minor SQL style, format, typo, readability fixes >

[jira] [Updated] (SPARK-10304) Partition discovery does not throw an exception if the dir structure is invalid

2015-09-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-10304: -- Target Version/s: 1.6.0, 1.5.1 > Partition discovery does not throw an exception if the dir

[jira] [Updated] (SPARK-10304) Partition discovery does not throw an exception if the dir structure is invalid

2015-09-04 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-10304: -- Target Version/s: 1.6.0, 1.5.1 (was: 1.5.1,1.6.0) > Partition discovery does not throw an

[jira] [Commented] (SPARK-10013) Remove Java assert from Java unit tests

2015-09-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731585#comment-14731585 ] Apache Spark commented on SPARK-10013: -- User 'holdenk' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-10456) upgrade java 7 on amplab jenkins workers

2015-09-04 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731436#comment-14731436 ] shane knapp edited comment on SPARK-10456 at 9/4/15 9:46 PM: - looks like

[jira] [Updated] (SPARK-10176) Show partially analyzed plan when checkAnswer df fails to resolve

2015-09-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10176: -- Target Version/s: 1.6.0 (was: 1.5.0) > Show partially analyzed plan when checkAnswer df fails to

[jira] [Updated] (SPARK-10176) Show partially analyzed plan when checkAnswer df fails to resolve

2015-09-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10176: -- Fix Version/s: (was: 1.5.0) 1.6.0 > Show partially analyzed plan when

[jira] [Resolved] (SPARK-10176) Show partially analyzed plan when checkAnswer df fails to resolve

2015-09-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10176. --- Resolution: Fixed Fix Version/s: 1.5.0 > Show partially analyzed plan when checkAnswer df

[jira] [Commented] (SPARK-10199) Avoid using reflections for parquet model save

2015-09-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14731529#comment-14731529 ] Xiangrui Meng commented on SPARK-10199: --- The improvement numbers also depends on the model size. In

[jira] [Resolved] (SPARK-9669) Support PySpark with Mesos Cluster mode

2015-09-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-9669. -- Resolution: Fixed Fix Version/s: 1.6.0 > Support PySpark with Mesos Cluster mode >

[jira] [Resolved] (SPARK-10454) Flaky test: o.a.s.scheduler.DAGSchedulerSuite.late fetch failures don't cause multiple concurrent attempts for the same map stage

2015-09-04 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10454. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Target Version/s:

  1   2   >