[jira] [Closed] (SPARK-9584) HiveHBaseTableInputFormat can'be cached

2015-08-21 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] meiyoula closed SPARK-9584. --- Resolution: Duplicate HiveHBaseTableInputFormat can'be cached ---

[jira] [Commented] (SPARK-4454) Race condition in DAGScheduler

2015-08-21 Thread Andy Sloane (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707798#comment-14707798 ] Andy Sloane commented on SPARK-4454: We have been manually applying this hotfix to

[jira] [Created] (SPARK-10169) Evaluating AggregateFunction1 (old code path) may return wrong answers when grouping expressions are used as arguments of aggregate functions

2015-08-21 Thread Yin Huai (JIRA)
Yin Huai created SPARK-10169: Summary: Evaluating AggregateFunction1 (old code path) may return wrong answers when grouping expressions are used as arguments of aggregate functions Key: SPARK-10169 URL:

[jira] [Commented] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-08-21 Thread Nick Xie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706524#comment-14706524 ] Nick Xie commented on SPARK-3655: - I need a sessionize example whereby all records are

[jira] [Updated] (SPARK-10155) Memory leak in SQL parsers

2015-08-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-10155: - Description: I saw a lot of `ThreadLocal` objects in the following app: {code} import

[jira] [Updated] (SPARK-10088) Support stored as avro HiveQL construct

2015-08-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10088: -- Assignee: Marcelo Vanzin Support stored as avro HiveQL construct

[jira] [Updated] (SPARK-9400) Implement code generation for StringLocate

2015-08-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9400: - Assignee: Davies Liu Implement code generation for StringLocate

[jira] [Resolved] (SPARK-9439) ExternalShuffleService should be robust to NodeManager restarts in yarn

2015-08-21 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-9439. -- Resolution: Fixed Assignee: Imran Rashid Fix Version/s: 1.6.0

[jira] [Assigned] (SPARK-9708) Spark should create local temporary directories in Mesos sandbox when launched with Mesos

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9708: --- Assignee: Apache Spark Spark should create local temporary directories in Mesos sandbox

[jira] [Commented] (SPARK-9708) Spark should create local temporary directories in Mesos sandbox when launched with Mesos

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706799#comment-14706799 ] Apache Spark commented on SPARK-9708: - User 'Zariel' has created a pull request for

[jira] [Assigned] (SPARK-9708) Spark should create local temporary directories in Mesos sandbox when launched with Mesos

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9708: --- Assignee: (was: Apache Spark) Spark should create local temporary directories in Mesos

[jira] [Commented] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-08-21 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706722#comment-14706722 ] Manoj Kumar commented on SPARK-6192: [~rxin] It gets over in a few hours from now. I

[jira] [Assigned] (SPARK-10155) Memory leak in SQL parsers

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10155: Assignee: (was: Apache Spark) Memory leak in SQL parsers --

[jira] [Assigned] (SPARK-10155) Memory leak in SQL parsers

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10155: Assignee: Apache Spark Memory leak in SQL parsers --

[jira] [Commented] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-08-21 Thread Koert Kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706817#comment-14706817 ] Koert Kuipers commented on SPARK-3655: -- hey nick, i believe your problem sounds like

[jira] [Commented] (SPARK-10155) Memory leak in SQL parsers

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706790#comment-14706790 ] Apache Spark commented on SPARK-10155: -- User 'zsxwing' has created a pull request

[jira] [Updated] (SPARK-9864) Replace `@since` JavaDoc tag by `@Since` annotation in MLlib

2015-08-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9864: - Shepherd: Xiangrui Meng Replace `@since` JavaDoc tag by `@Since` annotation in MLlib

[jira] [Resolved] (SPARK-10112) ValueError: Can only zip with RDD which has the same number of partitions on one machine but not on another

2015-08-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10112. --- Resolution: Cannot Reproduce I can't reproduce this on Ubuntu or OS X, with the latest Spark master.

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706875#comment-14706875 ] Ryan Blue commented on SPARK-10143: --- [~yhuai], you're right that the input format now

[jira] [Commented] (SPARK-10157) Add ability to specify s3 bootstrap script to spark-ec2

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706957#comment-14706957 ] Apache Spark commented on SPARK-10157: -- User 'mdagost' has created a pull request

[jira] [Assigned] (SPARK-10157) Add ability to specify s3 bootstrap script to spark-ec2

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10157: Assignee: (was: Apache Spark) Add ability to specify s3 bootstrap script to

[jira] [Updated] (SPARK-9864) Replace `@since` JavaDoc tag by `@Since` annotation in MLlib

2015-08-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9864: - Summary: Replace `@since` JavaDoc tag by `@Since` annotation in MLlib (was: Replace `@since`

[jira] [Created] (SPARK-10157) Add ability to specify s3 bootstrap script to spark-ec2

2015-08-21 Thread Michelangelo D'Agostino (JIRA)
Michelangelo D'Agostino created SPARK-10157: --- Summary: Add ability to specify s3 bootstrap script to spark-ec2 Key: SPARK-10157 URL: https://issues.apache.org/jira/browse/SPARK-10157

[jira] [Commented] (SPARK-1153) Generalize VertexId in GraphX so that UUIDs can be used as vertex IDs.

2015-08-21 Thread JJ Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14706877#comment-14706877 ] JJ Zhang commented on SPARK-1153: - We would also really like a general customized ID

[jira] [Assigned] (SPARK-10157) Add ability to specify s3 bootstrap script to spark-ec2

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10157: Assignee: Apache Spark Add ability to specify s3 bootstrap script to spark-ec2

[jira] [Updated] (SPARK-10136) Parquet support fail to decode Avro/Thrift arrays of primitive array (e.g. arrayarrayint)

2015-08-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10136: --- Summary: Parquet support fail to decode Avro/Thrift arrays of primitive array (e.g. arrayarrayint)

[jira] [Created] (SPARK-10160) Support Spark shell over Mesos Cluster Mode

2015-08-21 Thread Timothy Chen (JIRA)
Timothy Chen created SPARK-10160: Summary: Support Spark shell over Mesos Cluster Mode Key: SPARK-10160 URL: https://issues.apache.org/jira/browse/SPARK-10160 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4366) Aggregation Improvement

2015-08-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-4366: --- Target Version/s: 1.6.0 Aggregation Improvement --- Key:

[jira] [Commented] (SPARK-10145) Executor exit without useful messages when spark runs in spark-streaming

2015-08-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707406#comment-14707406 ] Tathagata Das commented on SPARK-10145: --- [~hshreedharan] Could you take a look at

[jira] [Assigned] (SPARK-10121) Custom Class added through Spark SQL's add jar command may not be in the class loader used by metadataHive

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10121: Assignee: (was: Apache Spark) Custom Class added through Spark SQL's add jar command

[jira] [Commented] (SPARK-10121) Custom Class added through Spark SQL's add jar command may not be in the class loader used by metadataHive

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707452#comment-14707452 ] Apache Spark commented on SPARK-10121: -- User 'yhuai' has created a pull request for

[jira] [Resolved] (SPARK-10130) type coercion for IF should have children resolved first

2015-08-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-10130. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8331

[jira] [Updated] (SPARK-9933) Test the new receiver scheduling

2015-08-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-9933: - Assignee: Shixiong Zhu Test the new receiver scheduling

[jira] [Resolved] (SPARK-10122) AttributeError: 'RDD' object has no attribute 'offsetRanges'

2015-08-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10122. --- Resolution: Fixed Assignee: Saisai Shao Fix Version/s: 1.5.1

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707414#comment-14707414 ] Ryan Blue commented on SPARK-10143: --- [~yhuai], yes, you'd want to determine the number

[jira] [Commented] (SPARK-8400) ml.ALS doesn't handle -1 block size

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707421#comment-14707421 ] Apache Spark commented on SPARK-8400: - User 'BryanCutler' has created a pull request

[jira] [Reopened] (SPARK-5836) Highlight in Spark documentation that by default Spark does not delete its temporary files

2015-08-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reopened SPARK-5836: -- Highlight in Spark documentation that by default Spark does not delete its temporary files

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707363#comment-14707363 ] Yin Huai commented on SPARK-10143: -- Yeah, the setting is not the real row group size and

[jira] [Resolved] (SPARK-5836) Highlight in Spark documentation that by default Spark does not delete its temporary files

2015-08-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-5836. -- Resolution: Not A Problem Highlight in Spark documentation that by default Spark does not

[jira] [Assigned] (SPARK-10121) Custom Class added through Spark SQL's add jar command may not be in the class loader used by metadataHive

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10121: Assignee: Apache Spark Custom Class added through Spark SQL's add jar command may not be

[jira] [Commented] (SPARK-8400) ml.ALS doesn't handle -1 block size

2015-08-21 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707409#comment-14707409 ] Bryan Cutler commented on SPARK-8400: - No problem! It does LocalIndexEncoder once

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707112#comment-14707112 ] Yin Huai commented on SPARK-10143: -- I did a test yesterday that scans a table with 1824

[jira] [Commented] (SPARK-9662) ML 1.5 QA: API: Python API coverage

2015-08-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707186#comment-14707186 ] Joseph K. Bradley commented on SPARK-9662: -- OK thanks! I'll mark this as

[jira] [Resolved] (SPARK-9662) ML 1.5 QA: API: Python API coverage

2015-08-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-9662. -- Resolution: Done Fix Version/s: 1.5.0 ML 1.5 QA: API: Python API coverage

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-08-21 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707120#comment-14707120 ] Shivaram Venkataraman commented on SPARK-9325: -- Yeah that sounds good

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-08-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707136#comment-14707136 ] Felix Cheung commented on SPARK-9325: - I'll take a shot Support `collect` on

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707148#comment-14707148 ] Ryan Blue commented on SPARK-10143: --- [~yhuai] if you do that, you will get the current

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707194#comment-14707194 ] Yin Huai commented on SPARK-10143: -- oh, I meant the current value for the configuration

[jira] [Resolved] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-08-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-6192. -- Resolution: Done Fix Version/s: 1.5.0 Enhance MLlib's Python API (GSoC 2015)

[jira] [Assigned] (SPARK-9317) Change `show` to print DataFrame entries

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9317: --- Assignee: (was: Apache Spark) Change `show` to print DataFrame entries

[jira] [Assigned] (SPARK-9317) Change `show` to print DataFrame entries

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9317: --- Assignee: Apache Spark Change `show` to print DataFrame entries

[jira] [Commented] (SPARK-9317) Change `show` to print DataFrame entries

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707062#comment-14707062 ] Apache Spark commented on SPARK-9317: - User 'felixcheung' has created a pull request

[jira] [Commented] (SPARK-10136) Parquet support fail to decode Avro/Thrift arrays of primitive array (e.g. arrayarrayint)

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707137#comment-14707137 ] Apache Spark commented on SPARK-10136: -- User 'liancheng' has created a pull request

[jira] [Commented] (SPARK-9741) approx count distinct function

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707168#comment-14707168 ] Apache Spark commented on SPARK-9741: - User 'hvanhovell' has created a pull request

[jira] [Assigned] (SPARK-9741) approx count distinct function

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9741: --- Assignee: (was: Apache Spark) approx count distinct function

[jira] [Assigned] (SPARK-9741) approx count distinct function

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9741: --- Assignee: Apache Spark approx count distinct function --

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707222#comment-14707222 ] Ryan Blue commented on SPARK-10143: --- I think you're going to end up assuming every row

[jira] [Commented] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-08-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707233#comment-14707233 ] Joseph K. Bradley commented on SPARK-6192: -- I'll mark it resolved. Thanks again

[jira] [Created] (SPARK-10159) Hive 1.3.x GenericUDFDate NPE issue

2015-08-21 Thread Alex Liu (JIRA)
Alex Liu created SPARK-10159: Summary: Hive 1.3.x GenericUDFDate NPE issue Key: SPARK-10159 URL: https://issues.apache.org/jira/browse/SPARK-10159 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-9319) Add support for setting column names, types

2015-08-21 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707263#comment-14707263 ] Hossein Falaki commented on SPARK-9319: --- I have a PR half ready. But I got

[jira] [Updated] (SPARK-10159) Hive 1.3.x GenericUDFDate NPE issue

2015-08-21 Thread Alex Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Liu updated SPARK-10159: - Description: When run sql query with HiveContext, Hive 1.3.x GenericUDFDate NPE issue. The following is

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707067#comment-14707067 ] Yin Huai commented on SPARK-10143: -- [~rdblue] Thank you for the detailed info! One thing

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-08-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707075#comment-14707075 ] Felix Cheung commented on SPARK-9325: - So we should add collect and head to Column?

[jira] [Commented] (SPARK-9319) Add support for setting column names, types

2015-08-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707077#comment-14707077 ] Felix Cheung commented on SPARK-9319: - [~falaki]Would you be working on this? Add

[jira] [Created] (SPARK-10158) ALS should print better errors when given Long IDs

2015-08-21 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10158: - Summary: ALS should print better errors when given Long IDs Key: SPARK-10158 URL: https://issues.apache.org/jira/browse/SPARK-10158 Project: Spark

[jira] [Updated] (SPARK-8012) ArrayIndexOutOfBoundsException in SerializationDebugger

2015-08-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8012: --- Fix Version/s: 1.4.1 1.5.0 ArrayIndexOutOfBoundsException in

[jira] [Commented] (SPARK-5836) Highlight in Spark documentation that by default Spark does not delete its temporary files

2015-08-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707482#comment-14707482 ] Tathagata Das commented on SPARK-5836: -- For everyone who come across this JIRA, the

[jira] [Assigned] (SPARK-10163) Allow single-category features for GBT models

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10163: Assignee: Joseph K. Bradley (was: Apache Spark) Allow single-category features for GBT

[jira] [Commented] (SPARK-10163) Allow single-category features for GBT models

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707574#comment-14707574 ] Apache Spark commented on SPARK-10163: -- User 'jkbradley' has created a pull request

[jira] [Assigned] (SPARK-10163) Allow single-category features for GBT models

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10163: Assignee: Apache Spark (was: Joseph K. Bradley) Allow single-category features for GBT

[jira] [Commented] (SPARK-10121) Custom Class added through Spark SQL's add jar command may not be in the class loader used by metadataHive

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707573#comment-14707573 ] Apache Spark commented on SPARK-10121: -- User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707500#comment-14707500 ] Yin Huai commented on SPARK-10143: -- Just a note about this change. If the parallelism is

[jira] [Comment Edited] (SPARK-10162) PySpark filters with datetimes mess up when datetimes have timezones.

2015-08-21 Thread Kevin Cox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707526#comment-14707526 ] Kevin Cox edited comment on SPARK-10162 at 8/21/15 10:05 PM: -

[jira] [Comment Edited] (SPARK-10162) PySpark filters with datetimes mess up when datetimes have timezones.

2015-08-21 Thread Kevin Cox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707526#comment-14707526 ] Kevin Cox edited comment on SPARK-10162 at 8/21/15 10:05 PM: -

[jira] [Commented] (SPARK-10162) PySpark filters with datetimes mess up when datetimes have timezones.

2015-08-21 Thread Kevin Cox (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707526#comment-14707526 ] Kevin Cox commented on SPARK-10162: --- This is probably because the filter argument is

[jira] [Assigned] (SPARK-10142) Python checkpoint recovery does not work with non-local file path

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10142: Assignee: Apache Spark (was: Tathagata Das) Python checkpoint recovery does not work

[jira] [Created] (SPARK-10162) PySpark filters with datetimes mess up when datetimes have timezones.

2015-08-21 Thread Kevin Cox (JIRA)
Kevin Cox created SPARK-10162: - Summary: PySpark filters with datetimes mess up when datetimes have timezones. Key: SPARK-10162 URL: https://issues.apache.org/jira/browse/SPARK-10162 Project: Spark

[jira] [Commented] (SPARK-10061) User guide for ml ensembles (random forest and GBT)

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707578#comment-14707578 ] Apache Spark commented on SPARK-10061: -- User 'jkbradley' has created a pull request

[jira] [Resolved] (SPARK-9864) Replace `@since` JavaDoc tag by `@Since` annotation in MLlib

2015-08-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-9864. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8352

[jira] [Assigned] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai reassigned SPARK-10143: Assignee: Yin Huai Parquet changed the behavior of calculating splits

[jira] [Resolved] (SPARK-10143) Parquet changed the behavior of calculating splits

2015-08-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10143. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8346

[jira] [Updated] (SPARK-10161) Support Pyspark shell over Mesos Cluster Mode

2015-08-21 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Chen updated SPARK-10161: - Component/s: (was: Spark Shell) PySpark Support Pyspark shell over Mesos

[jira] [Created] (SPARK-10161) Support Pyspark shell over Mesos Cluster Mode

2015-08-21 Thread Timothy Chen (JIRA)
Timothy Chen created SPARK-10161: Summary: Support Pyspark shell over Mesos Cluster Mode Key: SPARK-10161 URL: https://issues.apache.org/jira/browse/SPARK-10161 Project: Spark Issue Type:

[jira] [Updated] (SPARK-10161) Support Pyspark shell over Mesos Cluster Mode

2015-08-21 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Chen updated SPARK-10161: - Description: It's not possible to run Pyspark shell with cluster mode since the shell that is

[jira] [Commented] (SPARK-10142) Python checkpoint recovery does not work with non-local file path

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707553#comment-14707553 ] Apache Spark commented on SPARK-10142: -- User 'tdas' has created a pull request for

[jira] [Assigned] (SPARK-10142) Python checkpoint recovery does not work with non-local file path

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10142: Assignee: Tathagata Das (was: Apache Spark) Python checkpoint recovery does not work

[jira] [Created] (SPARK-10163) Allow single-category features for GBT models

2015-08-21 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10163: - Summary: Allow single-category features for GBT models Key: SPARK-10163 URL: https://issues.apache.org/jira/browse/SPARK-10163 Project: Spark

[jira] [Comment Edited] (SPARK-10118) Improve SparkR API docs for 1.5 release

2015-08-21 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707631#comment-14707631 ] Yu Ishikawa edited comment on SPARK-10118 at 8/21/15 11:22 PM:

[jira] [Resolved] (SPARK-9893) User guide for VectorSlicer

2015-08-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-9893. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8267

[jira] [Assigned] (SPARK-10165) Nested Hive UDF resolution fails in Analyzer

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10165: Assignee: Apache Spark (was: Michael Armbrust) Nested Hive UDF resolution fails in

[jira] [Created] (SPARK-10164) GMM bug: match error

2015-08-21 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10164: - Summary: GMM bug: match error Key: SPARK-10164 URL: https://issues.apache.org/jira/browse/SPARK-10164 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-10164) GMM bug: match error

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707641#comment-14707641 ] Apache Spark commented on SPARK-10164: -- User 'jkbradley' has created a pull request

[jira] [Assigned] (SPARK-10164) GMM bug: match error

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10164: Assignee: Joseph K. Bradley (was: Apache Spark) GMM bug: match error

[jira] [Assigned] (SPARK-10164) GMM bug: match error

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10164: Assignee: Apache Spark (was: Joseph K. Bradley) GMM bug: match error

[jira] [Assigned] (SPARK-10165) Nested Hive UDF resolution fails in Analyzer

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10165: Assignee: Michael Armbrust (was: Apache Spark) Nested Hive UDF resolution fails in

[jira] [Commented] (SPARK-10165) Nested Hive UDF resolution fails in Analyzer

2015-08-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14707645#comment-14707645 ] Apache Spark commented on SPARK-10165: -- User 'marmbrus' has created a pull request

[jira] [Created] (SPARK-10168) Streaming assembly jars doesn't publish correctly

2015-08-21 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-10168: Summary: Streaming assembly jars doesn't publish correctly Key: SPARK-10168 URL: https://issues.apache.org/jira/browse/SPARK-10168 Project: Spark Issue

[jira] [Updated] (SPARK-10168) Streaming assembly jars doesn't publish correctly

2015-08-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-10168: - Target Version/s: 1.5.0 Fix Version/s: (was: 1.5.0) Streaming assembly jars doesn't

[jira] [Updated] (SPARK-10168) Streaming assembly jars doesn't publish correctly

2015-08-21 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-10168: -- Assignee: Shixiong Zhu Streaming assembly jars doesn't publish correctly

[jira] [Created] (SPARK-10165) Nested Hive UDF resolution fails in Analyzer

2015-08-21 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-10165: Summary: Nested Hive UDF resolution fails in Analyzer Key: SPARK-10165 URL: https://issues.apache.org/jira/browse/SPARK-10165 Project: Spark Issue

  1   2   >