[jira] [Updated] (SPARK-18537) Add a REST api to spark streaming

2016-11-21 Thread Peter Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Chan updated SPARK-18537: --- Description: Trying to monitor our streaming application using the Spark REST interface and found out

[jira] [Commented] (SPARK-18532) Code generation memory issue

2016-11-21 Thread Georg Heiler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685920#comment-15685920 ] Georg Heiler commented on SPARK-18532: -- Please find a minimal example here:

[jira] [Commented] (SPARK-18507) Major performance regression in SHOW PARTITIONS on partitioned Hive tables

2016-11-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685899#comment-15685899 ] Wenchen Fan commented on SPARK-18507: - Can you provide the table metadata? e.g. how many partition

[jira] [Comment Edited] (SPARK-18403) ObjectHashAggregateSuite is being flaky (occasional OOM errors)

2016-11-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684659#comment-15684659 ] Cheng Lian edited comment on SPARK-18403 at 11/22/16 6:54 AM: -- Here is a

[jira] [Commented] (SPARK-18403) ObjectHashAggregateSuite is being flaky (occasional OOM errors)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685870#comment-15685870 ] Apache Spark commented on SPARK-18403: -- User 'liancheng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18538) Concurrent Fetching DataFrameReader JDBC APIs Do Not Work

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18538: Assignee: Apache Spark (was: Xiao Li) > Concurrent Fetching DataFrameReader JDBC APIs Do

[jira] [Commented] (SPARK-18538) Concurrent Fetching DataFrameReader JDBC APIs Do Not Work

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685803#comment-15685803 ] Apache Spark commented on SPARK-18538: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18538) Concurrent Fetching DataFrameReader JDBC APIs Do Not Work

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18538: Assignee: Xiao Li (was: Apache Spark) > Concurrent Fetching DataFrameReader JDBC APIs Do

[jira] [Created] (SPARK-18538) Concurrent Fetching DataFrameReader JDBC APIs Do Not Work

2016-11-21 Thread Xiao Li (JIRA)
Xiao Li created SPARK-18538: --- Summary: Concurrent Fetching DataFrameReader JDBC APIs Do Not Work Key: SPARK-18538 URL: https://issues.apache.org/jira/browse/SPARK-18538 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685755#comment-15685755 ] Saleem Ansari commented on SPARK-18531: --- [~yuhaoyan] Thanks for your suggestion to increase Java

[jira] [Resolved] (SPARK-18425) Test `CompactibleFileStreamLog` directly

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18425. -- Resolution: Fixed Assignee: Liwei Lin Fix Version/s: 2.1.0 > Test

[jira] [Assigned] (SPARK-18537) Add a REST api to spark streaming

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18537: Assignee: Apache Spark > Add a REST api to spark streaming >

[jira] [Assigned] (SPARK-18537) Add a REST api to spark streaming

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18537: Assignee: (was: Apache Spark) > Add a REST api to spark streaming >

[jira] [Commented] (SPARK-18537) Add a REST api to spark streaming

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685707#comment-15685707 ] Apache Spark commented on SPARK-18537: -- User 'ChorPangChan' has created a pull request for this

[jira] [Created] (SPARK-18537) Add a REST api to spark streaming

2016-11-21 Thread Peter Chan (JIRA)
Peter Chan created SPARK-18537: -- Summary: Add a REST api to spark streaming Key: SPARK-18537 URL: https://issues.apache.org/jira/browse/SPARK-18537 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-18536) Failed to save to hive table when case class with empty field

2016-11-21 Thread pin_zhang (JIRA)
pin_zhang created SPARK-18536: - Summary: Failed to save to hive table when case class with empty field Key: SPARK-18536 URL: https://issues.apache.org/jira/browse/SPARK-18536 Project: Spark

[jira] [Closed] (SPARK-17398) Failed to query on external JSon Partitioned table

2016-11-21 Thread pin_zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pin_zhang closed SPARK-17398. - Resolution: Fixed Fix Version/s: 2.0.1 > Failed to query on external JSon Partitioned table >

[jira] [Commented] (SPARK-18181) Huge managed memory leak (2.7G) when running reduceByKey

2016-11-21 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685568#comment-15685568 ] DjvuLee commented on SPARK-18181: - [~barrybecker4] can you reproduce this on the spark2.x version? >

[jira] [Commented] (SPARK-18528) limit + groupBy leads to java.lang.NullPointerException

2016-11-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685497#comment-15685497 ] Takeshi Yamamuro commented on SPARK-18528: -- I reproduced this in master; {code} scala> val df =

[jira] [Commented] (SPARK-18504) Scalar subquery with extra group by columns returning incorrect result

2016-11-21 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685461#comment-15685461 ] Nattavut Sutyanyong commented on SPARK-18504: - Revise the reproduction script to be more

[jira] [Comment Edited] (SPARK-18504) Scalar subquery with extra group by columns returning incorrect result

2016-11-21 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685461#comment-15685461 ] Nattavut Sutyanyong edited comment on SPARK-18504 at 11/22/16 2:32 AM:

[jira] [Commented] (SPARK-18528) limit + groupBy leads to java.lang.NullPointerException

2016-11-21 Thread DjvuLee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685451#comment-15685451 ] DjvuLee commented on SPARK-18528: - I just tested your example, and it works. >>>

[jira] [Commented] (SPARK-18403) ObjectHashAggregateSuite is being flaky (occasional OOM errors)

2016-11-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685389#comment-15685389 ] Cheng Lian commented on SPARK-18403: Figured it out. It's caused by a false sharing issue inside

[jira] [Comment Edited] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685387#comment-15685387 ] yuhao yang edited comment on SPARK-18531 at 11/22/16 1:53 AM: -- I don't think

[jira] [Commented] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685387#comment-15685387 ] yuhao yang commented on SPARK-18531: I don't think it's infinite recursive invocation, but perhaps

[jira] [Commented] (SPARK-18319) ML, Graph 2.1 QA: API: Experimental, DeveloperApi, final, sealed audit

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685345#comment-15685345 ] Apache Spark commented on SPARK-18319: -- User 'hhbyyh' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18319) ML, Graph 2.1 QA: API: Experimental, DeveloperApi, final, sealed audit

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18319: Assignee: yuhao yang (was: Apache Spark) > ML, Graph 2.1 QA: API: Experimental,

[jira] [Assigned] (SPARK-18319) ML, Graph 2.1 QA: API: Experimental, DeveloperApi, final, sealed audit

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18319: Assignee: Apache Spark (was: yuhao yang) > ML, Graph 2.1 QA: API: Experimental,

[jira] [Resolved] (SPARK-18493) Add withWatermark and checkpoint to python dataframe

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18493. -- Resolution: Fixed Assignee: Burak Yavuz Fix Version/s: 2.1.0 > Add

[jira] [Comment Edited] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685225#comment-15685225 ] Mark Grover edited comment on SPARK-18535 at 11/22/16 12:36 AM: I just

[jira] [Resolved] (SPARK-18282) Add model summaries for Python GMM and BisectingKMeans

2016-11-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18282. - Resolution: Fixed Fix Version/s: 2.1.0 > Add model summaries for Python GMM and

[jira] [Commented] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685225#comment-15685225 ] Mark Grover commented on SPARK-18535: - I just issued a PR for this, that adds a new customizable

[jira] [Updated] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Grover updated SPARK-18535: Attachment: redacted.png > Redact sensitive information from Spark logs and UI >

[jira] [Commented] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685222#comment-15685222 ] Apache Spark commented on SPARK-18535: -- User 'markgrover' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18535: Assignee: Apache Spark > Redact sensitive information from Spark logs and UI >

[jira] [Assigned] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18535: Assignee: (was: Apache Spark) > Redact sensitive information from Spark logs and UI >

[jira] [Commented] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic

2016-11-21 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685190#comment-15685190 ] Cody Koeninger commented on SPARK-18506: I'd try to isolate aws vs gce as a possible cause before

[jira] [Created] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Mark Grover (JIRA)
Mark Grover created SPARK-18535: --- Summary: Redact sensitive information from Spark logs and UI Key: SPARK-18535 URL: https://issues.apache.org/jira/browse/SPARK-18535 Project: Spark Issue

[jira] [Assigned] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18134: Assignee: Apache Spark > SQL: MapType in Group BY and Joins not working >

[jira] [Commented] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685164#comment-15685164 ] Apache Spark commented on SPARK-18134: -- User 'hvanhovell' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18134: Assignee: (was: Apache Spark) > SQL: MapType in Group BY and Joins not working >

[jira] [Created] (SPARK-18534) Datasets Aggregation with Maps

2016-11-21 Thread Anton Okolnychyi (JIRA)
Anton Okolnychyi created SPARK-18534: Summary: Datasets Aggregation with Maps Key: SPARK-18534 URL: https://issues.apache.org/jira/browse/SPARK-18534 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18530: Assignee: Shixiong Zhu (was: Apache Spark) > Kafka timestamp should be TimestampType >

[jira] [Commented] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685040#comment-15685040 ] Apache Spark commented on SPARK-18530: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18530: Assignee: Apache Spark (was: Shixiong Zhu) > Kafka timestamp should be TimestampType >

[jira] [Assigned] (SPARK-18533) Raise correct error upon specification of schema for datasource tables created through CTAS

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18533: Assignee: Apache Spark > Raise correct error upon specification of schema for datasource

[jira] [Commented] (SPARK-18533) Raise correct error upon specification of schema for datasource tables created through CTAS

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15685018#comment-15685018 ] Apache Spark commented on SPARK-18533: -- User 'dilipbiswal' has created a pull request for this

[jira] [Assigned] (SPARK-18533) Raise correct error upon specification of schema for datasource tables created through CTAS

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18533: Assignee: (was: Apache Spark) > Raise correct error upon specification of schema for

[jira] [Created] (SPARK-18533) Raise correct error upon specification of schema for datasource tables created through CTAS

2016-11-21 Thread Dilip Biswal (JIRA)
Dilip Biswal created SPARK-18533: Summary: Raise correct error upon specification of schema for datasource tables created through CTAS Key: SPARK-18533 URL: https://issues.apache.org/jira/browse/SPARK-18533

[jira] [Commented] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic

2016-11-21 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684995#comment-15684995 ] Heji Kim commented on SPARK-18506: -- Just confirming that when I use ConsumerStrategy.Assign with all

[jira] [Commented] (SPARK-18403) ObjectHashAggregateSuite is being flaky (occasional OOM errors)

2016-11-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684928#comment-15684928 ] Herman van Hovell commented on SPARK-18403: --- The 5a5a5a5a5a5a means that the page has been

[jira] [Commented] (SPARK-18532) Code generation memory issue

2016-11-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684908#comment-15684908 ] Herman van Hovell commented on SPARK-18532: --- The code generated by whole stage code generation

[jira] [Commented] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684863#comment-15684863 ] Shixiong Zhu commented on SPARK-18512: -- Did you enable speculation? > FileNotFoundException on

[jira] [Commented] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic

2016-11-21 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684844#comment-15684844 ] Heji Kim commented on SPARK-18506: -- Firstly thank you Cody for the quick response. Our intention was not

[jira] [Created] (SPARK-18532) Code generation memory issue

2016-11-21 Thread Georg Heiler (JIRA)
Georg Heiler created SPARK-18532: Summary: Code generation memory issue Key: SPARK-18532 URL: https://issues.apache.org/jira/browse/SPARK-18532 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-9487) Use the same num. worker threads in Scala/Python unit tests

2016-11-21 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684758#comment-15684758 ] Saikat Kanjilal commented on SPARK-9487: [~srowen] following up, thoughts on how to proceed on

[jira] [Assigned] (SPARK-18073) Migrate wiki to spark.apache.org web site

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18073: Assignee: Sean Owen (was: Apache Spark) > Migrate wiki to spark.apache.org web site >

[jira] [Resolved] (SPARK-18517) DROP TABLE IF EXISTS should not warn for non-existing tables

2016-11-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-18517. --- Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.1.0 Target

[jira] [Commented] (SPARK-18073) Migrate wiki to spark.apache.org web site

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684748#comment-15684748 ] Apache Spark commented on SPARK-18073: -- User 'srowen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18073) Migrate wiki to spark.apache.org web site

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18073: Assignee: Apache Spark (was: Sean Owen) > Migrate wiki to spark.apache.org web site >

[jira] [Assigned] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reassigned SPARK-18530: Assignee: Shixiong Zhu > Kafka timestamp should be TimestampType >

[jira] [Resolved] (SPARK-18361) Expose RDD localCheckpoint in PySpark

2016-11-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-18361. --- Resolution: Fixed Fix Version/s: 2.1.0 Target Version/s: 2.1.0 > Expose RDD

[jira] [Updated] (SPARK-18361) Expose RDD localCheckpoint in PySpark

2016-11-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-18361: -- Assignee: Gabriel Huang > Expose RDD localCheckpoint in PySpark >

[jira] [Commented] (SPARK-18529) Timeouts shouldn't be AssertionErrors

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684703#comment-15684703 ] Shixiong Zhu commented on SPARK-18529: -- This will be fixed in

[jira] [Commented] (SPARK-17850) HadoopRDD should not swallow EOFException

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684699#comment-15684699 ] Shixiong Zhu commented on SPARK-17850: -- [~mgrover] you're right. This is only in 2.1. Removed 2.0.2.

[jira] [Updated] (SPARK-17850) HadoopRDD should not swallow EOFException

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17850: - Fix Version/s: (was: 2.0.2) > HadoopRDD should not swallow EOFException >

[jira] [Updated] (SPARK-17850) HadoopRDD should not swallow EOFException

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17850: - Target Version/s: 2.1.0 (was: 2.0.2, 2.1.0) > HadoopRDD should not swallow EOFException >

[jira] [Commented] (SPARK-17850) HadoopRDD should not swallow EOFException

2016-11-21 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684685#comment-15684685 ] Mark Grover commented on SPARK-17850: - Hi [~zsxwing] and [~srowen], the JIRA fix version seems to

[jira] [Commented] (SPARK-18403) ObjectHashAggregateSuite is being flaky (occasional OOM errors)

2016-11-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684659#comment-15684659 ] Cheng Lian commented on SPARK-18403: Here is a minimal test case (add it to

[jira] [Commented] (SPARK-18413) Add a property to control the number of partitions when save a jdbc rdd

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684631#comment-15684631 ] Apache Spark commented on SPARK-18413: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Updated] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated SPARK-18531: -- Description: More details can be found here:

[jira] [Created] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
Saleem Ansari created SPARK-18531: - Summary: Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError Key: SPARK-18531 URL: https://issues.apache.org/jira/browse/SPARK-18531

[jira] [Updated] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated SPARK-18531: -- Description: More details can be found here:

[jira] [Updated] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated SPARK-18531: -- Description: More details can be found here:

[jira] [Updated] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated SPARK-18531: -- Description: More details can be found here:

[jira] [Comment Edited] (SPARK-18515) AlterTableDropPartitions fails for non-string columns

2016-11-21 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684512#comment-15684512 ] Dongjoon Hyun edited comment on SPARK-18515 at 11/21/16 7:38 PM: - This is

[jira] [Commented] (SPARK-18515) AlterTableDropPartitions fails for non-string columns

2016-11-21 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684512#comment-15684512 ] Dongjoon Hyun commented on SPARK-18515: --- This is tightly related with `AlterTableAddPartitions`. We

[jira] [Resolved] (SPARK-17765) org.apache.spark.mllib.linalg.VectorUDT cannot be cast to org.apache.spark.sql.types.StructType

2016-11-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17765. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.1.0 >

[jira] [Resolved] (SPARK-15513) Bzip2Factory in Hadoop 2.7.1 is not thread safe

2016-11-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15513. -- Resolution: Won't Fix > Bzip2Factory in Hadoop 2.7.1 is not thread safe >

[jira] [Commented] (SPARK-15513) Bzip2Factory in Hadoop 2.7.1 is not thread safe

2016-11-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684302#comment-15684302 ] Yin Huai commented on SPARK-15513: -- I am closing this jira since the fix has been released with 2.7.2.

[jira] [Comment Edited] (SPARK-15513) Bzip2Factory in Hadoop 2.7.1 is not thread safe

2016-11-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684302#comment-15684302 ] Yin Huai edited comment on SPARK-15513 at 11/21/16 6:17 PM: I am closing this

[jira] [Commented] (SPARK-16532) Provide a REST API for submitting and tracking status of jobs

2016-11-21 Thread Dan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15684232#comment-15684232 ] Dan commented on SPARK-16532: - Is there any update on this? Is the existing API supported and can be relied

[jira] [Created] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-18530: Summary: Kafka timestamp should be TimestampType Key: SPARK-18530 URL: https://issues.apache.org/jira/browse/SPARK-18530 Project: Spark Issue Type:

[jira] [Updated] (SPARK-18339) Don't push down current_timestamp for filters in StructuredStreaming

2016-11-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18339: - Priority: Critical (was: Major) > Don't push down current_timestamp for filters in

[jira] [Updated] (SPARK-18513) Record and recover watermark

2016-11-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18513: - Priority: Blocker (was: Major) > Record and recover watermark >

[jira] [Updated] (SPARK-18513) Record and recover watermark

2016-11-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18513: - Target Version/s: 2.1.0 > Record and recover watermark > >

[jira] [Created] (SPARK-18529) Timeouts shouldn't be AssertionErrors

2016-11-21 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-18529: Summary: Timeouts shouldn't be AssertionErrors Key: SPARK-18529 URL: https://issues.apache.org/jira/browse/SPARK-18529 Project: Spark Issue Type:

[jira] [Created] (SPARK-18528) limit + groupBy leads to java.lang.NullPointerException

2016-11-21 Thread Corey (JIRA)
Corey created SPARK-18528: - Summary: limit + groupBy leads to java.lang.NullPointerException Key: SPARK-18528 URL: https://issues.apache.org/jira/browse/SPARK-18528 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-12978) Skip unnecessary final group-by when input data already clustered with group-by keys

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12978: Assignee: Takeshi Yamamuro (was: Apache Spark) > Skip unnecessary final group-by when

[jira] [Assigned] (SPARK-12978) Skip unnecessary final group-by when input data already clustered with group-by keys

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12978: Assignee: Apache Spark (was: Takeshi Yamamuro) > Skip unnecessary final group-by when

[jira] [Commented] (SPARK-14222) Cross-publish jackson-module-scala for Scala 2.12

2016-11-21 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15683998#comment-15683998 ] Steve Loughran commented on SPARK-14222: Hadoop 2.9 just went to Jackson 2.7.8; latest update that

[jira] [Created] (SPARK-18527) UDAFPercentile (bigint, array) needs explicity cast to double

2016-11-21 Thread Fabian Boehnlein (JIRA)
Fabian Boehnlein created SPARK-18527: Summary: UDAFPercentile (bigint, array) needs explicity cast to double Key: SPARK-18527 URL: https://issues.apache.org/jira/browse/SPARK-18527 Project: Spark

[jira] [Comment Edited] (SPARK-18455) General support for subquery processing

2016-11-21 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15668577#comment-15668577 ] Nattavut Sutyanyong edited comment on SPARK-18455 at 11/21/16 3:02 PM:

[jira] [Commented] (SPARK-18455) General support for subquery processing

2016-11-21 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15683798#comment-15683798 ] Nattavut Sutyanyong commented on SPARK-18455: - Incorrect results problem > General support

[jira] [Issue Comment Deleted] (SPARK-18455) General support for subquery processing

2016-11-21 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nattavut Sutyanyong updated SPARK-18455: Comment: was deleted (was: Incorrect results problem) > General support for

[jira] [Commented] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15683700#comment-15683700 ] Apache Spark commented on SPARK-18356: -- User 'ZakariaHili' has created a pull request for this

[jira] [Assigned] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18356: Assignee: Apache Spark > Issue + Resolution: Kmeans Spark Performances (ML package) >

[jira] [Commented] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15683669#comment-15683669 ] Apache Spark commented on SPARK-18356: -- User 'ZakariaHili' has created a pull request for this

[jira] [Assigned] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18356: Assignee: (was: Apache Spark) > Issue + Resolution: Kmeans Spark Performances (ML

[jira] [Commented] (SPARK-18471) In treeAggregate, generate (big) zeros instead of sending them.

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15683636#comment-15683636 ] Apache Spark commented on SPARK-18471: -- User 'AnthonyTruchet' has created a pull request for this
