[jira] [Resolved] (SPARK-13125) makes the ratio of KafkaRDD partition to kafka topic partition configurable.

2016-02-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13125. --- Resolution: Not A Problem [~zhengcanbin] don't reopen an issue unless the discussion has

[jira] [Issue Comment Deleted] (SPARK-13125) makes the ratio of KafkaRDD partition to kafka topic partition configurable.

2016-02-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13125: -- Comment: was deleted (was: Shuffle will increase net burden, and number of partitions is limited by

[jira] [Commented] (SPARK-13157) ADD JAR command cannot handle path with @ character

2016-02-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129994#comment-15129994 ] Davies Liu commented on SPARK-13157: This is introduced by

[jira] [Commented] (SPARK-13157) ADD JAR command cannot handle path with @ character

2016-02-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129995#comment-15129995 ] Davies Liu commented on SPARK-13157: Could be reproduced by {code} test("path with @") { val

[jira] [Issue Comment Deleted] (SPARK-13125) makes the ratio of KafkaRDD partition to kafka topic partition configurable.

2016-02-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13125: -- Comment: was deleted (was: Shuffle will increase net burden, and number of partitions is limited by

[jira] [Closed] (SPARK-13125) makes the ratio of KafkaRDD partition to kafka topic partition configurable.

2016-02-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-13125. - > makes the ratio of KafkaRDD partition to kafka topic partition configurable. >

[jira] [Resolved] (SPARK-13009) spark-streaming-twitter_2.10 does not make it possible to access the raw twitter json

2016-02-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13009. --- Resolution: Not A Problem > spark-streaming-twitter_2.10 does not make it possible to access the raw

[jira] [Commented] (SPARK-13158) Show the information of broadcast blocks in WebUI

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130071#comment-15130071 ] Apache Spark commented on SPARK-13158: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13158) Show the information of broadcast blocks in WebUI

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13158: Assignee: (was: Apache Spark) > Show the information of broadcast blocks in WebUI >

[jira] [Assigned] (SPARK-13158) Show the information of broadcast blocks in WebUI

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13158: Assignee: Apache Spark > Show the information of broadcast blocks in WebUI >

[jira] [Assigned] (SPARK-13139) Create native DDL commands

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13139: Assignee: Apache Spark > Create native DDL commands > -- > >

[jira] [Commented] (SPARK-13139) Create native DDL commands

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130174#comment-15130174 ] Apache Spark commented on SPARK-13139: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13139) Create native DDL commands

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13139: Assignee: (was: Apache Spark) > Create native DDL commands >

[jira] [Assigned] (SPARK-8321) Authorization Support(on all operations not only DDL) in Spark Sql

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8321: --- Assignee: (was: Apache Spark) > Authorization Support(on all operations not only DDL) in

[jira] [Assigned] (SPARK-8321) Authorization Support(on all operations not only DDL) in Spark Sql

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8321: --- Assignee: Apache Spark > Authorization Support(on all operations not only DDL) in Spark Sql

[jira] [Commented] (SPARK-8321) Authorization Support(on all operations not only DDL) in Spark Sql

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129990#comment-15129990 ] Apache Spark commented on SPARK-8321: - User 'winningsix' has created a pull request for this issue:

[jira] [Commented] (SPARK-12985) Spark Hive thrift server big decimal data issue

2016-02-03 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130013#comment-15130013 ] Adrian Wang commented on SPARK-12985: - I think this is a problem with Simba. JDBC never requires a

[jira] [Created] (SPARK-13158) Show the information of broadcast blocks in WebUI

2016-02-03 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-13158: Summary: Show the information of broadcast blocks in WebUI Key: SPARK-13158 URL: https://issues.apache.org/jira/browse/SPARK-13158 Project: Spark

[jira] [Assigned] (SPARK-13002) Mesos scheduler backend does not follow the property spark.dynamicAllocation.initialExecutors

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13002: Assignee: Apache Spark > Mesos scheduler backend does not follow the property >

[jira] [Commented] (SPARK-13002) Mesos scheduler backend does not follow the property spark.dynamicAllocation.initialExecutors

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130160#comment-15130160 ] Apache Spark commented on SPARK-13002: -- User 'skyluc' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13002) Mesos scheduler backend does not follow the property spark.dynamicAllocation.initialExecutors

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13002: Assignee: (was: Apache Spark) > Mesos scheduler backend does not follow the property

[jira] [Commented] (SPARK-13103) HashTF dosn't count TF correctly

2016-02-03 Thread Louis Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130187#comment-15130187 ] Louis Liu commented on SPARK-13103: --- I'm sorry, you are right. The negative numbers don't matter.

[jira] [Updated] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-02-03 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-12807: --- Target Version/s: 1.6.1 > Spark External Shuffle not working in Hadoop clusters with Jackson

[jira] [Commented] (SPARK-13157) ADD JAR command cannot handle path with @ character

2016-02-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130045#comment-15130045 ] Herman van Hovell commented on SPARK-13157: --- Hmmm... The lexer is swallowing @'s. The easiest

[jira] [Updated] (SPARK-13156) JDBC using multiple partitions creates additional tasks but only executes on one

2016-02-03 Thread Charles Drotar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Charles Drotar updated SPARK-13156: --- Description: I can successfully kick off a query through JDBC to Teradata, and when it runs

[jira] [Updated] (SPARK-13156) JDBC using multiple partitions creates additional tasks but only executes on one

2016-02-03 Thread Charles Drotar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Charles Drotar updated SPARK-13156: --- Description: I can successfully kick off a query through JDBC to Teradata, and when it runs

[jira] [Commented] (SPARK-1239) Don't fetch all map output statuses at each reducer during shuffles

2016-02-03 Thread Daniel Darabos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130351#comment-15130351 ] Daniel Darabos commented on SPARK-1239: --- I've read an interesting article about the "Kylix"

[jira] [Commented] (SPARK-13156) JDBC using multiple partitions creates additional tasks but only executes on one

2016-02-03 Thread Charles Drotar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130411#comment-15130411 ] Charles Drotar commented on SPARK-13156: Thanks Sean for the quick response! That was exactly

[jira] [Commented] (SPARK-13125) makes the ratio of KafkaRDD partition to kafka topic partition configurable.

2016-02-03 Thread zhengcanbin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130088#comment-15130088 ] zhengcanbin commented on SPARK-13125: - Sorry, I got it. It's my first time creating a JIRA; I don't

[jira] [Commented] (SPARK-13163) Column width on new History Server DataTables not getting set correctly

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131092#comment-15131092 ] Apache Spark commented on SPARK-13163: -- User 'ajbozarth' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13163) Column width on new History Server DataTables not getting set correctly

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13163: Assignee: (was: Apache Spark) > Column width on new History Server DataTables not

[jira] [Created] (SPARK-13164) Replace deprecated synchronizedBuffer in core

2016-02-03 Thread holdenk (JIRA)
holdenk created SPARK-13164: --- Summary: Replace deprecated synchronizedBuffer in core Key: SPARK-13164 URL: https://issues.apache.org/jira/browse/SPARK-13164 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13116: Assignee: Apache Spark > TungstenAggregate though it is supposedly capable of all

[jira] [Commented] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131102#comment-15131102 ] Apache Spark commented on SPARK-13116: -- User 'ahshahid' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13116: Assignee: (was: Apache Spark) > TungstenAggregate though it is supposedly capable of

[jira] [Commented] (SPARK-13046) Partitioning looks broken in 1.6

2016-02-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131170#comment-15131170 ] Davies Liu commented on SPARK-13046: I tried Spark 1.6 and master with a directory like this {code}

[jira] [Updated] (SPARK-13131) Use best time and average time in micro benchmark

2016-02-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13131: --- Summary: Use best time and average time in micro benchmark (was: Use median time in benchmark) >

[jira] [Created] (SPARK-13166) Remove DataStreamReader/Writer

2016-02-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-13166: --- Summary: Remove DataStreamReader/Writer Key: SPARK-13166 URL: https://issues.apache.org/jira/browse/SPARK-13166 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-13163) Column width on new History Server DataTables not getting set correctly

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13163: Assignee: Apache Spark > Column width on new History Server DataTables not getting set

[jira] [Assigned] (SPARK-13164) Replace deprecated synchronizedBuffer in core

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13164: Assignee: (was: Apache Spark) > Replace deprecated synchronizedBuffer in core >

[jira] [Commented] (SPARK-13164) Replace deprecated synchronizedBuffer in core

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131126#comment-15131126 ] Apache Spark commented on SPARK-13164: -- User 'holdenk' has created a pull request for this issue:

[jira] [Commented] (SPARK-11316) isEmpty before coalesce seems to cause huge performance issue in setupGroups

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131196#comment-15131196 ] Apache Spark commented on SPARK-11316: -- User 'zhuoliu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11316) isEmpty before coalesce seems to cause huge performance issue in setupGroups

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11316: Assignee: Apache Spark > isEmpty before coalesce seems to cause huge performance issue in

[jira] [Updated] (SPARK-13131) Use best time and average time in micro benchmark

2016-02-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13131: --- Description: Best time should be more stable than average time in benchmark, together with average

[jira] [Commented] (SPARK-13166) Remove DataStreamReader/Writer

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131252#comment-15131252 ] Apache Spark commented on SPARK-13166: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13166) Remove DataStreamReader/Writer

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13166: Assignee: Apache Spark (was: Reynold Xin) > Remove DataStreamReader/Writer >

[jira] [Assigned] (SPARK-13166) Remove DataStreamReader/Writer

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13166: Assignee: Reynold Xin (was: Apache Spark) > Remove DataStreamReader/Writer >

[jira] [Commented] (SPARK-9414) HiveContext:saveAsTable creates wrong partition for existing hive table(append mode)

2016-02-03 Thread Xiu (Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131054#comment-15131054 ] Xiu (Joe) Guo commented on SPARK-9414: -- With the current master

[jira] [Updated] (SPARK-13163) Column width on new History Server DataTables not getting set correctly

2016-02-03 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Bozarth updated SPARK-13163: - Attachment: width_long_name.png page_width_fixed.png I have a fix and will open

[jira] [Assigned] (SPARK-13164) Replace deprecated synchronizedBuffer in core

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13164: Assignee: Apache Spark > Replace deprecated synchronizedBuffer in core >

[jira] [Assigned] (SPARK-11316) isEmpty before coalesce seems to cause huge performance issue in setupGroups

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11316: Assignee: (was: Apache Spark) > isEmpty before coalesce seems to cause huge

[jira] [Created] (SPARK-13163) Column width on new History Server DataTables not getting set correctly

2016-02-03 Thread Alex Bozarth (JIRA)
Alex Bozarth created SPARK-13163: Summary: Column width on new History Server DataTables not getting set correctly Key: SPARK-13163 URL: https://issues.apache.org/jira/browse/SPARK-13163 Project:

[jira] [Commented] (SPARK-13151) Investigate replacing SynchronizedBuffer as it is deprecated/unreliable

2016-02-03 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131100#comment-15131100 ] holdenk commented on SPARK-13151: - This seems pretty reasonable; we already use ConcurrentLinkedQueue

[jira] [Resolved] (SPARK-13150) Flaky test: org.apache.spark.sql.hive.thriftserver.SingleSessionSuite.test single session

2016-02-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13150. - Resolution: Fixed Assignee: Herman van Hovell (was: Cheng Lian) > Flaky test:

[jira] [Commented] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows

2016-02-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131138#comment-15131138 ] Davies Liu commented on SPARK-13116: Could you provide a test to reproduce this issue? >

[jira] [Commented] (SPARK-13131) Use median time in benchmark

2016-02-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131205#comment-15131205 ] Davies Liu commented on SPARK-13131: [~piccolbo] Thanks for your comments, we also have lots of

[jira] [Created] (SPARK-13167) JDBC data source does not include null value partition columns rows in the result.

2016-02-03 Thread Suresh Thalamati (JIRA)
Suresh Thalamati created SPARK-13167: Summary: JDBC data source does not include null value partition columns rows in the result. Key: SPARK-13167 URL: https://issues.apache.org/jira/browse/SPARK-13167

[jira] [Resolved] (SPARK-13157) ADD JAR command cannot handle path with @ character

2016-02-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13157. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11052

[jira] [Created] (SPARK-13165) Replace deprecated synchronizedBuffer in streaming

2016-02-03 Thread holdenk (JIRA)
holdenk created SPARK-13165: --- Summary: Replace deprecated synchronizedBuffer in streaming Key: SPARK-13165 URL: https://issues.apache.org/jira/browse/SPARK-13165 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-12739) Details of batch in Streaming tab uses two Duration columns

2016-02-03 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-12739. -- Resolution: Fixed > Details of batch in Streaming tab uses two Duration columns >

[jira] [Commented] (SPARK-13116) TungstenAggregate though it is supposedly capable of all processing unsafe & safe rows, fails if the input is safe rows

2016-02-03 Thread Asif Hussain Shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131175#comment-15131175 ] Asif Hussain Shahid commented on SPARK-13116: - I will check if my tests encounter issue with

[jira] [Commented] (SPARK-13167) JDBC data source does not include null value partition columns rows in the result.

2016-02-03 Thread Suresh Thalamati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131258#comment-15131258 ] Suresh Thalamati commented on SPARK-13167: -- I am working on a fix for this issue. > JDBC data

[jira] [Commented] (SPARK-12982) SQLContext: temporary table registration does not accept valid identifier

2016-02-03 Thread Thomas Sebastian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130471#comment-15130471 ] Thomas Sebastian commented on SPARK-12982: -- Adding changes in the SQLContext.scala and testing

[jira] [Commented] (SPARK-13160) PySpark CDH 5

2016-02-03 Thread David Vega (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130572#comment-15130572 ] David Vega commented on SPARK-13160: I got to attach the files. > PySpark CDH 5 > - > >

[jira] [Commented] (SPARK-8688) Hadoop Configuration has to disable client cache when writing or reading delegation tokens.

2016-02-03 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130421#comment-15130421 ] Steve Loughran commented on SPARK-8688: --- Has anyone filed a bug against HDFS for this? > Hadoop

[jira] [Assigned] (SPARK-12725) SQL generation suffers from name conficts introduced by some analysis rules

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12725: Assignee: Xiao Li (was: Apache Spark) > SQL generation suffers from name conficts

[jira] [Commented] (SPARK-12982) SQLContext: temporary table registration does not accept valid identifier

2016-02-03 Thread Thomas Sebastian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130478#comment-15130478 ] Thomas Sebastian commented on SPARK-12982: -- One main difference for this with the description of

[jira] [Commented] (SPARK-12725) SQL generation suffers from name conficts introduced by some analysis rules

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130481#comment-15130481 ] Apache Spark commented on SPARK-12725: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12725) SQL generation suffers from name conficts introduced by some analysis rules

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12725: Assignee: Apache Spark (was: Xiao Li) > SQL generation suffers from name conficts

[jira] [Commented] (SPARK-12982) SQLContext: temporary table registration does not accept valid identifier

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130489#comment-15130489 ] Apache Spark commented on SPARK-12982: -- User 'jayadevanmurali' has created a pull request for this

[jira] [Closed] (SPARK-13159) External shuffle service broken w/ Mesos

2016-02-03 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iulian Dragos closed SPARK-13159. - Resolution: Duplicate > External shuffle service broken w/ Mesos >

[jira] [Updated] (SPARK-13160) PySpark CDH 5

2016-02-03 Thread David Vega (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Vega updated SPARK-13160: --- Attachment: workflow.xml wordcount.py job.properties > PySpark CDH 5

[jira] [Created] (SPARK-13159) External shuffle service broken w/ Mesos

2016-02-03 Thread Iulian Dragos (JIRA)
Iulian Dragos created SPARK-13159: - Summary: External shuffle service broken w/ Mesos Key: SPARK-13159 URL: https://issues.apache.org/jira/browse/SPARK-13159 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-12430) Temporary folders do not get deleted after Task completes causing problems with disk space.

2016-02-03 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15130554#comment-15130554 ] Iulian Dragos commented on SPARK-12430: --- I guess the thinking was that Mesos would clean up those

[jira] [Created] (SPARK-13160) PySpark CDH 5

2016-02-03 Thread David Vega (JIRA)
David Vega created SPARK-13160: -- Summary: PySpark CDH 5 Key: SPARK-13160 URL: https://issues.apache.org/jira/browse/SPARK-13160 Project: Spark Issue Type: Question Components: Deploy,

[jira] [Assigned] (SPARK-13167) JDBC data source does not include null value partition columns rows in the result.

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13167: Assignee: Apache Spark > JDBC data source does not include null value partition columns

[jira] [Assigned] (SPARK-13167) JDBC data source does not include null value partition columns rows in the result.

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13167: Assignee: (was: Apache Spark) > JDBC data source does not include null value

[jira] [Commented] (SPARK-13165) Replace deprecated synchronizedBuffer in streaming

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131383#comment-15131383 ] Apache Spark commented on SPARK-13165: -- User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13165) Replace deprecated synchronizedBuffer in streaming

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13165: Assignee: (was: Apache Spark) > Replace deprecated synchronizedBuffer in streaming >

[jira] [Assigned] (SPARK-13165) Replace deprecated synchronizedBuffer in streaming

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13165: Assignee: Apache Spark > Replace deprecated synchronizedBuffer in streaming >

[jira] [Resolved] (SPARK-6715) Eliminate duplicate filters from pushdown predicates

2016-02-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6715. --- Resolution: Won't Fix I believe that this has been addressed by

[jira] [Commented] (SPARK-7376) Python: Add validation functionality to individual Param

2016-02-03 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131401#comment-15131401 ] Seth Hendrickson commented on SPARK-7376: - I am seeing this Jira now after several related Jiras

[jira] [Created] (SPARK-13174) Add API and options for csv data sources

2016-02-03 Thread Davies Liu (JIRA)
Davies Liu created SPARK-13174: -- Summary: Add API and options for csv data sources Key: SPARK-13174 URL: https://issues.apache.org/jira/browse/SPARK-13174 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-13170) Investigate replacing SynchronizedQueue as it is deprecated

2016-02-03 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-13170: Issue Type: Sub-task (was: Improvement) Parent: SPARK-13175 > Investigate replacing

[jira] [Updated] (SPARK-13171) Update promise & future to Promise and Future as the old ones are deprecated

2016-02-03 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-13171: Issue Type: Sub-task (was: Improvement) Parent: SPARK-13175 > Update promise & future to Promise

[jira] [Commented] (SPARK-13178) RRDD faces with concurrency issue in case of rdd.zip(rdd).count()

2016-02-03 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131430#comment-15131430 ] Xusen Yin commented on SPARK-13178: --- Ping [~mengxr] [~shivaram] to know about the concurrency issue. I

[jira] [Updated] (SPARK-13178) RRDD faces with concurrency issue in case of rdd.zip(rdd).count()

2016-02-03 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-13178: -- Description: In Kmeans algorithm, there is a zip operation before taking samples, i.e.

[jira] [Commented] (SPARK-13178) RRDD faces with concurrency issue in case of rdd.zip(rdd).count()

2016-02-03 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131436#comment-15131436 ] Shivaram Venkataraman commented on SPARK-13178: --- Hmm this is tricky to debug -- A higher

[jira] [Commented] (SPARK-12720) SQL generation support for cube, rollup, and grouping set

2016-02-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131447#comment-15131447 ] Xiao Li commented on SPARK-12720: - CUBE(a, b, c) = GROUPING SETS((a,b,c), (a,b), (a,c), (b,c), (a), (b),

[jira] [Commented] (SPARK-13167) JDBC data source does not include null value partition columns rows in the result.

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131285#comment-15131285 ] Apache Spark commented on SPARK-13167: -- User 'sureshthalamati' has created a pull request for this

[jira] [Commented] (SPARK-13131) Use best time and average time in micro benchmark

2016-02-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131353#comment-15131353 ] Sean Owen commented on SPARK-13131: --- Isn't best time in fact the best estimator of what this benchmark

[jira] [Created] (SPARK-13173) Fail to load CSV file with NPE

2016-02-03 Thread Davies Liu (JIRA)
Davies Liu created SPARK-13173: -- Summary: Fail to load CSV file with NPE Key: SPARK-13173 URL: https://issues.apache.org/jira/browse/SPARK-13173 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-13046) Partitioning looks broken in 1.6

2016-02-03 Thread Julien Baley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131418#comment-15131418 ] Julien Baley commented on SPARK-13046: -- Hi Davies, I have no other file in the middle of the paths,

[jira] [Updated] (SPARK-13176) Ignore deprecation warning for ProcessBuilder lines_!

2016-02-03 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-13176: Issue Type: Sub-task (was: Improvement) Parent: SPARK-13175 > Ignore deprecation warning for

[jira] [Created] (SPARK-13176) Ignore deprecation warning for ProcessBuilder lines_!

2016-02-03 Thread holdenk (JIRA)
holdenk created SPARK-13176: --- Summary: Ignore deprecation warning for ProcessBuilder lines_! Key: SPARK-13176 URL: https://issues.apache.org/jira/browse/SPARK-13176 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12514) Spark MetricsSystem can fill disks/cause OOMs when using GangliaSink

2016-02-03 Thread Jonathan Kelly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131439#comment-15131439 ] Jonathan Kelly commented on SPARK-12514: As of Spark 1.6.0, there don't seem to be *any* Spark

[jira] [Commented] (SPARK-13178) RRDD faces with concurrency issue in case of rdd.zip(rdd).count()

2016-02-03 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131455#comment-15131455 ] Xusen Yin commented on SPARK-13178: --- I don't zip RRDD with itself. Actually, the bug exists when I

[jira] [Created] (SPARK-13168) Collapse adjacent Repartition operations

2016-02-03 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-13168: -- Summary: Collapse adjacent Repartition operations Key: SPARK-13168 URL: https://issues.apache.org/jira/browse/SPARK-13168 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-13095) improve performance of hash join with dimension table

2016-02-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131352#comment-15131352 ] Apache Spark commented on SPARK-13095: -- User 'davies' has created a pull request for this issue:

[jira] [Updated] (SPARK-13149) Add FileStreamSource

2016-02-03 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-13149: - Summary: Add FileStreamSource (was: Add FileStreamSource and a simple version of
