[jira] [Commented] (SPARK-18009) Spark 2.0.1 SQL Thrift Error

2016-10-25 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606693#comment-15606693 ] Xiao Li commented on SPARK-18009: - [~dkbiswal] Please fix it tonight. Thanks! > Spark 2.

[jira] [Created] (SPARK-18104) Don't build KafkaSource doc

2016-10-25 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18104: Summary: Don't build KafkaSource doc Key: SPARK-18104 URL: https://issues.apache.org/jira/browse/SPARK-18104 Project: Spark Issue Type: Documentation

[jira] [Assigned] (SPARK-18104) Don't build KafkaSource doc

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18104: Assignee: Apache Spark (was: Shixiong Zhu) > Don't build KafkaSource doc > --

[jira] [Commented] (SPARK-18104) Don't build KafkaSource doc

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606795#comment-15606795 ] Apache Spark commented on SPARK-18104: -- User 'zsxwing' has created a pull request fo

[jira] [Assigned] (SPARK-18104) Don't build KafkaSource doc

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18104: Assignee: Shixiong Zhu (was: Apache Spark) > Don't build KafkaSource doc > --

[jira] [Created] (SPARK-18105) LZ4 failed to decompress a stream of shuffled data

2016-10-25 Thread Davies Liu (JIRA)
Davies Liu created SPARK-18105: -- Summary: LZ4 failed to decompress a stream of shuffled data Key: SPARK-18105 URL: https://issues.apache.org/jira/browse/SPARK-18105 Project: Spark Issue Type: Bu

[jira] [Assigned] (SPARK-18105) LZ4 failed to decompress a stream of shuffled data

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18105: Assignee: Davies Liu (was: Apache Spark) > LZ4 failed to decompress a stream of shuffled

[jira] [Commented] (SPARK-18105) LZ4 failed to decompress a stream of shuffled data

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606849#comment-15606849 ] Apache Spark commented on SPARK-18105: -- User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-18105) LZ4 failed to decompress a stream of shuffled data

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18105: Assignee: Apache Spark (was: Davies Liu) > LZ4 failed to decompress a stream of shuffled

[jira] [Commented] (SPARK-17829) Stable format for offset log

2016-10-25 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606896#comment-15606896 ] Tathagata Das commented on SPARK-17829: --- Based on [~tcondie] PR above, I think its

[jira] [Created] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Srinath (JIRA)
Srinath created SPARK-18106: --- Summary: Analyze Table accepts a garbage identifier at the end Key: SPARK-18106 URL: https://issues.apache.org/jira/browse/SPARK-18106 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Srinath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Srinath updated SPARK-18106: Description: {noformat} scala> sql("create table test(a int)") res2: org.apache.spark.sql.DataFrame = [] s

[jira] [Updated] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Srinath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Srinath updated SPARK-18106: Description: {noformat} scala> sql("create table test(a int)") res2: org.apache.spark.sql.DataFrame = [] s

[jira] [Assigned] (SPARK-18087) Optimize insert to not require REPAIR TABLE

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18087: Assignee: Apache Spark > Optimize insert to not require REPAIR TABLE > ---

[jira] [Commented] (SPARK-18087) Optimize insert to not require REPAIR TABLE

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606990#comment-15606990 ] Apache Spark commented on SPARK-18087: -- User 'ericl' has created a pull request for

[jira] [Assigned] (SPARK-18087) Optimize insert to not require REPAIR TABLE

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18087: Assignee: (was: Apache Spark) > Optimize insert to not require REPAIR TABLE >

[jira] [Assigned] (SPARK-18087) Optimize insert to not require REPAIR TABLE

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18087: Assignee: (was: Apache Spark) > Optimize insert to not require REPAIR TABLE >

[jira] [Assigned] (SPARK-18087) Optimize insert to not require REPAIR TABLE

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18087: Assignee: Apache Spark > Optimize insert to not require REPAIR TABLE > ---

[jira] [Closed] (SPARK-18077) Run insert overwrite statements in spark to overwrite a partitioned table is very slow

2016-10-25 Thread J.P Feng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] J.P Feng closed SPARK-18077. Resolution: Won't Fix I would try to open another one, since there are some mistakes in this issue. > Run in

[jira] [Assigned] (SPARK-18103) Rename *FileCatalog to *FileProvider

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18103: Assignee: Apache Spark > Rename *FileCatalog to *FileProvider > --

[jira] [Commented] (SPARK-18103) Rename *FileCatalog to *FileProvider

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607020#comment-15607020 ] Apache Spark commented on SPARK-18103: -- User 'ericl' has created a pull request for

[jira] [Assigned] (SPARK-18103) Rename *FileCatalog to *FileProvider

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18103: Assignee: (was: Apache Spark) > Rename *FileCatalog to *FileProvider > ---

[jira] [Commented] (SPARK-18009) Spark 2.0.1 SQL Thrift Error

2016-10-25 Thread Jerryjung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607052#comment-15607052 ] Jerryjung commented on SPARK-18009: --- Yes! But In my case, it's necessary option for int

[jira] [Comment Edited] (SPARK-18009) Spark 2.0.1 SQL Thrift Error

2016-10-25 Thread Jerryjung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607052#comment-15607052 ] Jerryjung edited comment on SPARK-18009 at 10/26/16 1:44 AM: -

[jira] [Created] (SPARK-18107) Insert overwrite statement runs much slower in spark-sql than it does in hive-client

2016-10-25 Thread J.P Feng (JIRA)
J.P Feng created SPARK-18107: Summary: Insert overwrite statement runs much slower in spark-sql than it does in hive-client Key: SPARK-18107 URL: https://issues.apache.org/jira/browse/SPARK-18107 Project:

[jira] [Created] (SPARK-18108) Partition discovery fails with explicitly written long partitions

2016-10-25 Thread Richard Moorhead (JIRA)
Richard Moorhead created SPARK-18108: Summary: Partition discovery fails with explicitly written long partitions Key: SPARK-18108 URL: https://issues.apache.org/jira/browse/SPARK-18108 Project: Sp

[jira] [Updated] (SPARK-18108) Partition discovery fails with explicitly written long partitions

2016-10-25 Thread Richard Moorhead (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Moorhead updated SPARK-18108: - Attachment: stacktrace.out > Partition discovery fails with explicitly written long parti

[jira] [Commented] (SPARK-18100) Improve the performance of get_json_object using Gson

2016-10-25 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607204#comment-15607204 ] Liang-Chi Hsieh commented on SPARK-18100: - Looks like Gson has no native support

[jira] [Updated] (SPARK-18000) Aggregation function for computing endpoints for histograms

2016-10-25 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-18000: - Summary: Aggregation function for computing endpoints for histograms (was: Aggregation function

[jira] [Comment Edited] (SPARK-17935) Add KafkaForeachWriter in external kafka-0.8.0 for structured streaming module

2016-10-25 Thread zhangxinyu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15601778#comment-15601778 ] zhangxinyu edited comment on SPARK-17935 at 10/26/16 3:26 AM: -

[jira] [Updated] (SPARK-18000) Aggregation function for computing endpoints for histograms

2016-10-25 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-18000: - Description: For a column, we will generate an equi-width or equi-height histogram, depending on

[jira] [Created] (SPARK-18109) Log instrumentation in GMM

2016-10-25 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-18109: Summary: Log instrumentation in GMM Key: SPARK-18109 URL: https://issues.apache.org/jira/browse/SPARK-18109 Project: Spark Issue Type: Sub-task Com

[jira] [Assigned] (SPARK-18109) Log instrumentation in GMM

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18109: Assignee: (was: Apache Spark) > Log instrumentation in GMM > -

[jira] [Assigned] (SPARK-18109) Log instrumentation in GMM

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18109: Assignee: Apache Spark > Log instrumentation in GMM > -- > >

[jira] [Commented] (SPARK-18109) Log instrumentation in GMM

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607289#comment-15607289 ] Apache Spark commented on SPARK-18109: -- User 'zhengruifeng' has created a pull reque

[jira] [Commented] (SPARK-18000) Aggregation function for computing endpoints for histograms

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607297#comment-15607297 ] Apache Spark commented on SPARK-18000: -- User 'wzhfy' has created a pull request for

[jira] [Assigned] (SPARK-18000) Aggregation function for computing endpoints for histograms

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18000: Assignee: (was: Apache Spark) > Aggregation function for computing endpoints for histo

[jira] [Assigned] (SPARK-18000) Aggregation function for computing endpoints for histograms

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18000: Assignee: Apache Spark > Aggregation function for computing endpoints for histograms > ---

[jira] [Updated] (SPARK-17074) generate histogram information for column

2016-10-25 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-17074: - Description: We support two kinds of histograms: - Equi-width histogram: We have a fixed w

[jira] [Commented] (SPARK-18000) Aggregation function for computing endpoints for histograms

2016-10-25 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607306#comment-15607306 ] Zhenhua Wang commented on SPARK-18000: -- This issue is included in another issue SPAR

[jira] [Issue Comment Deleted] (SPARK-18000) Aggregation function for computing endpoints for histograms

2016-10-25 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-18000: - Comment: was deleted (was: This issue is included in another issue SPARK-17881, so I'll close thi

[jira] [Commented] (SPARK-17881) Aggregation function for generating string histograms

2016-10-25 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607308#comment-15607308 ] Zhenhua Wang commented on SPARK-17881: -- This issue is included in another issue SPAR

[jira] [Closed] (SPARK-17881) Aggregation function for generating string histograms

2016-10-25 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang closed SPARK-17881. Resolution: Duplicate > Aggregation function for generating string histograms > ---

[jira] [Commented] (SPARK-18009) Spark 2.0.1 SQL Thrift Error

2016-10-25 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607343#comment-15607343 ] Dilip Biswal commented on SPARK-18009: -- [~smilegator][~jerryjung] [~martha.solarte]

[jira] [Commented] (SPARK-18036) Decision Trees do not handle edge cases

2016-10-25 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607357#comment-15607357 ] Weichen Xu commented on SPARK-18036: I am working on this... > Decision Trees do no

[jira] [Commented] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607374#comment-15607374 ] Dongjoon Hyun commented on SPARK-18106: --- Thank you for reporting this bug, [~skomat

[jira] [Comment Edited] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607374#comment-15607374 ] Dongjoon Hyun edited comment on SPARK-18106 at 10/26/16 4:30 AM: --

[jira] [Comment Edited] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607374#comment-15607374 ] Dongjoon Hyun edited comment on SPARK-18106 at 10/26/16 4:31 AM: --

[jira] [Resolved] (SPARK-18007) update SparkR MLP - add initalWeights parameter

2016-10-25 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18007. -- Resolution: Fixed Assignee: Weichen Xu Fix Version/s: 2.1.0 > update SparkR MLP

[jira] [Created] (SPARK-18110) Missing parameter in Python for RandomForest regression and classification

2016-10-25 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18110: Summary: Missing parameter in Python for RandomForest regression and classification Key: SPARK-18110 URL: https://issues.apache.org/jira/browse/SPARK-18110 Project: S

[jira] [Assigned] (SPARK-18110) Missing parameter in Python for RandomForest regression and classification

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18110: Assignee: Apache Spark (was: Felix Cheung) > Missing parameter in Python for RandomForest

[jira] [Commented] (SPARK-18110) Missing parameter in Python for RandomForest regression and classification

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607444#comment-15607444 ] Apache Spark commented on SPARK-18110: -- User 'felixcheung' has created a pull reques

[jira] [Assigned] (SPARK-18110) Missing parameter in Python for RandomForest regression and classification

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18110: Assignee: Felix Cheung (was: Apache Spark) > Missing parameter in Python for RandomForest

[jira] [Commented] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607496#comment-15607496 ] Apache Spark commented on SPARK-18106: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Assigned] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18106: Assignee: Apache Spark > Analyze Table accepts a garbage identifier at the end > -

[jira] [Assigned] (SPARK-18106) Analyze Table accepts a garbage identifier at the end

2016-10-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18106: Assignee: (was: Apache Spark) > Analyze Table accepts a garbage identifier at the end

[jira] [Created] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-25 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-18111: Summary: Wrong ApproximatePercentile answer when multiple records have the minimum value Key: SPARK-18111 URL: https://issues.apache.org/jira/browse/SPARK-18111 Proje

[jira] [Updated] (SPARK-18111) Wrong ApproximatePercentile answer when multiple records have the minimum value

2016-10-25 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-18111: - Description: When multiple records have the minimum value, the answer of ApproximatePercentile i
