[jira] [Commented] (SPARK-18939) Timezone support in partition values.

2016-12-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763378#comment-15763378 ] Reynold Xin commented on SPARK-18939: - FWIW it's probably unlikely timestamp is used as a partition

[jira] [Commented] (SPARK-18939) Timezone support in partition values.

2016-12-19 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763375#comment-15763375 ] Takuya Ueshin commented on SPARK-18939: --- Yes, I think so. > Timezone support in partition values.

[jira] [Commented] (SPARK-18939) Timezone support in partition values.

2016-12-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763372#comment-15763372 ] Reynold Xin commented on SPARK-18939: - This only impacts timestamp data type right? > Timezone

[jira] [Created] (SPARK-18939) Timezone support in partition values.

2016-12-19 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-18939: - Summary: Timezone support in partition values. Key: SPARK-18939 URL: https://issues.apache.org/jira/browse/SPARK-18939 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-18938) Addition of peak memory usage metric for an executor

2016-12-19 Thread Suresh Bahuguna (JIRA)
Suresh Bahuguna created SPARK-18938: --- Summary: Addition of peak memory usage metric for an executor Key: SPARK-18938 URL: https://issues.apache.org/jira/browse/SPARK-18938 Project: Spark

[jira] [Assigned] (SPARK-18936) Infrastructure for session local timezone support

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18936: Assignee: Apache Spark (was: Takuya Ueshin) > Infrastructure for session local timezone

[jira] [Commented] (SPARK-18936) Infrastructure for session local timezone support

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763343#comment-15763343 ] Apache Spark commented on SPARK-18936: -- User 'ueshin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18936) Infrastructure for session local timezone support

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18936: Assignee: Takuya Ueshin (was: Apache Spark) > Infrastructure for session local timezone

[jira] [Created] (SPARK-18937) Timezone support in CSV/JSON parsing

2016-12-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18937: --- Summary: Timezone support in CSV/JSON parsing Key: SPARK-18937 URL: https://issues.apache.org/jira/browse/SPARK-18937 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-18936) Infrastructure for session local timezone support

2016-12-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18936: --- Summary: Infrastructure for session local timezone support Key: SPARK-18936 URL: https://issues.apache.org/jira/browse/SPARK-18936 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-19 Thread Alok Bhandari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763324#comment-15763324 ] Alok Bhandari edited comment on SPARK-16473 at 12/20/16 5:38 AM: -

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-19 Thread Alok Bhandari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763324#comment-15763324 ] Alok Bhandari commented on SPARK-16473: --- [~imatiach] , thanks for showing interest in this issue. I

[jira] [Closed] (SPARK-17632) make console sink and other sinks work with 'recoverFromCheckpointLocation' option enabled

2016-12-19 Thread Chuanlei Ni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chuanlei Ni closed SPARK-17632. --- Resolution: Not A Problem > make console sink and other sinks work with

[jira] [Commented] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2016-12-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763165#comment-15763165 ] Felix Cheung commented on SPARK-18924: -- Thank you for bring this up. JVM<->Java performance has been

[jira] [Updated] (SPARK-18935) Use Mesos "Dynamic Reservation" resource for Spark

2016-12-19 Thread jackyoh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jackyoh updated SPARK-18935: Affects Version/s: 2.0.1 > Use Mesos "Dynamic Reservation" resource for Spark >

[jira] [Resolved] (SPARK-18913) append to a table with special column names should work

2016-12-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-18913. - Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 > append to a table with special

[jira] [Created] (SPARK-18935) Use Mesos "Dynamic Reservation" resource for Spark

2016-12-19 Thread jackyoh (JIRA)
jackyoh created SPARK-18935: --- Summary: Use Mesos "Dynamic Reservation" resource for Spark Key: SPARK-18935 URL: https://issues.apache.org/jira/browse/SPARK-18935 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-18912) append to a non-file-based data source table should detect columns number mismatch

2016-12-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-18912. - Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 > append to a non-file-based data

[jira] [Updated] (SPARK-18899) append data to a bucketed table with mismatched bucketing should fail

2016-12-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-18899: Fix Version/s: 2.2.0 2.1.1 > append data to a bucketed table with mismatched bucketing

[jira] [Updated] (SPARK-18899) append data to a bucketed table with mismatched bucketing should fail

2016-12-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-18899: Target Version/s: 2.1.1 (was: 2.1.1, 2.2.0) > append data to a bucketed table with mismatched bucketing

[jira] [Resolved] (SPARK-18899) append data to a bucketed table with mismatched bucketing should fail

2016-12-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-18899. - Resolution: Fixed Target Version/s: 2.1.1, 2.2.0 (was: 2.1.1) > append data to a bucketed

[jira] [Commented] (SPARK-17755) Master may ask a worker to launch an executor before the worker actually got the response of registration

2016-12-19 Thread Shuai Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763089#comment-15763089 ] Shuai Lin commented on SPARK-17755: --- A (sort-of) similar problem for coarse grained scheduler backends

[jira] [Resolved] (SPARK-18761) Uncancellable / unkillable tasks may starve jobs of resoures

2016-12-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-18761. -- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16189

[jira] [Assigned] (SPARK-18934) Writing to dynamic partitions does not preserve sort order if spill occurs

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18934: Assignee: (was: Apache Spark) > Writing to dynamic partitions does not preserve sort

[jira] [Commented] (SPARK-18934) Writing to dynamic partitions does not preserve sort order if spill occurs

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763010#comment-15763010 ] Apache Spark commented on SPARK-18934: -- User 'junegunn' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18934) Writing to dynamic partitions does not preserve sort order if spill occurs

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18934: Assignee: Apache Spark > Writing to dynamic partitions does not preserve sort order if

[jira] [Created] (SPARK-18934) Writing to dynamic partitions does not preserve sort order if spill occurs

2016-12-19 Thread Junegunn Choi (JIRA)
Junegunn Choi created SPARK-18934: - Summary: Writing to dynamic partitions does not preserve sort order if spill occurs Key: SPARK-18934 URL: https://issues.apache.org/jira/browse/SPARK-18934

[jira] [Created] (SPARK-18933) Different log output between Terminal screen and stderr file

2016-12-19 Thread Sean Wong (JIRA)
Sean Wong created SPARK-18933: - Summary: Different log output between Terminal screen and stderr file Key: SPARK-18933 URL: https://issues.apache.org/jira/browse/SPARK-18933 Project: Spark

[jira] [Assigned] (SPARK-16654) UI Should show blacklisted executors & nodes

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16654: Assignee: Apache Spark (was: Jose Soltren) > UI Should show blacklisted executors &

[jira] [Assigned] (SPARK-16654) UI Should show blacklisted executors & nodes

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16654: Assignee: Jose Soltren (was: Apache Spark) > UI Should show blacklisted executors &

[jira] [Commented] (SPARK-16654) UI Should show blacklisted executors & nodes

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762908#comment-15762908 ] Apache Spark commented on SPARK-16654: -- User 'jsoltren' has created a pull request for this issue:

[jira] [Closed] (SPARK-18915) Return Nothing when Querying a Partitioned Data Source Table without Repairing it

2016-12-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li closed SPARK-18915. --- Resolution: Won't Fix > Return Nothing when Querying a Partitioned Data Source Table without > Repairing it

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-12-19 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762805#comment-15762805 ] Barry Becker commented on SPARK-16845: -- I found a workaround that allows me to avoid the 64 KB

[jira] [Commented] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2016-12-19 Thread Roberto Mirizzi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762785#comment-15762785 ] Roberto Mirizzi commented on SPARK-18492: - I would also like to understand if this error causes

[jira] [Commented] (SPARK-18710) Add offset to GeneralizedLinearRegression models

2016-12-19 Thread Wayne Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762752#comment-15762752 ] Wayne Zhang commented on SPARK-18710: - [~yanboliang] It seems that I would need to change the case

[jira] [Commented] (SPARK-17344) Kafka 0.8 support for Structured Streaming

2016-12-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762756#comment-15762756 ] Michael Armbrust commented on SPARK-17344: -- [KAFKA-4462] aims to give us backwards compatibility

[jira] [Resolved] (SPARK-18928) FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation

2016-12-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18928. --- Resolution: Fixed Fix Version/s: 2.1.1 > FileScanRDD, JDBCRDD, and

[jira] [Updated] (SPARK-18928) FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation

2016-12-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18928: -- Fix Version/s: 2.2.0 > FileScanRDD, JDBCRDD, and UnsafeSorter should support task

[jira] [Updated] (SPARK-17344) Kafka 0.8 support for Structured Streaming

2016-12-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-17344: - Target Version/s: 2.1.1 > Kafka 0.8 support for Structured Streaming >

[jira] [Updated] (SPARK-18908) It's hard for the user to see the failure if StreamExecution fails to create the logical plan

2016-12-19 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18908: - Target Version/s: 2.1.1 > It's hard for the user to see the failure if StreamExecution

[jira] [Created] (SPARK-18932) Partial aggregation for collect_set / collect_list

2016-12-19 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-18932: Summary: Partial aggregation for collect_set / collect_list Key: SPARK-18932 URL: https://issues.apache.org/jira/browse/SPARK-18932 Project: Spark

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762719#comment-15762719 ] Dongjoon Hyun commented on SPARK-16845: --- Hi, All. I removed the target version since 2.1.0 is

[jira] [Updated] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-16845: -- Target Version/s: (was: 2.1.0) >

[jira] [Updated] (SPARK-18899) append data to a bucketed table with mismatched bucketing should fail

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-18899: -- Target Version/s: 2.1.1 (was: 2.1.0) > append data to a bucketed table with mismatched

[jira] [Updated] (SPARK-18894) Event time watermark delay threshold specified in months or years gives incorrect results

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-18894: -- Target Version/s: 2.1.1 (was: 2.1.0) > Event time watermark delay threshold specified in

[jira] [Updated] (SPARK-18913) append to a table with special column names should work

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-18913: -- Target Version/s: 2.1.1 (was: 2.1.0) > append to a table with special column names should

[jira] [Updated] (SPARK-18912) append to a non-file-based data source table should detect columns number mismatch

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-18912: -- Target Version/s: 2.1.1 (was: 2.1.0) > append to a non-file-based data source table should

[jira] [Updated] (SPARK-18909) The error message in `ExpressionEncoder.toRow` and `fromRow` is too verbose

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-18909: -- Target Version/s: 2.1.1 (was: 2.1.0) > The error message in `ExpressionEncoder.toRow` and

[jira] [Created] (SPARK-18931) Create empty staging directory in partitioned table on insert

2016-12-19 Thread Egor Pahomov (JIRA)
Egor Pahomov created SPARK-18931: Summary: Create empty staging directory in partitioned table on insert Key: SPARK-18931 URL: https://issues.apache.org/jira/browse/SPARK-18931 Project: Spark

[jira] [Commented] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2016-12-19 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762706#comment-15762706 ] Nicholas Chammas commented on SPARK-18492: -- Yup, I'm seeming the same high-level behavior as

[jira] [Assigned] (SPARK-17755) Master may ask a worker to launch an executor before the worker actually got the response of registration

2016-12-19 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reassigned SPARK-17755: Assignee: Shixiong Zhu > Master may ask a worker to launch an executor before the worker

[jira] [Commented] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2016-12-19 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762687#comment-15762687 ] Hossein Falaki commented on SPARK-18924: Would be good to think about this along with the efforts

[jira] [Assigned] (SPARK-18929) Add Tweedie distribution in GLM

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18929: Assignee: (was: Apache Spark) > Add Tweedie distribution in GLM >

[jira] [Assigned] (SPARK-18929) Add Tweedie distribution in GLM

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18929: Assignee: Apache Spark > Add Tweedie distribution in GLM >

[jira] [Assigned] (SPARK-17755) Master may ask a worker to launch an executor before the worker actually got the response of registration

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17755: Assignee: Apache Spark > Master may ask a worker to launch an executor before the worker

[jira] [Commented] (SPARK-18929) Add Tweedie distribution in GLM

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762684#comment-15762684 ] Apache Spark commented on SPARK-18929: -- User 'actuaryzhang' has created a pull request for this

[jira] [Assigned] (SPARK-17755) Master may ask a worker to launch an executor before the worker actually got the response of registration

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17755: Assignee: (was: Apache Spark) > Master may ask a worker to launch an executor before

[jira] [Commented] (SPARK-17755) Master may ask a worker to launch an executor before the worker actually got the response of registration

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762683#comment-15762683 ] Apache Spark commented on SPARK-17755: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Created] (SPARK-18930) Inserting in partitioned table - partitioned field should be last in select statement.

2016-12-19 Thread Egor Pahomov (JIRA)
Egor Pahomov created SPARK-18930: Summary: Inserting in partitioned table - partitioned field should be last in select statement. Key: SPARK-18930 URL: https://issues.apache.org/jira/browse/SPARK-18930

[jira] [Updated] (SPARK-10413) ML models should support prediction on single instances

2016-12-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10413: -- Summary: ML models should support prediction on single instances (was: Model should

[jira] [Updated] (SPARK-15572) ML persistence in R format: compatibility with other languages

2016-12-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15572: -- Summary: ML persistence in R format: compatibility with other languages (was: MLlib

[jira] [Created] (SPARK-18929) Add Tweedie distribution in GLM

2016-12-19 Thread Wayne Zhang (JIRA)
Wayne Zhang created SPARK-18929: --- Summary: Add Tweedie distribution in GLM Key: SPARK-18929 URL: https://issues.apache.org/jira/browse/SPARK-18929 Project: Spark Issue Type: New Feature

[jira] [Comment Edited] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2016-12-19 Thread Roberto Mirizzi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762628#comment-15762628 ] Roberto Mirizzi edited comment on SPARK-18492 at 12/19/16 11:32 PM:

[jira] [Comment Edited] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2016-12-19 Thread Roberto Mirizzi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762628#comment-15762628 ] Roberto Mirizzi edited comment on SPARK-18492 at 12/19/16 11:29 PM:

[jira] [Commented] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2016-12-19 Thread Roberto Mirizzi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762628#comment-15762628 ] Roberto Mirizzi commented on SPARK-18492: - I'm having exactly the same issue on Spark 2.0.2. I

[jira] [Issue Comment Deleted] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2016-12-19 Thread Roberto Mirizzi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Roberto Mirizzi updated SPARK-18492: Comment: was deleted (was: I'm having exactly the same issue. My exception is:

[jira] [Commented] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2016-12-19 Thread Roberto Mirizzi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762609#comment-15762609 ] Roberto Mirizzi commented on SPARK-18492: - I'm having exactly the same issue. My exception is:

[jira] [Commented] (SPARK-18588) KafkaSourceStressForDontFailOnDataLossSuite is flaky

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762559#comment-15762559 ] Apache Spark commented on SPARK-18588: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18588) KafkaSourceStressForDontFailOnDataLossSuite is flaky

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18588: Assignee: Shixiong Zhu (was: Apache Spark) > KafkaSourceStressForDontFailOnDataLossSuite

[jira] [Assigned] (SPARK-18588) KafkaSourceStressForDontFailOnDataLossSuite is flaky

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18588: Assignee: Apache Spark (was: Shixiong Zhu) > KafkaSourceStressForDontFailOnDataLossSuite

[jira] [Commented] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2016-12-19 Thread Ed Tyrrill (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762555#comment-15762555 ] Ed Tyrrill commented on SPARK-15544: I'm going to add that this is very easy to reproduce. It will

[jira] [Resolved] (SPARK-18836) Serialize Task Metrics once per stage

2016-12-19 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-18836. Resolution: Fixed Assignee: Shivaram Venkataraman Fix Version/s: 1.3.0 >

[jira] [Commented] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2016-12-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762548#comment-15762548 ] Shivaram Venkataraman commented on SPARK-18924: --- This is a good thing to investigate - Just

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-19 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762532#comment-15762532 ] Ilya Matiach commented on SPARK-16473: -- I'm interested in looking into this issue. Would it be

[jira] [Assigned] (SPARK-18927) MemorySink for StructuredStreaming can't recover from checkpoint if location is provided in conf

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18927: Assignee: (was: Apache Spark) > MemorySink for StructuredStreaming can't recover from

[jira] [Commented] (SPARK-18927) MemorySink for StructuredStreaming can't recover from checkpoint if location is provided in conf

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762497#comment-15762497 ] Apache Spark commented on SPARK-18927: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18927) MemorySink for StructuredStreaming can't recover from checkpoint if location is provided in conf

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18927: Assignee: Apache Spark > MemorySink for StructuredStreaming can't recover from checkpoint

[jira] [Commented] (SPARK-18832) Spark SQL: Incorrect error message on calling registered UDF.

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762427#comment-15762427 ] Dongjoon Hyun commented on SPARK-18832: --- Ah, I think I reproduced the situation you meet. Let me

[jira] [Commented] (SPARK-18928) FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762384#comment-15762384 ] Apache Spark commented on SPARK-18928: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18928) FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18928: Assignee: Josh Rosen (was: Apache Spark) > FileScanRDD, JDBCRDD, and UnsafeSorter should

[jira] [Assigned] (SPARK-18928) FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18928: Assignee: Apache Spark (was: Josh Rosen) > FileScanRDD, JDBCRDD, and UnsafeSorter should

[jira] [Created] (SPARK-18928) FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation

2016-12-19 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-18928: -- Summary: FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation Key: SPARK-18928 URL: https://issues.apache.org/jira/browse/SPARK-18928 Project: Spark

[jira] [Commented] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2016-12-19 Thread Ed Tyrrill (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762360#comment-15762360 ] Ed Tyrrill commented on SPARK-15544: I am experiencing the same problem with Spark 1.6.2 and ZK 3.4.8

[jira] [Commented] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-19 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762296#comment-15762296 ] Imran Rashid commented on SPARK-18886: -- Thanks [~mridul], that helps -- in particular I was only

[jira] [Commented] (SPARK-18926) run-example SparkPi terminates with error message

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762210#comment-15762210 ] Dongjoon Hyun commented on SPARK-18926: --- Hi, [~alex.decastro]. It looks like some timing issue.

[jira] [Resolved] (SPARK-18624) Implict cast between ArrayTypes

2016-12-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18624. --- Resolution: Fixed Assignee: Jiang Xingbo Fix Version/s: 2.2.0 >

[jira] [Commented] (SPARK-18877) Unable to read given csv data. Excepion: java.lang.IllegalArgumentException: requirement failed: Decimal precision 28 exceeds max precision 20

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762186#comment-15762186 ] Dongjoon Hyun commented on SPARK-18877: --- +1 > Unable to read given csv data. Excepion:

[jira] [Commented] (SPARK-18832) Spark SQL: Incorrect error message on calling registered UDF.

2016-12-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762182#comment-15762182 ] Dongjoon Hyun commented on SPARK-18832: --- Thank you for confirming. Actually, Spark already has the

[jira] [Updated] (SPARK-18908) It's hard for the user to see the failure if StreamExecution fails to create the logical plan

2016-12-19 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18908: - Priority: Critical (was: Blocker) > It's hard for the user to see the failure if

[jira] [Resolved] (SPARK-18921) check database existence with Hive.databaseExists instead of getDatabase

2016-12-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-18921. -- Resolution: Fixed Fix Version/s: 2.1.1 Issue resolved by pull request 16332

[jira] [Resolved] (SPARK-18700) getCached in HiveMetastoreCatalog not thread safe cause driver OOM

2016-12-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18700. --- Resolution: Fixed Assignee: Li Yuanjian Fix Version/s: 2.2.0

[jira] [Commented] (SPARK-16654) UI Should show blacklisted executors & nodes

2016-12-19 Thread Jose Soltren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762067#comment-15762067 ] Jose Soltren commented on SPARK-16654: -- SPARK-8425 is resolved so I'll be working to get this

[jira] [Updated] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2016-12-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-18924: -- Description: SparkR has its own SerDe for data serialization between JVM and R. The SerDe on

[jira] [Created] (SPARK-18927) MemorySink for StructuredStreaming can't recover from checkpoint if location is provided in conf

2016-12-19 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-18927: --- Summary: MemorySink for StructuredStreaming can't recover from checkpoint if location is provided in conf Key: SPARK-18927 URL: https://issues.apache.org/jira/browse/SPARK-18927

[jira] [Commented] (SPARK-18716) Restrict the disk usage of spark event log.

2016-12-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761941#comment-15761941 ] Marcelo Vanzin commented on SPARK-18716: For posterity, another problem with this feature that I

[jira] [Assigned] (SPARK-18917) Dataframe - Time Out Issues / Taking long time in append mode on object stores

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18917: Assignee: (was: Apache Spark) > Dataframe - Time Out Issues / Taking long time in

[jira] [Assigned] (SPARK-18917) Dataframe - Time Out Issues / Taking long time in append mode on object stores

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18917: Assignee: Apache Spark > Dataframe - Time Out Issues / Taking long time in append mode on

[jira] [Commented] (SPARK-18917) Dataframe - Time Out Issues / Taking long time in append mode on object stores

2016-12-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761926#comment-15761926 ] Apache Spark commented on SPARK-18917: -- User 'alunarbeach' has created a pull request for this

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:37 PM: --- The

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:29 PM: --- The

  1   2   >