[jira] [Created] (SPARK-26925) how to get the statistics when read from or writer to another database by datasourceV2

2019-02-18 Thread webber (JIRA)
webber created SPARK-26925: -- Summary: how to get the statistics when read from or writer to another database by datasourceV2 Key: SPARK-26925 URL: https://issues.apache.org/jira/browse/SPARK-26925

[jira] [Commented] (SPARK-26911) Spark do not see column in table

2019-02-18 Thread Vitaly Larchenkov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771633#comment-16771633 ] Vitaly Larchenkov commented on SPARK-26911: --- Yeah, will do that in few days. > Spark do not

[jira] [Resolved] (SPARK-26911) Spark do not see column in table

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-26911. -- Resolution: Incomplete > Spark do not see column in table >

[jira] [Commented] (SPARK-26907) Does ShuffledRDD Replication Work With External Shuffle Service

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771630#comment-16771630 ] Hyukjin Kwon commented on SPARK-26907: -- Questions should better go to mailing list than filing it

[jira] [Resolved] (SPARK-26907) Does ShuffledRDD Replication Work With External Shuffle Service

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-26907. -- Resolution: Invalid > Does ShuffledRDD Replication Work With External Shuffle Service >

[jira] [Resolved] (SPARK-24744) Structured Streaming set SparkSession configuration with the value in the metadata if there is not a option set by user.

2019-02-18 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-24744. -- Resolution: Information Provided We provided the reason why such change is restricted in the

[jira] [Commented] (SPARK-26911) Spark do not see column in table

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771628#comment-16771628 ] Hyukjin Kwon commented on SPARK-26911: -- Can you make the reproducer self-runnable and narrow down

[jira] [Updated] (SPARK-26922) Set socket timeout consistently in Arrow optimization

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-26922: - Issue Type: Sub-task (was: Bug) Parent: SPARK-26759 > Set socket timeout consistently

[jira] [Resolved] (SPARK-26886) Proper termination of external processes launched by the worker

2019-02-18 Thread luzengxiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] luzengxiang resolved SPARK-26886. - Resolution: Won't Do > Proper termination of external processes launched by the worker >

[jira] [Commented] (SPARK-23682) Memory issue with Spark structured streaming

2019-02-18 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771613#comment-16771613 ] Jungtaek Lim commented on SPARK-23682: -- Ping [~bondyk] to see whether SPARK-24717 resolves this. If

[jira] [Updated] (SPARK-26854) Support ANY/SOME subquery

2019-02-18 Thread Mingcong Han (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mingcong Han updated SPARK-26854: - Summary: Support ANY/SOME subquery (was: Support ANY subquery) > Support ANY/SOME subquery >

[jira] [Resolved] (SPARK-26909) use unsafeRow.hashCode() as hash value in HashAggregate

2019-02-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-26909. - Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23821

[jira] [Assigned] (SPARK-26909) use unsafeRow.hashCode() as hash value in HashAggregate

2019-02-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-26909: --- Assignee: yucai > use unsafeRow.hashCode() as hash value in HashAggregate >

[jira] [Updated] (SPARK-26740) Statistics for date and timestamp columns depend on system time zone

2019-02-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-26740: Fix Version/s: 2.4.1 > Statistics for date and timestamp columns depend on system time zone >

[jira] [Updated] (SPARK-26859) Fix field writer index bug in non-vectorized ORC deserializer

2019-02-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-26859: -- Summary: Fix field writer index bug in non-vectorized ORC deserializer (was: Reading ORC

[jira] [Created] (SPARK-26924) Document Arrow optimization and vectorized R APIs

2019-02-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-26924: Summary: Document Arrow optimization and vectorized R APIs Key: SPARK-26924 URL: https://issues.apache.org/jira/browse/SPARK-26924 Project: Spark Issue

[jira] [Created] (SPARK-26923) Refactor ArrowRRunner and RRunner to deduplicate codes

2019-02-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-26923: Summary: Refactor ArrowRRunner and RRunner to deduplicate codes Key: SPARK-26923 URL: https://issues.apache.org/jira/browse/SPARK-26923 Project: Spark Issue

[jira] [Assigned] (SPARK-24783) spark.sql.shuffle.partitions=0 should throw exception

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24783: Assignee: Apache Spark > spark.sql.shuffle.partitions=0 should throw exception >

[jira] [Assigned] (SPARK-24783) spark.sql.shuffle.partitions=0 should throw exception

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24783: Assignee: (was: Apache Spark) > spark.sql.shuffle.partitions=0 should throw

[jira] [Created] (SPARK-26922) Set socket timeout consistently in Arrow optimization

2019-02-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-26922: Summary: Set socket timeout consistently in Arrow optimization Key: SPARK-26922 URL: https://issues.apache.org/jira/browse/SPARK-26922 Project: Spark Issue

[jira] [Created] (SPARK-26921) Fix CRAN hack as soon as Arrow is available on CRAN

2019-02-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-26921: Summary: Fix CRAN hack as soon as Arrow is available on CRAN Key: SPARK-26921 URL: https://issues.apache.org/jira/browse/SPARK-26921 Project: Spark Issue

[jira] [Updated] (SPARK-26920) Deduplicate type checking across Arrow optimization and vectorized APIs in SparkR

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-26920: - Issue Type: Sub-task (was: Bug) Parent: SPARK-26759 > Deduplicate type checking across

[jira] [Created] (SPARK-26920) Deduplicate type checking across Arrow optimization and vectorized APIs in SparkR

2019-02-18 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-26920: Summary: Deduplicate type checking across Arrow optimization and vectorized APIs in SparkR Key: SPARK-26920 URL: https://issues.apache.org/jira/browse/SPARK-26920

[jira] [Assigned] (SPARK-26919) change maven default compile java home

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26919: Assignee: Apache Spark > change maven default compile java home >

[jira] [Commented] (SPARK-26919) change maven default compile java home

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771515#comment-16771515 ] Apache Spark commented on SPARK-26919: -- User 'MrDLontheway' has created a pull request for this

[jira] [Created] (SPARK-26919) change maven default compile java home

2019-02-18 Thread daile (JIRA)
daile created SPARK-26919: - Summary: change maven default compile java home Key: SPARK-26919 URL: https://issues.apache.org/jira/browse/SPARK-26919 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-26919) change maven default compile java home

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26919: Assignee: (was: Apache Spark) > change maven default compile java home >

[jira] [Commented] (SPARK-26919) change maven default compile java home

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771514#comment-16771514 ] Apache Spark commented on SPARK-26919: -- User 'MrDLontheway' has created a pull request for this

[jira] [Updated] (SPARK-26919) change maven default compile java home

2019-02-18 Thread daile (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] daile updated SPARK-26919: -- Attachment: p1.png > change maven default compile java home > -- > >

[jira] [Comment Edited] (SPARK-26858) Vectorized gapplyCollect, Arrow optimization in native R function execution

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771507#comment-16771507 ] Hyukjin Kwon edited comment on SPARK-26858 at 2/19/19 2:29 AM: --- {quote} If

[jira] [Commented] (SPARK-26858) Vectorized gapplyCollect, Arrow optimization in native R function execution

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771507#comment-16771507 ] Hyukjin Kwon commented on SPARK-26858: -- {quote} If I understand, this is the case where Spark

[jira] [Created] (SPARK-26918) All .md should have ASF license header

2019-02-18 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-26918: Summary: All .md should have ASF license header Key: SPARK-26918 URL: https://issues.apache.org/jira/browse/SPARK-26918 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-26918) All .md should have ASF license header

2019-02-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-26918: - Description: per policy, all md files should have the header, like eg.

[jira] [Updated] (SPARK-26918) All .md should have ASF license header

2019-02-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-26918: - Description: per policy, all md files should have the header, like eg.

[jira] [Commented] (SPARK-26910) Re-release SparkR to CRAN

2019-02-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771496#comment-16771496 ] Felix Cheung commented on SPARK-26910: -- 2.3.3 has been submitted to CRAN. we are currently waiting

[jira] [Commented] (SPARK-26910) Re-release SparkR to CRAN

2019-02-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771498#comment-16771498 ] Felix Cheung commented on SPARK-26910: -- once that works we should look into 2.4.1 > Re-release

[jira] [Assigned] (SPARK-26910) Re-release SparkR to CRAN

2019-02-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung reassigned SPARK-26910: Assignee: Felix Cheung > Re-release SparkR to CRAN > - > >

[jira] [Commented] (SPARK-26858) Vectorized gapplyCollect, Arrow optimization in native R function execution

2019-02-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771494#comment-16771494 ] Felix Cheung commented on SPARK-26858: -- If I understand, this is the case where Spark actually

[jira] [Commented] (SPARK-26777) SQL worked in 2.3.2 and fails in 2.4.0

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771486#comment-16771486 ] Hyukjin Kwon commented on SPARK-26777: -- Please open a separate issue if you're not very sure if

[jira] [Resolved] (SPARK-26785) data source v2 API refactor: streaming write

2019-02-18 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-26785. - Resolution: Fixed Fix Version/s: 3.0.0 > data source v2 API refactor: streaming write >

[jira] [Assigned] (SPARK-26917) CacheManager blocks while traversing plans

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26917: Assignee: (was: Apache Spark) > CacheManager blocks while traversing plans >

[jira] [Assigned] (SPARK-26917) CacheManager blocks while traversing plans

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26917: Assignee: Apache Spark > CacheManager blocks while traversing plans >

[jira] [Created] (SPARK-26917) CacheManager blocks while traversing plans

2019-02-18 Thread Dave DeCaprio (JIRA)
Dave DeCaprio created SPARK-26917: - Summary: CacheManager blocks while traversing plans Key: SPARK-26917 URL: https://issues.apache.org/jira/browse/SPARK-26917 Project: Spark Issue Type:

[jira] [Commented] (SPARK-26777) SQL worked in 2.3.2 and fails in 2.4.0

2019-02-18 Thread Ilya Peysakhov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771304#comment-16771304 ] Ilya Peysakhov commented on SPARK-26777: [~hyukjin.kwon] [~yuri.budilov]   [~kabhwan]  

[jira] [Assigned] (SPARK-26916) Upgrade to Kafka 2.1.1

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26916: Assignee: Apache Spark (was: Dongjoon Hyun) > Upgrade to Kafka 2.1.1 >

[jira] [Assigned] (SPARK-26916) Upgrade to Kafka 2.1.1

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26916: Assignee: Dongjoon Hyun (was: Apache Spark) > Upgrade to Kafka 2.1.1 >

[jira] [Assigned] (SPARK-26916) Upgrade to Kafka 2.1.1

2019-02-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-26916: - Assignee: Dongjoon Hyun > Upgrade to Kafka 2.1.1 > -- > >

[jira] [Commented] (SPARK-26916) Upgrade to Kafka 2.1.1

2019-02-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771216#comment-16771216 ] Dongjoon Hyun commented on SPARK-26916: --- I'll make a PR shortly. > Upgrade to Kafka 2.1.1 >

[jira] [Updated] (SPARK-26916) Upgrade to Kafka 2.1.1

2019-02-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-26916: -- Issue Type: Improvement (was: Bug) > Upgrade to Kafka 2.1.1 > -- > >

[jira] [Updated] (SPARK-26916) Upgrade to Kafka 2.1.1

2019-02-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-26916: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-24417 > Upgrade to Kafka 2.1.1

[jira] [Created] (SPARK-26916) Upgrade to Kafka 2.1.1

2019-02-18 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-26916: - Summary: Upgrade to Kafka 2.1.1 Key: SPARK-26916 URL: https://issues.apache.org/jira/browse/SPARK-26916 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-24783) spark.sql.shuffle.partitions=0 should throw exception

2019-02-18 Thread langjingxiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771141#comment-16771141 ] langjingxiang commented on SPARK-24783: --- yes you are right ,It seems that there is no use of 0 or

[jira] [Comment Edited] (SPARK-26881) Scaling issue with Gramian computation for RowMatrix: too many results sent to driver

2019-02-18 Thread Rafael RENAUDIN-AVINO (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771138#comment-16771138 ] Rafael RENAUDIN-AVINO edited comment on SPARK-26881 at 2/18/19 2:49 PM:

[jira] [Commented] (SPARK-26881) Scaling issue with Gramian computation for RowMatrix: too many results sent to driver

2019-02-18 Thread Rafael RENAUDIN-AVINO (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771138#comment-16771138 ] Rafael RENAUDIN-AVINO commented on SPARK-26881: --- Basically I see two improvements that

[jira] [Assigned] (SPARK-26915) File source should write without schema inference and validation in DataFrameWriter.save()

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26915: Assignee: (was: Apache Spark) > File source should write without schema inference

[jira] [Created] (SPARK-26915) File source should write without schema inference and validation in DataFrameWriter.save()

2019-02-18 Thread Gengliang Wang (JIRA)
Gengliang Wang created SPARK-26915: -- Summary: File source should write without schema inference and validation in DataFrameWriter.save() Key: SPARK-26915 URL: https://issues.apache.org/jira/browse/SPARK-26915

[jira] [Assigned] (SPARK-26915) File source should write without schema inference and validation in DataFrameWriter.save()

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26915: Assignee: Apache Spark > File source should write without schema inference and

[jira] [Commented] (SPARK-26881) Scaling issue with Gramian computation for RowMatrix: too many results sent to driver

2019-02-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771094#comment-16771094 ] Sean Owen commented on SPARK-26881: --- It's related but not quite the same issue. (You should try 2.3.3

[jira] [Commented] (SPARK-24783) spark.sql.shuffle.partitions=0 should throw exception

2019-02-18 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771085#comment-16771085 ] Jungtaek Lim commented on SPARK-24783: -- Does we have the chance to set the value to 0 or negative

[jira] [Commented] (SPARK-24783) spark.sql.shuffle.partitions=0 should throw exception

2019-02-18 Thread langjingxiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771070#comment-16771070 ] langjingxiang commented on SPARK-24783: --- ok  Let me add a judgement. >

[jira] [Resolved] (SPARK-26880) dataDF.queryExecution.toRdd corrupt rows

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-26880. -- Resolution: Invalid > dataDF.queryExecution.toRdd corrupt rows >

[jira] [Commented] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2019-02-18 Thread Moein Hosseini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771035#comment-16771035 ] Moein Hosseini commented on SPARK-15544: [~srowen] I've started to work on it. Seems it comes

[jira] [Assigned] (SPARK-26914) ThriftServer scheduler pool may be unpredictably when using fair schedule mode

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26914: Assignee: (was: Apache Spark) > ThriftServer scheduler pool may be unpredictably

[jira] [Assigned] (SPARK-26914) ThriftServer scheduler pool may be unpredictably when using fair schedule mode

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26914: Assignee: Apache Spark > ThriftServer scheduler pool may be unpredictably when using

[jira] [Updated] (SPARK-26914) ThriftServer scheduler pool may be unpredictably when using fair schedule mode

2019-02-18 Thread zhoukang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhoukang updated SPARK-26914: - Attachment: 26914-03.png 26914-02.png 26914-01.png > ThriftServer

[jira] [Updated] (SPARK-26914) ThriftServer scheduler pool may be unpredictably when using fair schedule mode

2019-02-18 Thread zhoukang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhoukang updated SPARK-26914: - Description: When using fair scheduler mode for thrift server, we may have unpredictable result.

[jira] [Created] (SPARK-26914) ThriftServer scheduler pool may be unpredictably when using fair schedule mode

2019-02-18 Thread zhoukang (JIRA)
zhoukang created SPARK-26914: Summary: ThriftServer scheduler pool may be unpredictably when using fair schedule mode Key: SPARK-26914 URL: https://issues.apache.org/jira/browse/SPARK-26914 Project:

[jira] [Commented] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2019-02-18 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16771008#comment-16771008 ] Jungtaek Lim commented on SPARK-15544: -- I'm interested in this issue, but I guess the thing is not

[jira] [Assigned] (SPARK-26912) Allow setting permission for event_log

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26912: Assignee: Apache Spark > Allow setting permission for event_log >

[jira] [Assigned] (SPARK-26912) Allow setting permission for event_log

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26912: Assignee: (was: Apache Spark) > Allow setting permission for event_log >

[jira] [Assigned] (SPARK-26913) New data source V2 API: SupportsDirectWrite

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26913: Assignee: Apache Spark > New data source V2 API: SupportsDirectWrite >

[jira] [Assigned] (SPARK-26913) New data source V2 API: SupportsDirectWrite

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26913: Assignee: (was: Apache Spark) > New data source V2 API: SupportsDirectWrite >

[jira] [Created] (SPARK-26913) New data source V2 API: SupportsDirectWrite

2019-02-18 Thread Gengliang Wang (JIRA)
Gengliang Wang created SPARK-26913: -- Summary: New data source V2 API: SupportsDirectWrite Key: SPARK-26913 URL: https://issues.apache.org/jira/browse/SPARK-26913 Project: Spark Issue Type:

[jira] [Commented] (SPARK-26911) Spark do not see column in table

2019-02-18 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16770964#comment-16770964 ] Marco Gaido commented on SPARK-26911: - May you please check that current master is still affected?

[jira] [Updated] (SPARK-26912) Allow setting permission for event_log

2019-02-18 Thread Jackey Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jackey Lee updated SPARK-26912: --- Summary: Allow setting permission for event_log (was: Allow set permission for event_log) > Allow

[jira] [Created] (SPARK-26912) Allow set permission for event_log

2019-02-18 Thread Jackey Lee (JIRA)
Jackey Lee created SPARK-26912: -- Summary: Allow set permission for event_log Key: SPARK-26912 URL: https://issues.apache.org/jira/browse/SPARK-26912 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-26911) Spark

2019-02-18 Thread Vitaly Larchenkov (JIRA)
Vitaly Larchenkov created SPARK-26911: - Summary: Spark Key: SPARK-26911 URL: https://issues.apache.org/jira/browse/SPARK-26911 Project: Spark Issue Type: Bug Components: Spark

[jira] [Updated] (SPARK-26911) Spark do not see column in table

2019-02-18 Thread Vitaly Larchenkov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitaly Larchenkov updated SPARK-26911: -- Description:     Spark cannot find column that actually exists in array {code:java}

[jira] [Updated] (SPARK-26911) Spark do not see column in table

2019-02-18 Thread Vitaly Larchenkov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitaly Larchenkov updated SPARK-26911: -- Summary: Spark do not see column in table (was: Spark ) > Spark do not see column in

[jira] [Resolved] (SPARK-26353) Add typed aggregate functions(max/min) to the example module

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-26353. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23304

[jira] [Resolved] (SPARK-26889) Fix timestamp type in Structured Streaming + Kafka Integration Guide

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-26889. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23796

[jira] [Assigned] (SPARK-26889) Fix timestamp type in Structured Streaming + Kafka Integration Guide

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-26889: Assignee: Gabor Somogyi > Fix timestamp type in Structured Streaming + Kafka Integration

[jira] [Assigned] (SPARK-26353) Add typed aggregate functions(max/min) to the example module

2019-02-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-26353: Assignee: liuxian > Add typed aggregate functions(max/min) to the example module >

[jira] [Created] (SPARK-26910) Re-release SparkR to CRAN

2019-02-18 Thread Michael Chirico (JIRA)
Michael Chirico created SPARK-26910: --- Summary: Re-release SparkR to CRAN Key: SPARK-26910 URL: https://issues.apache.org/jira/browse/SPARK-26910 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-26881) Scaling issue with Gramian computation for RowMatrix: too many results sent to driver

2019-02-18 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16770881#comment-16770881 ] Marco Gaido commented on SPARK-26881: - This may have been fixed/improved by SPARK-26228, could you

[jira] [Commented] (SPARK-26880) dataDF.queryExecution.toRdd corrupt rows

2019-02-18 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16770866#comment-16770866 ] Jungtaek Lim commented on SPARK-26880: -- Just submitted a PR to add note on proper usage of

[jira] [Comment Edited] (SPARK-26880) dataDF.queryExecution.toRdd corrupt rows

2019-02-18 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16770866#comment-16770866 ] Jungtaek Lim edited comment on SPARK-26880 at 2/18/19 8:40 AM: --- Just

[jira] [Commented] (SPARK-22860) Spark workers log ssl passwords passed to the executors

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16770843#comment-16770843 ] Apache Spark commented on SPARK-22860: -- User 'HeartSaVioR' has created a pull request for this

[jira] [Created] (SPARK-26909) use unsafeRow.hashCode() as hash value in HashAggregate

2019-02-18 Thread yucai (JIRA)
yucai created SPARK-26909: - Summary: use unsafeRow.hashCode() as hash value in HashAggregate Key: SPARK-26909 URL: https://issues.apache.org/jira/browse/SPARK-26909 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-26909) use unsafeRow.hashCode() as hash value in HashAggregate

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26909: Assignee: Apache Spark > use unsafeRow.hashCode() as hash value in HashAggregate >

[jira] [Assigned] (SPARK-26909) use unsafeRow.hashCode() as hash value in HashAggregate

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-26909: Assignee: (was: Apache Spark) > use unsafeRow.hashCode() as hash value in

[jira] [Commented] (SPARK-22860) Spark workers log ssl passwords passed to the executors

2019-02-18 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16770844#comment-16770844 ] Jungtaek Lim commented on SPARK-22860: -- I'm proposing to just redact these values from log message

[jira] [Updated] (SPARK-26909) use unsafeRow.hashCode() as hash value in HashAggregate

2019-02-18 Thread yucai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yucai updated SPARK-26909: -- Description: This is a followup PR for #21149. New way uses unsafeRow.hashCode() as hash value in

[jira] [Commented] (SPARK-22860) Spark workers log ssl passwords passed to the executors

2019-02-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16770842#comment-16770842 ] Apache Spark commented on SPARK-22860: -- User 'HeartSaVioR' has created a pull request for this