[jira] [Commented] (SPARK-22450) Safely register class for mllib

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679468#comment-16679468 ] Apache Spark commented on SPARK-22450: -- User 'zhengruifeng' has created a pull request for this

[jira] [Assigned] (SPARK-25974) Optimizes Generates bytecode for ordering based on the given order

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25974: Assignee: Apache Spark > Optimizes Generates bytecode for ordering based on the given

[jira] [Resolved] (SPARK-25967) sql.functions.trim() should remove trailing and leading tabs

2018-11-08 Thread Victor Sahin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Victor Sahin resolved SPARK-25967. -- Resolution: Feedback Received > sql.functions.trim() should remove trailing and leading tabs

[jira] [Updated] (SPARK-25961) Random numbers are not supported when handling data skew

2018-11-08 Thread zengxl (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zengxl updated SPARK-25961: --- Summary: Random numbers are not supported when handling data skew (was: 处理数据倾斜时使用随机数不支持) > Random numbers

[jira] [Created] (SPARK-25968) Non-codegen Floor and Ceil fail for FloatType

2018-11-08 Thread Juliusz Sompolski (JIRA)
Juliusz Sompolski created SPARK-25968: - Summary: Non-codegen Floor and Ceil fail for FloatType Key: SPARK-25968 URL: https://issues.apache.org/jira/browse/SPARK-25968 Project: Spark

[jira] [Commented] (SPARK-25961) Random numbers are not supported when handling data skew

2018-11-08 Thread Kris Mok (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680152#comment-16680152 ] Kris Mok commented on SPARK-25961: -- It looks like the current restriction makes sense, because the

[jira] [Assigned] (SPARK-25977) Parsing decimals from CSV using locale

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25977: Assignee: (was: Apache Spark) > Parsing decimals from CSV using locale >

[jira] [Assigned] (SPARK-25962) Specify minimum versions for both pydocstyle and flake8 in 'lint-python' script

2018-11-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-25962: Assignee: Hyukjin Kwon > Specify minimum versions for both pydocstyle and flake8 in

[jira] [Updated] (SPARK-23831) Add org.apache.derby to IsolatedClientLoader

2018-11-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-23831: - Fix Version/s: (was: 2.4.0) > Add org.apache.derby to IsolatedClientLoader >

[jira] [Commented] (SPARK-25967) sql.functions.trim() should remove trailing and leading tabs

2018-11-08 Thread Victor Sahin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679083#comment-16679083 ] Victor Sahin commented on SPARK-25967: -- I see. In that case I can close the issue. >

[jira] [Commented] (SPARK-25974) Optimizes Generates bytecode for ordering based on the given order

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679580#comment-16679580 ] Apache Spark commented on SPARK-25974: -- User 'heary-cao' has created a pull request for this issue:

[jira] [Created] (SPARK-25974) Optimizes Generates bytecode for ordering based on the given order

2018-11-08 Thread caoxuewen (JIRA)
caoxuewen created SPARK-25974: - Summary: Optimizes Generates bytecode for ordering based on the given order Key: SPARK-25974 URL: https://issues.apache.org/jira/browse/SPARK-25974 Project: Spark

[jira] [Commented] (SPARK-22450) Safely register class for mllib

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679467#comment-16679467 ] Apache Spark commented on SPARK-22450: -- User 'zhengruifeng' has created a pull request for this

[jira] [Updated] (SPARK-25976) Allow rdd.reduce on empty rdd by returning an Option[T]

2018-11-08 Thread Yuval Yaari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuval Yaari updated SPARK-25976: Description: it is sometimes useful to let the user decide what value to return when reducing on

[jira] [Assigned] (SPARK-25972) Missed JSON options in streaming.py

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25972: Assignee: Apache Spark > Missed JSON options in streaming.py >

[jira] [Created] (SPARK-25982) Dataframe write is non blocking in fair scheduling mode

2018-11-08 Thread Ramandeep Singh (JIRA)
Ramandeep Singh created SPARK-25982: --- Summary: Dataframe write is non blocking in fair scheduling mode Key: SPARK-25982 URL: https://issues.apache.org/jira/browse/SPARK-25982 Project: Spark

[jira] [Commented] (SPARK-25960) Support subpath mounting with Kubernetes

2018-11-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679242#comment-16679242 ] Dongjoon Hyun commented on SPARK-25960: --- Hi, [~tnachen]. I updated the field because the next

[jira] [Assigned] (SPARK-25971) Ignore partition byte-size statistics in SQLQueryTestSuite

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25971: Assignee: Apache Spark > Ignore partition byte-size statistics in SQLQueryTestSuite >

[jira] [Commented] (SPARK-25966) "EOF Reached the end of stream with bytes left to read" while reading/writing to Parquets

2018-11-08 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679611#comment-16679611 ] Steve Loughran commented on SPARK-25966: bq. It looks to me like a problem in closing the file

[jira] [Commented] (SPARK-22827) Avoid throwing OutOfMemoryError in case of exception in spill

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679307#comment-16679307 ] Apache Spark commented on SPARK-22827: -- User 'ueshin' has created a pull request for this issue:

[jira] [Created] (SPARK-25983) spark-sql-kafka-0-10 no longer works with Kafka 0.10.0

2018-11-08 Thread Alexander Bessonov (JIRA)
Alexander Bessonov created SPARK-25983: -- Summary: spark-sql-kafka-0-10 no longer works with Kafka 0.10.0 Key: SPARK-25983 URL: https://issues.apache.org/jira/browse/SPARK-25983 Project: Spark

[jira] [Assigned] (SPARK-25965) Add read benchmark for Avro

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25965: Assignee: Apache Spark > Add read benchmark for Avro > --- > >

[jira] [Commented] (SPARK-24834) Utils#nanSafeCompare{Double,Float} functions do not differ from normal java double/float comparison

2018-11-08 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680170#comment-16680170 ] Matt Cheah commented on SPARK-24834: [~srowen] - I know this is an old ticket but I wanted to

[jira] [Resolved] (SPARK-25980) dev list mail server is down

2018-11-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-25980. - Resolution: Invalid > dev list mail server is down > > >

[jira] [Updated] (SPARK-25973) Spark History Main page performance improvement

2018-11-08 Thread William Montaz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] William Montaz updated SPARK-25973: --- Priority: Minor (was: Major) > Spark History Main page performance improvement >

[jira] [Commented] (SPARK-25972) Missed JSON options in streaming.py

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679434#comment-16679434 ] Apache Spark commented on SPARK-25972: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Issue Comment Deleted] (SPARK-25971) Ignore partition byte-size statistics in SQLQueryTestSuite

2018-11-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25971: -- Comment: was deleted (was: User 'dongjoon-hyun' has created a pull request for this issue:

[jira] [Commented] (SPARK-25958) error: [Errno 97] Address family not supported by protocol in dataframe.take()

2018-11-08 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679261#comment-16679261 ] Ruslan Dautkhanov commented on SPARK-25958: --- [~XuanYuan] interesting.. here's our /etc/hosts:

[jira] [Commented] (SPARK-25973) Spark History Main page performance improvement

2018-11-08 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679639#comment-16679639 ] Yuming Wang commented on SPARK-25973: - Please create a pull request: 

[jira] [Resolved] (SPARK-25908) Remove old deprecated items in Spark 3

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-25908. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 22921

[jira] [Resolved] (SPARK-25968) Non-codegen Floor and Ceil fail for FloatType

2018-11-08 Thread Juliusz Sompolski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Juliusz Sompolski resolved SPARK-25968. --- Resolution: Won't Fix Ok, I see it's not supposed to handle it, but type gets

[jira] [Commented] (SPARK-25980) dev list mail server is down

2018-11-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679997#comment-16679997 ] Wenchen Fan commented on SPARK-25980: - sorry I opened the ticket at a wrong place. Will open a new

[jira] [Comment Edited] (SPARK-24421) sun.misc.Unsafe in JDK11

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680169#comment-16680169 ] Sean Owen edited comment on SPARK-24421 at 11/8/18 7:56 PM: I've found that,

[jira] [Commented] (SPARK-25975) Spark History does not display necessarily the incomplete applications when requested

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679782#comment-16679782 ] Apache Spark commented on SPARK-25975: -- User 'Willymontaz' has created a pull request for this

[jira] [Assigned] (SPARK-25897) Cannot run k8s integration tests in sbt

2018-11-08 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-25897: -- Assignee: Marcelo Vanzin > Cannot run k8s integration tests in sbt >

[jira] [Assigned] (SPARK-25975) Spark History does not display necessarily the incomplete applications when requested

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25975: Assignee: Apache Spark > Spark History does not display necessarily the incomplete

[jira] [Assigned] (SPARK-25955) Porting JSON test for CSV functions

2018-11-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-25955: Assignee: Maxim Gekk > Porting JSON test for CSV functions >

[jira] [Created] (SPARK-25971) Ignore partition byte-size statistics in SQLQueryTestSuite

2018-11-08 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-25971: - Summary: Ignore partition byte-size statistics in SQLQueryTestSuite Key: SPARK-25971 URL: https://issues.apache.org/jira/browse/SPARK-25971 Project: Spark

[jira] [Commented] (SPARK-25959) Difference in featureImportances results on computed vs saved models

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679913#comment-16679913 ] Apache Spark commented on SPARK-25959: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Created] (SPARK-25975) Spark History does not display necessarily the incomplete applications when requested

2018-11-08 Thread William Montaz (JIRA)
William Montaz created SPARK-25975: -- Summary: Spark History does not display necessarily the incomplete applications when requested Key: SPARK-25975 URL: https://issues.apache.org/jira/browse/SPARK-25975

[jira] [Commented] (SPARK-24437) Memory leak in UnsafeHashedRelation

2018-11-08 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679745#comment-16679745 ] Marco Gaido commented on SPARK-24437: - [~dvogelbacher] the point is: a broadcast is never

[jira] [Assigned] (SPARK-25965) Add read benchmark for Avro

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25965: Assignee: (was: Apache Spark) > Add read benchmark for Avro >

[jira] [Commented] (SPARK-25961) 处理数据倾斜时使用随机数不支持

2018-11-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679237#comment-16679237 ] Dongjoon Hyun commented on SPARK-25961: --- [~zengxl]. Please use English in Apache Spark JIRA. >

[jira] [Updated] (SPARK-25973) Spark History Main page performance improvement

2018-11-08 Thread William Montaz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] William Montaz updated SPARK-25973: --- Description: HistoryPage.scala counts applications (with a predicate depending on if it is

[jira] [Updated] (SPARK-25332) Instead of broadcast hash join ,Sort merge join has selected when restart spark-shell/spark-JDBC for hive provider

2018-11-08 Thread Babulal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Babulal updated SPARK-25332: Priority: Critical (was: Major) > Instead of broadcast hash join ,Sort merge join has selected when

[jira] [Resolved] (SPARK-24529) Add spotbugs into maven build process

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-24529. --- Resolution: Won't Fix Looks like this slows down the build too much > Add spotbugs into maven

[jira] [Reopened] (SPARK-23831) Add org.apache.derby to IsolatedClientLoader

2018-11-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reopened SPARK-23831: -- Assignee: (was: Yuming Wang) > Add org.apache.derby to IsolatedClientLoader >

[jira] [Resolved] (SPARK-25952) from_json returns wrong result if corrupt record column is in the middle of schema

2018-11-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25952. -- Resolution: Fixed Assignee: Maxim Gekk Fix Version/s: 3.0.0 Fixed in

[jira] [Commented] (SPARK-24437) Memory leak in UnsafeHashedRelation

2018-11-08 Thread Eyal Farago (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679560#comment-16679560 ] Eyal Farago commented on SPARK-24437: - [~dvogelbacher], what about the _checkpoint_ approach?

[jira] [Updated] (SPARK-25973) Spark History Main page performance improvment

2018-11-08 Thread William Montaz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] William Montaz updated SPARK-25973: --- Attachment: fix.patch > Spark History Main page performance improvment >

[jira] [Commented] (SPARK-25332) Instead of broadcast hash join ,Sort merge join has selected when restart spark-shell/spark-JDBC for hive provider

2018-11-08 Thread Babulal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679574#comment-16679574 ] Babulal commented on SPARK-25332: - Since issue impacting performance degradation so marking as

[jira] [Commented] (SPARK-25977) Parsing decimals from CSV using locale

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679710#comment-16679710 ] Apache Spark commented on SPARK-25977: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-24421) sun.misc.Unsafe in JDK11

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680169#comment-16680169 ] Sean Owen edited comment on SPARK-24421 at 11/8/18 6:44 PM: I've found that,

[jira] [Commented] (SPARK-22827) Avoid throwing OutOfMemoryError in case of exception in spill

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679306#comment-16679306 ] Apache Spark commented on SPARK-22827: -- User 'ueshin' has created a pull request for this issue:

[jira] [Commented] (SPARK-25971) Ignore partition byte-size statistics in SQLQueryTestSuite

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679397#comment-16679397 ] Apache Spark commented on SPARK-25971: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Created] (SPARK-25976) Allow rdd.reduce on empty rdd by returning an Option[T]

2018-11-08 Thread Yuval Yaari (JIRA)
Yuval Yaari created SPARK-25976: --- Summary: Allow rdd.reduce on empty rdd by returning an Option[T] Key: SPARK-25976 URL: https://issues.apache.org/jira/browse/SPARK-25976 Project: Spark Issue

[jira] [Assigned] (SPARK-25959) Difference in featureImportances results on computed vs saved models

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25959: Assignee: Apache Spark > Difference in featureImportances results on computed vs saved

[jira] [Commented] (SPARK-25959) Difference in featureImportances results on computed vs saved models

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679911#comment-16679911 ] Apache Spark commented on SPARK-25959: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Resolved] (SPARK-25969) pyspark deal with large data memory issues

2018-11-08 Thread zhao yufei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao yufei resolved SPARK-25969. Resolution: Resolved > pyspark deal with large data memory issues >

[jira] [Created] (SPARK-25973) Spark History Main page performance improvment

2018-11-08 Thread William Montaz (JIRA)
William Montaz created SPARK-25973: -- Summary: Spark History Main page performance improvment Key: SPARK-25973 URL: https://issues.apache.org/jira/browse/SPARK-25973 Project: Spark Issue

[jira] [Commented] (SPARK-25676) Refactor BenchmarkWideTable to use main method

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679646#comment-16679646 ] Apache Spark commented on SPARK-25676: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-25958) error: [Errno 97] Address family not supported by protocol in dataframe.take()

2018-11-08 Thread Yuanjian Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679169#comment-16679169 ] Yuanjian Li commented on SPARK-25958: - We also meet this problem in internal folk and fixed by

[jira] [Created] (SPARK-25969) pyspark deal with large data memory issues

2018-11-08 Thread zhao yufei (JIRA)
zhao yufei created SPARK-25969: -- Summary: pyspark deal with large data memory issues Key: SPARK-25969 URL: https://issues.apache.org/jira/browse/SPARK-25969 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-24421) sun.misc.Unsafe in JDK11

2018-11-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-24421: Summary: sun.misc.Unsafe in JDK11 (was: sun.misc.Unsafe in JDK9+) > sun.misc.Unsafe in JDK11 >

[jira] [Commented] (SPARK-24421) sun.misc.Unsafe in JDK11

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680169#comment-16680169 ] Sean Owen commented on SPARK-24421: --- I've found that, actually, we can't even access clean() with

[jira] [Commented] (SPARK-24421) sun.misc.Unsafe in JDK11

2018-11-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680117#comment-16680117 ] DB Tsai commented on SPARK-24421: - [~srowen] Great news! Thanks for looking at this. Out of my

[jira] [Created] (SPARK-25978) Pyspark can only be used in spark-submit in spark-py docker image for kubernetes

2018-11-08 Thread Maxime Nannan (JIRA)
Maxime Nannan created SPARK-25978: - Summary: Pyspark can only be used in spark-submit in spark-py docker image for kubernetes Key: SPARK-25978 URL: https://issues.apache.org/jira/browse/SPARK-25978

[jira] [Commented] (SPARK-9686) Spark Thrift server doesn't return correct JDBC metadata

2018-11-08 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679796#comment-16679796 ] Yuming Wang commented on SPARK-9686: This is my fix: Implement Spark's own GetSchemasOperation:

[jira] [Commented] (SPARK-25965) Add read benchmark for Avro

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679333#comment-16679333 ] Apache Spark commented on SPARK-25965: -- User 'gengliangwang' has created a pull request for this

[jira] [Created] (SPARK-25977) Parsing decimals from CSV using locale

2018-11-08 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-25977: -- Summary: Parsing decimals from CSV using locale Key: SPARK-25977 URL: https://issues.apache.org/jira/browse/SPARK-25977 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-24421) sun.misc.Unsafe in JDK11

2018-11-08 Thread Alan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680355#comment-16680355 ] Alan commented on SPARK-24421: -- The comment that sun.misc.Unsafe is private and not accessible in JDK 9 or

[jira] [Created] (SPARK-25980) dev list mail server is down

2018-11-08 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-25980: --- Summary: dev list mail server is down Key: SPARK-25980 URL: https://issues.apache.org/jira/browse/SPARK-25980 Project: Spark Issue Type: IT Help

[jira] [Assigned] (SPARK-25974) Optimizes Generates bytecode for ordering based on the given order

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25974: Assignee: (was: Apache Spark) > Optimizes Generates bytecode for ordering based on

[jira] [Updated] (SPARK-25973) Spark History Main page performance improvement

2018-11-08 Thread William Montaz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] William Montaz updated SPARK-25973: --- Summary: Spark History Main page performance improvement (was: Spark History Main page

[jira] [Commented] (SPARK-25958) error: [Errno 97] Address family not supported by protocol in dataframe.take()

2018-11-08 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679263#comment-16679263 ] Ruslan Dautkhanov commented on SPARK-25958: --- I just removed ipv6 reference ::1 in /etc/hosts

[jira] [Assigned] (SPARK-25971) Ignore partition byte-size statistics in SQLQueryTestSuite

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25971: Assignee: (was: Apache Spark) > Ignore partition byte-size statistics in

[jira] [Resolved] (SPARK-25964) Revise OrcReadBenchmark/DataSourceReadBenchmark case names and execution instructions

2018-11-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-25964. --- Resolution: Fixed Assignee: Gengliang Wang Fix Version/s: 3.0.0 This is

[jira] [Commented] (SPARK-20156) Java String toLowerCase "Turkish locale bug" causes Spark problems

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679542#comment-16679542 ] Apache Spark commented on SPARK-20156: -- User 'zhengruifeng' has created a pull request for this

[jira] [Assigned] (SPARK-25972) Missed JSON options in streaming.py

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25972: Assignee: (was: Apache Spark) > Missed JSON options in streaming.py >

[jira] [Commented] (SPARK-25973) Spark History Main page performance improvement

2018-11-08 Thread William Montaz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679772#comment-16679772 ] William Montaz commented on SPARK-25973: Ok created https://github.com/apache/spark/pull/22980

[jira] [Commented] (SPARK-16759) Spark expose an API to pass in Caller Context into it

2018-11-08 Thread Aihua Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679135#comment-16679135 ] Aihua Xu commented on SPARK-16759: -- Seems we should implement this callerContext in SparkContext rather

[jira] [Commented] (SPARK-25975) Spark History does not display necessarily the incomplete applications when requested

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679781#comment-16679781 ] Apache Spark commented on SPARK-25975: -- User 'Willymontaz' has created a pull request for this

[jira] [Updated] (SPARK-24421) Accessing sun.misc.Cleaner in JDK11

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24421: -- Affects Version/s: (was: 2.3.0) 3.0.0 > Accessing sun.misc.Cleaner in

[jira] [Created] (SPARK-25984) Remove deprecated .newInstance(), primitive box class constructor calls

2018-11-08 Thread Sean Owen (JIRA)
Sean Owen created SPARK-25984: - Summary: Remove deprecated .newInstance(), primitive box class constructor calls Key: SPARK-25984 URL: https://issues.apache.org/jira/browse/SPARK-25984 Project: Spark

[jira] [Commented] (SPARK-25971) Ignore partition byte-size statistics in SQLQueryTestSuite

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679400#comment-16679400 ] Apache Spark commented on SPARK-25971: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-25979) Window function: allow parentheses around window reference

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679977#comment-16679977 ] Apache Spark commented on SPARK-25979: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-24421) sun.misc.Unsafe in JDK11

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680439#comment-16680439 ] Sean Owen commented on SPARK-24421: --- Hey [~Bateman], thank you, yeah your comments and my edits just

[jira] [Assigned] (SPARK-25979) Window function: allow parentheses around window reference

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25979: Assignee: (was: Apache Spark) > Window function: allow parentheses around window

[jira] [Resolved] (SPARK-25971) Ignore partition byte-size statistics in SQLQueryTestSuite

2018-11-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-25971. --- Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 3.0.0 This is

[jira] [Updated] (SPARK-24421) Accessing sun.misc.Cleaner in JDK11

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24421: -- Summary: Accessing sun.misc.Cleaner in JDK11 (was: sun.misc.Unsafe in JDK11) > Accessing

[jira] [Commented] (SPARK-25904) Avoid allocating arrays too large for JVMs

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679861#comment-16679861 ] Apache Spark commented on SPARK-25904: -- User 'squito' has created a pull request for this issue:

[jira] [Assigned] (SPARK-25970) Add Instrumentation to PrefixSpan

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25970: Assignee: (was: Apache Spark) > Add Instrumentation to PrefixSpan >

[jira] [Commented] (SPARK-25973) Spark History Main page performance improvement

2018-11-08 Thread William Montaz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679853#comment-16679853 ] William Montaz commented on SPARK-25973: New pull request on master branch 

[jira] [Commented] (SPARK-25965) Add read benchmark for Avro

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16679331#comment-16679331 ] Apache Spark commented on SPARK-25965: -- User 'gengliangwang' has created a pull request for this

[jira] [Updated] (SPARK-25904) Avoid allocating arrays too large for JVMs

2018-11-08 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-25904: - Fix Version/s: 2.4.1 > Avoid allocating arrays too large for JVMs >

[jira] [Assigned] (SPARK-25984) Remove deprecated .newInstance(), primitive box class constructor calls

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25984: Assignee: Sean Owen (was: Apache Spark) > Remove deprecated .newInstance(), primitive

[jira] [Commented] (SPARK-25984) Remove deprecated .newInstance(), primitive box class constructor calls

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16680572#comment-16680572 ] Apache Spark commented on SPARK-25984: -- User 'srowen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-25984) Remove deprecated .newInstance(), primitive box class constructor calls

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25984: Assignee: Apache Spark (was: Sean Owen) > Remove deprecated .newInstance(), primitive

[jira] [Updated] (SPARK-25984) Remove deprecated .newInstance(), primitive wrapper class constructor calls

2018-11-08 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-25984: -- Description: While working on JDK 11 support, I noticed a lot of new deprecation warnings. 80% of

[jira] [Assigned] (SPARK-25975) Spark History does not display necessarily the incomplete applications when requested

2018-11-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-25975: Assignee: (was: Apache Spark) > Spark History does not display necessarily the

  1   2   >