[jira] [Commented] (SPARK-21720) Filter predicate with many conditions throw stackoverflow error

2017-08-13 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125253#comment-16125253 ] Kazuaki Ishizaki commented on SPARK-21720: -- I confirmed that this occurs in the master branch. I

[jira] [Commented] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125217#comment-16125217 ] Liang-Chi Hsieh commented on SPARK-21721: - Submitted a PR at

[jira] [Created] (SPARK-21722) Enable timezone-aware timestamp type when creating Pandas DataFrame.

2017-08-13 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-21722: - Summary: Enable timezone-aware timestamp type when creating Pandas DataFrame. Key: SPARK-21722 URL: https://issues.apache.org/jira/browse/SPARK-21722 Project:

[jira] [Updated] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzheng616 updated SPARK-21721: -- Description: The leak came from org.apache.spark.sql.hive.execution.InsertIntoHiveTable. At line 118,

[jira] [Updated] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzheng616 updated SPARK-21721: -- Description: The leak came from org.apache.spark.sql.hive.execution.InsertIntoHiveTable. At line 118,

[jira] [Updated] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzheng616 updated SPARK-21721: -- Description: The leak came from org.apache.spark.sql.hive.execution.InsertIntoHiveTable. At line 118,

[jira] [Updated] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzheng616 updated SPARK-21721: -- Description: The leak came from org.apache.spark.sql.hive.execution.InsertIntoHiveTable. At line 118,

[jira] [Updated] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzheng616 updated SPARK-21721: -- Description: The leak came from org.apache.spark.sql.hive.execution.InsertIntoHiveTable. At line 118,

[jira] [Updated] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzheng616 updated SPARK-21721: -- Description: The leak came from org.apache.spark.sql.hive.execution.InsertIntoHiveTable. At line 118,

[jira] [Updated] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzheng616 updated SPARK-21721: -- Description: The leak came from org.apache.spark.sql.hive.execution.InsertIntoHiveTable. At line 118,

[jira] [Updated] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzheng616 updated SPARK-21721: -- Description: The leak came from org.apache.spark.sql.hive.execution.InsertIntoHiveTable. At line 118,

[jira] [Created] (SPARK-21721) Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable

2017-08-13 Thread yzheng616 (JIRA)
yzheng616 created SPARK-21721: - Summary: Memory leak in org.apache.spark.sql.hive.execution.InsertIntoHiveTable Key: SPARK-21721 URL: https://issues.apache.org/jira/browse/SPARK-21721 Project: Spark

[jira] [Comment Edited] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2017-08-13 Thread duyanghao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125150#comment-16125150 ] duyanghao edited comment on SPARK-18085 at 8/14/17 2:32 AM: [~vanzin] so

[jira] [Comment Edited] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2017-08-13 Thread duyanghao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125150#comment-16125150 ] duyanghao edited comment on SPARK-18085 at 8/14/17 2:31 AM: [~vanzin] so

[jira] [Comment Edited] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2017-08-13 Thread duyanghao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125150#comment-16125150 ] duyanghao edited comment on SPARK-18085 at 8/14/17 2:28 AM: [~vanzin] so

[jira] [Commented] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2017-08-13 Thread duyanghao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125150#comment-16125150 ] duyanghao commented on SPARK-18085: --- [~vanzin] so what you mean is that your project has done nothing

[jira] [Commented] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-08-13 Thread srinivasan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125144#comment-16125144 ] srinivasan commented on SPARK-19372: [~kiszk] I created a new ticket ,

[jira] [Updated] (SPARK-21720) Filter predicate with many conditions throw stackoverflow error

2017-08-13 Thread srinivasan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] srinivasan updated SPARK-21720: --- Summary: Filter predicate with many conditions throw stackoverflow error (was: FIlter predicate

[jira] [Created] (SPARK-21720) FIlter predicate with many conditions throw stackoverflow error

2017-08-13 Thread srinivasan (JIRA)
srinivasan created SPARK-21720: -- Summary: FIlter predicate with many conditions throw stackoverflow error Key: SPARK-21720 URL: https://issues.apache.org/jira/browse/SPARK-21720 Project: Spark

[jira] [Commented] (SPARK-21714) SparkSubmit in Yarn Client mode downloads remote files and then reuploads them again

2017-08-13 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125115#comment-16125115 ] Saisai Shao commented on SPARK-21714: - I noticed this issue before and tried to fix it, but the

[jira] [Updated] (SPARK-21657) Spark has exponential time complexity to explode(array of structs)

2017-08-13 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ruslan Dautkhanov updated SPARK-21657: -- Description: It can take up to half a day to explode a modest-sized nested collection

[jira] [Commented] (SPARK-19256) Hive bucketing support

2017-08-13 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16125084#comment-16125084 ] Tejas Patil commented on SPARK-19256: - After the refactoring of the insertion plan node has been

[jira] [Comment Edited] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-08-13 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16124982#comment-16124982 ] Kazuaki Ishizaki edited comment on SPARK-19372 at 8/13/17 5:05 PM: ---

[jira] [Commented] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-08-13 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16124982#comment-16124982 ] Kazuaki Ishizaki commented on SPARK-19372: -- [~srinivasanm] I can reproduce this issue by using

[jira] [Created] (SPARK-21719) Enable complex expression in Column.getItem(...)

2017-08-13 Thread Dean Gurvitz (JIRA)
Dean Gurvitz created SPARK-21719: Summary: Enable complex expression in Column.getItem(...) Key: SPARK-21719 URL: https://issues.apache.org/jira/browse/SPARK-21719 Project: Spark Issue Type:

[jira] [Updated] (SPARK-21718) Heavy log of type: "Skipping partition based on stats ..."

2017-08-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-21718: -- Priority: Trivial (was: Major) CC [~lian cheng] > Heavy log of type: "Skipping partition based on

[jira] [Commented] (SPARK-21718) Heavy log of type: "Skipping partition based on stats ..."

2017-08-13 Thread Gian Lorenzo Meocci (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16124857#comment-16124857 ] Gian Lorenzo Meocci commented on SPARK-21718: -

[jira] [Commented] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-08-13 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16124856#comment-16124856 ] Kazuaki Ishizaki commented on SPARK-19372: -- Thank you for letting us know the problem. I

[jira] [Created] (SPARK-21718) Heavy log of type: "Skipping partition based on stats ..."

2017-08-13 Thread Gian Lorenzo Meocci (JIRA)
Gian Lorenzo Meocci created SPARK-21718: --- Summary: Heavy log of type: "Skipping partition based on stats ..." Key: SPARK-21718 URL: https://issues.apache.org/jira/browse/SPARK-21718 Project:

[jira] [Resolved] (SPARK-21711) spark-submit command should accept log4j configuration parameters for spark client logging.

2017-08-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21711. --- Resolution: Not A Problem OK, good to know > spark-submit command should accept log4j configuration

[jira] [Created] (SPARK-21717) Decouple the generated codes of consuming rows in operators under whole-stage codegen

2017-08-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-21717: --- Summary: Decouple the generated codes of consuming rows in operators under whole-stage codegen Key: SPARK-21717 URL: https://issues.apache.org/jira/browse/SPARK-21717