[jira] [Commented] (SPARK-29158) Expose SerializableConfiguration for DSv2

2019-12-16 Thread Jorge Machado (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997947#comment-16997947 ] Jorge Machado commented on SPARK-29158: --- How can we get SerializableConfiguration with 2.4.4 ? Any

[jira] [Created] (SPARK-30283) V2 Command logical plan should use UnresolvedV2Relation for a table

2019-12-16 Thread Terry Kim (Jira)
Terry Kim created SPARK-30283: - Summary: V2 Command logical plan should use UnresolvedV2Relation for a table Key: SPARK-30283 URL: https://issues.apache.org/jira/browse/SPARK-30283 Project: Spark

[jira] [Commented] (SPARK-16183) Large Spark SQL commands cause StackOverflowError in parser when using sqlContext.sql

2019-12-16 Thread Shubhradeep Majumdar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997931#comment-16997931 ] Shubhradeep Majumdar commented on SPARK-16183: -- Yes, the issue still exists in Spark 2.4.0.

[jira] [Created] (SPARK-30282) UnresolvedV2Relation should be resolved to temp view first

2019-12-16 Thread Terry Kim (Jira)
Terry Kim created SPARK-30282: - Summary: UnresolvedV2Relation should be resolved to temp view first Key: SPARK-30282 URL: https://issues.apache.org/jira/browse/SPARK-30282 Project: Spark Issue

[jira] [Commented] (SPARK-30281) 'archive' option in FileStreamSource misses to consider partitioned and recursive option

2019-12-16 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997905#comment-16997905 ] Jungtaek Lim commented on SPARK-30281: -- Will submit a PR soon. > 'archive' option in

[jira] [Created] (SPARK-30281) 'archive' option in FileStreamSource misses to consider partitioned and recursive option

2019-12-16 Thread Jungtaek Lim (Jira)
Jungtaek Lim created SPARK-30281: Summary: 'archive' option in FileStreamSource misses to consider partitioned and recursive option Key: SPARK-30281 URL: https://issues.apache.org/jira/browse/SPARK-30281

[jira] [Created] (SPARK-30280) Update documentation

2019-12-16 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-30280: --- Summary: Update documentation Key: SPARK-30280 URL: https://issues.apache.org/jira/browse/SPARK-30280 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-16996) Hive ACID delta files not seen

2019-12-16 Thread SandhyaMora (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997886#comment-16997886 ] SandhyaMora commented on SPARK-16996: - Any Update on writing data into Hive ACID tables form spark ?

[jira] [Resolved] (SPARK-30201) HiveOutputWriter standardOI should use ObjectInspectorCopyOption.DEFAULT

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-30201. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26831

[jira] [Assigned] (SPARK-30201) HiveOutputWriter standardOI should use ObjectInspectorCopyOption.DEFAULT

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-30201: --- Assignee: ulysses you > HiveOutputWriter standardOI should use

[jira] [Created] (SPARK-30279) Support 32 or more grouping attributes for GROUPING_ID

2019-12-16 Thread Takeshi Yamamuro (Jira)
Takeshi Yamamuro created SPARK-30279: Summary: Support 32 or more grouping attributes for GROUPING_ID Key: SPARK-30279 URL: https://issues.apache.org/jira/browse/SPARK-30279 Project: Spark

[jira] [Assigned] (SPARK-30094) Current namespace is not used during table resolution

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-30094: --- Assignee: Terry Kim > Current namespace is not used during table resolution >

[jira] [Resolved] (SPARK-30094) Current namespace is not used during table resolution

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-30094. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26894

[jira] [Resolved] (SPARK-30277) NoSuchMethodError in Spark 3.0.0-preview with Delta Lake

2019-12-16 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin resolved SPARK-30277. Resolution: Not A Problem That's a internal Spark class, which means that

[jira] [Created] (SPARK-30278) Update Spark SQL document menu for new changes

2019-12-16 Thread Yuanjian Li (Jira)
Yuanjian Li created SPARK-30278: --- Summary: Update Spark SQL document menu for new changes Key: SPARK-30278 URL: https://issues.apache.org/jira/browse/SPARK-30278 Project: Spark Issue Type:

[jira] [Created] (SPARK-30277) NoSuchMethodError in Spark 3.0.0-preview with Delta Lake

2019-12-16 Thread Victor Zhang (Jira)
Victor Zhang created SPARK-30277: Summary: NoSuchMethodError in Spark 3.0.0-preview with Delta Lake Key: SPARK-30277 URL: https://issues.apache.org/jira/browse/SPARK-30277 Project: Spark

[jira] [Commented] (SPARK-6235) Address various 2G limits

2019-12-16 Thread Samuel Shepard (Jira)
[ https://issues.apache.org/jira/browse/SPARK-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997798#comment-16997798 ] Samuel Shepard commented on SPARK-6235: --- [~irashid] I meant the former (task result > 2G) as best I

[jira] [Resolved] (SPARK-29164) Rewrite coalesce(boolean, booleanLit) as boolean expression

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29164. -- Resolution: Won't Fix > Rewrite coalesce(boolean, booleanLit) as boolean expression >

[jira] [Commented] (SPARK-29164) Rewrite coalesce(boolean, booleanLit) as boolean expression

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997791#comment-16997791 ] Hyukjin Kwon commented on SPARK-29164: -- Resolving per the discussion in the PR. > Rewrite

[jira] [Updated] (SPARK-30181) Throws runtime exception when filter metastore partition key that's not string type or integral types

2019-12-16 Thread Yu-Jhe Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu-Jhe Li updated SPARK-30181: -- Description: SQL below will throw a runtime exception since spark-2.4.0. I think it's a bug brought

[jira] [Updated] (SPARK-30276) Support Filter expression allows simultaneous use of DISTINCT

2019-12-16 Thread jiaan.geng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiaan.geng updated SPARK-30276: --- Summary: Support Filter expression allows simultaneous use of DISTINCT (was: Support Filter

[jira] [Created] (SPARK-30276) Support Filter expression allow simultaneous use of DISTINCT

2019-12-16 Thread jiaan.geng (Jira)
jiaan.geng created SPARK-30276: -- Summary: Support Filter expression allow simultaneous use of DISTINCT Key: SPARK-30276 URL: https://issues.apache.org/jira/browse/SPARK-30276 Project: Spark

[jira] [Commented] (SPARK-30276) Support Filter expression allow simultaneous use of DISTINCT

2019-12-16 Thread jiaan.geng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997785#comment-16997785 ] jiaan.geng commented on SPARK-30276: I'm working. > Support Filter expression allow simultaneous

[jira] [Created] (SPARK-30275) Add gitlab-ci.yml file for reproducible builds

2019-12-16 Thread Jim Kleckner (Jira)
Jim Kleckner created SPARK-30275: Summary: Add gitlab-ci.yml file for reproducible builds Key: SPARK-30275 URL: https://issues.apache.org/jira/browse/SPARK-30275 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-30233) Spark WebUI task table indentation issue

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-30233. -- Resolution: Duplicate > Spark WebUI task table indentation issue >

[jira] [Commented] (SPARK-30239) Creating a dataframe with Pandas rather than Numpy datatypes fails

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997782#comment-16997782 ] Hyukjin Kwon commented on SPARK-30239: -- Can you show the self-contained reproducer? > Creating a

[jira] [Commented] (SPARK-30242) Support reading Parquet files from Stream Buffer

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997779#comment-16997779 ] Hyukjin Kwon commented on SPARK-30242: -- Nope, I don't think it will be able as it requires to

[jira] [Resolved] (SPARK-30242) Support reading Parquet files from Stream Buffer

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-30242. -- Resolution: Won't Fix > Support reading Parquet files from Stream Buffer >

[jira] [Commented] (SPARK-30249) Invalid Column Names in parquet tables should not be allowed

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997774#comment-16997774 ] Hyukjin Kwon commented on SPARK-30249: -- It seems to be valid in Parquet: {code} scala>

[jira] [Resolved] (SPARK-30249) Invalid Column Names in parquet tables should not be allowed

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-30249. -- Resolution: Not A Problem > Invalid Column Names in parquet tables should not be allowed >

[jira] [Updated] (SPARK-30264) Unexpected behaviour when using persist MEMORY_ONLY in RDD

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-30264: - Affects Version/s: 2.4.4 > Unexpected behaviour when using persist MEMORY_ONLY in RDD >

[jira] [Resolved] (SPARK-30270) Can't pickle abstract classes (with cloudpickle)

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-30270. -- Resolution: Cannot Reproduce I can confirm that's fixed and cannot be reproduced in the

[jira] [Created] (SPARK-30274) Avoid BytesToBytesMap lookup hang forever when holding keys reaching max capacity

2019-12-16 Thread L. C. Hsieh (Jira)
L. C. Hsieh created SPARK-30274: --- Summary: Avoid BytesToBytesMap lookup hang forever when holding keys reaching max capacity Key: SPARK-30274 URL: https://issues.apache.org/jira/browse/SPARK-30274

[jira] [Resolved] (SPARK-30268) Incorrect pyspark package name when releasing preview version

2019-12-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-30268. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26909

[jira] [Commented] (SPARK-30171) Eliminate warnings: part2

2019-12-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997740#comment-16997740 ] Sean R. Owen commented on SPARK-30171: -- Is this a dupe of SPARK-30258? > Eliminate warnings: part2

[jira] [Resolved] (SPARK-30258) Eliminate warnings of deprecated Spark APIs in tests

2019-12-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-30258. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26885

[jira] [Assigned] (SPARK-30258) Eliminate warnings of deprecated Spark APIs in tests

2019-12-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-30258: Assignee: Maxim Gekk > Eliminate warnings of deprecated Spark APIs in tests >

[jira] [Assigned] (SPARK-30247) GaussianMixtureModel in py side should expose gaussian

2019-12-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-30247: Assignee: Huaxin Gao > GaussianMixtureModel in py side should expose gaussian >

[jira] [Resolved] (SPARK-30247) GaussianMixtureModel in py side should expose gaussian

2019-12-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-30247. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26882

[jira] [Commented] (SPARK-23015) spark-submit fails when submitting several jobs in parallel

2019-12-16 Thread Kevin Grealish (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997724#comment-16997724 ] Kevin Grealish commented on SPARK-23015: Here is something that may help craft a complete

[jira] [Commented] (SPARK-25392) [Spark Job History]Inconsistent behaviour for pool details in spark web UI and history server page

2019-12-16 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997698#comment-16997698 ] Marcelo Masiero Vanzin commented on SPARK-25392: The fix basically hides pool details

[jira] [Assigned] (SPARK-25392) [Spark Job History]Inconsistent behaviour for pool details in spark web UI and history server page

2019-12-16 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin reassigned SPARK-25392: -- Assignee: shahid > [Spark Job History]Inconsistent behaviour for

[jira] [Resolved] (SPARK-25392) [Spark Job History]Inconsistent behaviour for pool details in spark web UI and history server page

2019-12-16 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin resolved SPARK-25392. Fix Version/s: 3.0.0 2.4.5 Resolution: Fixed

[jira] [Assigned] (SPARK-29043) [History Server]Only one replay thread of FsHistoryProvider work because of straggler

2019-12-16 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin reassigned SPARK-29043: -- Assignee: feiwang > [History Server]Only one replay thread of

[jira] [Resolved] (SPARK-29043) [History Server]Only one replay thread of FsHistoryProvider work because of straggler

2019-12-16 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin resolved SPARK-29043. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Commented] (SPARK-23015) spark-submit fails when submitting several jobs in parallel

2019-12-16 Thread Kevin Grealish (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997669#comment-16997669 ] Kevin Grealish commented on SPARK-23015: %TIME% has a granularity of 10ms, so while this does

[jira] [Assigned] (SPARK-30209) Display stageId, attemptId, taskId with SQL max metric in UI

2019-12-16 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves reassigned SPARK-30209: - Assignee: Niranjan Artal > Display stageId, attemptId, taskId with SQL max metric in

[jira] [Resolved] (SPARK-30209) Display stageId, attemptId, taskId with SQL max metric in UI

2019-12-16 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-30209. --- Fix Version/s: 3.0.0 Resolution: Fixed > Display stageId, attemptId, taskId with SQL

[jira] [Commented] (SPARK-29106) Add jenkins arm test for spark

2019-12-16 Thread Shane Knapp (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997641#comment-16997641 ] Shane Knapp commented on SPARK-29106: - huh...  i created a new python 3.6 env, ran the python test

[jira] [Commented] (SPARK-6235) Address various 2G limits

2019-12-16 Thread Imran Rashid (Jira)
[ https://issues.apache.org/jira/browse/SPARK-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997631#comment-16997631 ] Imran Rashid commented on SPARK-6235: - [~sammysheep] are you discussing the use case for task results

[jira] [Updated] (SPARK-30273) Add melt() function

2019-12-16 Thread Shelby Vanhooser (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shelby Vanhooser updated SPARK-30273: - Description: - Adds melt() functionality based on

[jira] [Created] (SPARK-30273) Add melt() function

2019-12-16 Thread Shelby Vanhooser (Jira)
Shelby Vanhooser created SPARK-30273: Summary: Add melt() function Key: SPARK-30273 URL: https://issues.apache.org/jira/browse/SPARK-30273 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-30273) Add melt() function

2019-12-16 Thread Shelby Vanhooser (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shelby Vanhooser updated SPARK-30273: - Labels: PySpark feature (was: ) > Add melt() function > --- > >

[jira] [Comment Edited] (SPARK-23015) spark-submit fails when submitting several jobs in parallel

2019-12-16 Thread Evgenii (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997604#comment-16997604 ] Evgenii edited comment on SPARK-23015 at 12/16/19 8:01 PM: --- Here is working

[jira] [Comment Edited] (SPARK-23015) spark-submit fails when submitting several jobs in parallel

2019-12-16 Thread Evgenii (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997604#comment-16997604 ] Evgenii edited comment on SPARK-23015 at 12/16/19 8:00 PM: --- Here is working

[jira] [Commented] (SPARK-23015) spark-submit fails when submitting several jobs in parallel

2019-12-16 Thread Evgenii (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997604#comment-16997604 ] Evgenii commented on SPARK-23015: - Here is working solution: set

[jira] [Created] (SPARK-30272) Remove usage of Guava that breaks in Guava 27

2019-12-16 Thread Sean R. Owen (Jira)
Sean R. Owen created SPARK-30272: Summary: Remove usage of Guava that breaks in Guava 27 Key: SPARK-30272 URL: https://issues.apache.org/jira/browse/SPARK-30272 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-29574) spark with user provided hadoop doesn't work on kubernetes

2019-12-16 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin reassigned SPARK-29574: -- Assignee: Shahin Shakeri > spark with user provided hadoop doesn't

[jira] [Resolved] (SPARK-29574) spark with user provided hadoop doesn't work on kubernetes

2019-12-16 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin resolved SPARK-29574. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Comment Edited] (SPARK-6235) Address various 2G limits

2019-12-16 Thread Samuel Shepard (Jira)
[ https://issues.apache.org/jira/browse/SPARK-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16995889#comment-16995889 ] Samuel Shepard edited comment on SPARK-6235 at 12/16/19 4:39 PM: -

[jira] [Commented] (SPARK-30264) Unexpected behaviour when using persist MEMORY_ONLY in RDD

2019-12-16 Thread moshe ohaion (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997396#comment-16997396 ] moshe ohaion commented on SPARK-30264: -- Steps to reproduce: # File users8.avro was created by

[jira] [Updated] (SPARK-30264) Unexpected behaviour when using persist MEMORY_ONLY in RDD

2019-12-16 Thread moshe ohaion (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] moshe ohaion updated SPARK-30264: - Attachment: GenericMain.java > Unexpected behaviour when using persist MEMORY_ONLY in RDD >

[jira] [Updated] (SPARK-30264) Unexpected behaviour when using persist MEMORY_ONLY in RDD

2019-12-16 Thread moshe ohaion (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] moshe ohaion updated SPARK-30264: - Attachment: users8.avro > Unexpected behaviour when using persist MEMORY_ONLY in RDD >

[jira] [Commented] (SPARK-30072) Create dedicated planner for subqueries

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997368#comment-16997368 ] Wenchen Fan commented on SPARK-30072: - > The nested subquery "SELECT max(df2.k) FROM df1 JOIN df2 ON

[jira] [Commented] (SPARK-30049) SQL fails to parse when comment contains an unmatched quote character

2019-12-16 Thread Oleg Bonar (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997350#comment-16997350 ] Oleg Bonar commented on SPARK-30049: I would like to investigate this issue. > SQL fails to parse

[jira] [Updated] (SPARK-30268) Incorrect pyspark package name when releasing preview version

2019-12-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-30268: Summary: Incorrect pyspark package name when releasing preview version (was: pyspark pyspark

[jira] [Created] (SPARK-30271) dynamic allocation won't release some executor in some case.

2019-12-16 Thread angerszhu (Jira)
angerszhu created SPARK-30271: - Summary: dynamic allocation won't release some executor in some case. Key: SPARK-30271 URL: https://issues.apache.org/jira/browse/SPARK-30271 Project: Spark

[jira] [Commented] (SPARK-30150) Manage resources (ADD/LIST) does not support quoted path

2019-12-16 Thread Rakesh Raushan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997314#comment-16997314 ] Rakesh Raushan commented on SPARK-30150: Thanks!! > Manage resources (ADD/LIST) does not

[jira] [Commented] (SPARK-16180) Task hang on fetching blocks (cached RDD)

2019-12-16 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997310#comment-16997310 ] angerszhu commented on SPARK-16180: --- i meet this problem recently in spark-2.4 > Task hang on

[jira] [Updated] (SPARK-30269) Should use old partition stats to decide whether to update stats when analyzing partition

2019-12-16 Thread Zhenhua Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-30269: - Summary: Should use old partition stats to decide whether to update stats when analyzing

[jira] [Updated] (SPARK-30270) Can't pickle abstract classes (with cloudpickle)

2019-12-16 Thread Sebastian Straub (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Straub updated SPARK-30270: - Description: I can't use any classes that are derived from abstract classes in PySpark,

[jira] [Created] (SPARK-30270) Can't pickle abstract classes (with cloudpickle)

2019-12-16 Thread Sebastian Straub (Jira)
Sebastian Straub created SPARK-30270: Summary: Can't pickle abstract classes (with cloudpickle) Key: SPARK-30270 URL: https://issues.apache.org/jira/browse/SPARK-30270 Project: Spark

[jira] [Created] (SPARK-30269) Should use old partition stats to compare when analyzing partition

2019-12-16 Thread Zhenhua Wang (Jira)
Zhenhua Wang created SPARK-30269: Summary: Should use old partition stats to compare when analyzing partition Key: SPARK-30269 URL: https://issues.apache.org/jira/browse/SPARK-30269 Project: Spark

[jira] [Commented] (SPARK-25250) Race condition with tasks running when new attempt for same stage is created leads to other task in the next attempt running on the same partition id retry multiple ti

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997306#comment-16997306 ] Wenchen Fan commented on SPARK-25250: - It's https://issues.apache.org/jira/browse/SPARK-27474 >

[jira] [Assigned] (SPARK-30150) Manage resources (ADD/LIST) does not support quoted path

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-30150: --- Assignee: Rakesh Raushan (was: jobit mathew) > Manage resources (ADD/LIST) does not

[jira] [Updated] (SPARK-30268) pyspark pyspark package name when releasing preview version

2019-12-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-30268: Description: {noformat} cp: cannot stat

[jira] [Commented] (SPARK-30223) queries in thrift server may read wrong SQL configs

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997298#comment-16997298 ] Wenchen Fan commented on SPARK-30223: - It's not possible to pass around the `SQLConf` object in all

[jira] [Commented] (SPARK-30150) Manage resources (ADD/LIST) does not support quoted path

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997299#comment-16997299 ] Wenchen Fan commented on SPARK-30150: - updated > Manage resources (ADD/LIST) does not support

[jira] [Created] (SPARK-30268) pyspark pyspark package name when releasing preview version

2019-12-16 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-30268: --- Summary: pyspark pyspark package name when releasing preview version Key: SPARK-30268 URL: https://issues.apache.org/jira/browse/SPARK-30268 Project: Spark

[jira] [Commented] (SPARK-27021) Leaking Netty event loop group for shuffle chunk fetch requests

2019-12-16 Thread roncenzhao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997288#comment-16997288 ] roncenzhao commented on SPARK-27021: [~attilapiros] Thanks. The issue is the same problem we have

[jira] [Created] (SPARK-30267) avro deserializer: ArrayList cannot be cast to GenericData$Array

2019-12-16 Thread Steven Aerts (Jira)
Steven Aerts created SPARK-30267: Summary: avro deserializer: ArrayList cannot be cast to GenericData$Array Key: SPARK-30267 URL: https://issues.apache.org/jira/browse/SPARK-30267 Project: Spark

[jira] [Resolved] (SPARK-30265) Do not change R version when releasing preview versions

2019-12-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-30265. - Resolution: Fixed Issue resolved by pull request 26904

[jira] [Resolved] (SPARK-30192) support column position in DS v2

2019-12-16 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-30192. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26817

[jira] [Created] (SPARK-30266) Int overflow and MatchError in ApproximatePercentile

2019-12-16 Thread Kent Yao (Jira)
Kent Yao created SPARK-30266: Summary: Int overflow and MatchError in ApproximatePercentile Key: SPARK-30266 URL: https://issues.apache.org/jira/browse/SPARK-30266 Project: Spark Issue Type:

[jira] [Commented] (SPARK-29505) desc extended is case sensitive

2019-12-16 Thread pavithra ramachandran (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16997090#comment-16997090 ] pavithra ramachandran commented on SPARK-29505: --- i will work on this > desc extended is

[jira] [Issue Comment Deleted] (SPARK-29505) desc extended is case sensitive

2019-12-16 Thread Shivu Sondur (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivu Sondur updated SPARK-29505: - Comment: was deleted (was: I am checking this issue) > desc extended is case sensitive >

[jira] [Created] (SPARK-30265) Do not change R version when releasing preview versions

2019-12-16 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-30265: --- Summary: Do not change R version when releasing preview versions Key: SPARK-30265 URL: https://issues.apache.org/jira/browse/SPARK-30265 Project: Spark Issue