[jira] [Created] (SPARK-37182) pyspark.pandas.to_numeric() should support the errors option

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37182: - Summary: pyspark.pandas.to_numeric() should support the errors option Key: SPARK-37182 URL: https://issues.apache.org/jira/browse/SPARK-37182 Project: Spark

[jira] [Created] (SPARK-37183) pyspark.pandas.DataFrame.map() should support .fillna()

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37183: - Summary: pyspark.pandas.DataFrame.map() should support .fillna() Key: SPARK-37183 URL: https://issues.apache.org/jira/browse/SPARK-37183 Project: Spark

[jira] [Commented] (SPARK-37182) pyspark.pandas.to_numeric() should support the errors option

2021-11-01 Thread Chuck Connell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436992#comment-17436992 ] Chuck Connell commented on SPARK-37182: --- Duplicate of 

[jira] [Updated] (SPARK-37189) pyspark.pandas histogram accepts the range option but does not use it

2021-11-01 Thread Chuck Connell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chuck Connell updated SPARK-37189: -- Description: In pyspark.pandas if you write a line like this {quote}DF.plot.hist(bins=30,

[jira] [Created] (SPARK-37185) DataFrame.take() only uses one worker

2021-11-01 Thread mathieu longtin (Jira)
mathieu longtin created SPARK-37185: --- Summary: DataFrame.take() only uses one worker Key: SPARK-37185 URL: https://issues.apache.org/jira/browse/SPARK-37185 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-37185) DataFrame.take() only uses one worker

2021-11-01 Thread mathieu longtin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437009#comment-17437009 ] mathieu longtin commented on SPARK-37185: - Additional note: if there's a "group by" in the

[jira] [Created] (SPARK-37187) pyspark.pandas fails to create a histogram of one column from a large DataFrame

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37187: - Summary: pyspark.pandas fails to create a histogram of one column from a large DataFrame Key: SPARK-37187 URL: https://issues.apache.org/jira/browse/SPARK-37187

[jira] [Created] (SPARK-37184) pyspark.pandas should support DF["column"].str.split("some_suffix").str[0]

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37184: - Summary: pyspark.pandas should support DF["column"].str.split("some_suffix").str[0] Key: SPARK-37184 URL: https://issues.apache.org/jira/browse/SPARK-37184

[jira] [Commented] (SPARK-37166) SPIP: Storage Partitioned Join

2021-11-01 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436963#comment-17436963 ] Chao Sun commented on SPARK-37166: -- [~xkrogen] sure just linked. > SPIP: Storage Partitioned Join >

[jira] [Created] (SPARK-37181) pyspark.pandas.read_csv() should support latin-1 encoding

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37181: - Summary: pyspark.pandas.read_csv() should support latin-1 encoding Key: SPARK-37181 URL: https://issues.apache.org/jira/browse/SPARK-37181 Project: Spark

[jira] [Resolved] (SPARK-37182) pyspark.pandas.to_numeric() should support the errors option

2021-11-01 Thread Chuck Connell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chuck Connell resolved SPARK-37182. --- Resolution: Duplicate https://issues.apache.org/jira/browse/SPARK-36609 >

[jira] [Issue Comment Deleted] (SPARK-37182) pyspark.pandas.to_numeric() should support the errors option

2021-11-01 Thread Chuck Connell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chuck Connell updated SPARK-37182: -- Comment: was deleted (was: https://issues.apache.org/jira/browse/SPARK-36609) >

[jira] [Created] (SPARK-37189) CLONE - pyspark.pandas histogram accepts the title option but does not add a title to the plot

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37189: - Summary: CLONE - pyspark.pandas histogram accepts the title option but does not add a title to the plot Key: SPARK-37189 URL: https://issues.apache.org/jira/browse/SPARK-37189

[jira] [Updated] (SPARK-37189) pyspark.pandas histogram accepts the range option but does not use it

2021-11-01 Thread Chuck Connell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chuck Connell updated SPARK-37189: -- Summary: pyspark.pandas histogram accepts the range option but does not use it (was: CLONE -

[jira] [Created] (SPARK-37190) Improve error messages for casting under ANSI mode

2021-11-01 Thread Allison Wang (Jira)
Allison Wang created SPARK-37190: Summary: Improve error messages for casting under ANSI mode Key: SPARK-37190 URL: https://issues.apache.org/jira/browse/SPARK-37190 Project: Spark Issue

[jira] [Created] (SPARK-37186) pyspark.pandas should support tseries.offsets

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37186: - Summary: pyspark.pandas should support tseries.offsets Key: SPARK-37186 URL: https://issues.apache.org/jira/browse/SPARK-37186 Project: Spark Issue Type:

[jira] [Created] (SPARK-37188) pyspark.pandas histogram accepts the title option but does not add a title to the plot

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37188: - Summary: pyspark.pandas histogram accepts the title option but does not add a title to the plot Key: SPARK-37188 URL: https://issues.apache.org/jira/browse/SPARK-37188

[jira] [Commented] (SPARK-37166) SPIP: Storage Partitioned Join

2021-11-01 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436959#comment-17436959 ] Erik Krogen commented on SPARK-37166: - [~csun] can you link the doc here? > SPIP: Storage

[jira] [Assigned] (SPARK-37023) Avoid fetching merge status when shuffleMergeEnabled is false for a shuffleDependency during retry

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37023: Assignee: Apache Spark > Avoid fetching merge status when shuffleMergeEnabled is false

[jira] [Assigned] (SPARK-37023) Avoid fetching merge status when shuffleMergeEnabled is false for a shuffleDependency during retry

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37023: Assignee: (was: Apache Spark) > Avoid fetching merge status when shuffleMergeEnabled

[jira] [Commented] (SPARK-37023) Avoid fetching merge status when shuffleMergeEnabled is false for a shuffleDependency during retry

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437042#comment-17437042 ] Apache Spark commented on SPARK-37023: -- User 'rmcyang' has created a pull request for this issue:

[jira] [Commented] (SPARK-37023) Avoid fetching merge status when shuffleMergeEnabled is false for a shuffleDependency during retry

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437041#comment-17437041 ] Apache Spark commented on SPARK-37023: -- User 'rmcyang' has created a pull request for this issue:

[jira] [Commented] (SPARK-37191) Allow merging DecimalTypes with different precision values

2021-11-01 Thread Ivan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437061#comment-17437061 ] Ivan commented on SPARK-37191: -- This is somewhat related to

[jira] [Resolved] (SPARK-37159) Change HiveExternalCatalogVersionsSuite to be able to test with Java 17

2021-11-01 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta resolved SPARK-37159. Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved in

[jira] [Created] (SPARK-37191) Allow merging DecimalTypes with different precision values

2021-11-01 Thread Ivan (Jira)
Ivan created SPARK-37191: Summary: Allow merging DecimalTypes with different precision values Key: SPARK-37191 URL: https://issues.apache.org/jira/browse/SPARK-37191 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-37192) Migrate SHOW TBLPROPERTIES to use V2 command by default

2021-11-01 Thread PengLei (Jira)
PengLei created SPARK-37192: --- Summary: Migrate SHOW TBLPROPERTIES to use V2 command by default Key: SPARK-37192 URL: https://issues.apache.org/jira/browse/SPARK-37192 Project: Spark Issue Type:

[jira] [Updated] (SPARK-37124) Support RowToColumnarExec with Arrow

2021-11-01 Thread Chendi.Xue (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chendi.Xue updated SPARK-37124: --- Description: This Jira aims to support Arrow format in RowToColumnarExec. Current

[jira] [Assigned] (SPARK-37191) Allow merging DecimalTypes with different precision values

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37191: Assignee: (was: Apache Spark) > Allow merging DecimalTypes with different precision

[jira] [Assigned] (SPARK-37191) Allow merging DecimalTypes with different precision values

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37191: Assignee: Apache Spark > Allow merging DecimalTypes with different precision values >

[jira] [Commented] (SPARK-37191) Allow merging DecimalTypes with different precision values

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437064#comment-17437064 ] Apache Spark commented on SPARK-37191: -- User 'sadikovi' has created a pull request for this issue:

[jira] [Commented] (SPARK-37051) The filter operator gets wrong results in char type

2021-11-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437122#comment-17437122 ] Yang Jie commented on SPARK-37051: -- Can you test {code:java} select * from t2_orc where

[jira] [Updated] (SPARK-37191) Allow merging DecimalTypes with different precision values

2021-11-01 Thread Ivan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan updated SPARK-37191: - Description: When merging DecimalTypes with different precision but the same scale, one would get the

[jira] [Commented] (SPARK-37190) Improve error messages for casting under ANSI mode

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437070#comment-17437070 ] Apache Spark commented on SPARK-37190: -- User 'allisonwang-db' has created a pull request for this

[jira] [Resolved] (SPARK-36554) Error message while trying to use spark sql functions directly on dataframe columns without using select expression

2021-11-01 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta resolved SPARK-36554. Fix Version/s: 3.3.0 Assignee: Nicolas Azrak Resolution: Fixed Issue

[jira] [Commented] (SPARK-37192) Migrate SHOW TBLPROPERTIES to use V2 command by default

2021-11-01 Thread Terry Kim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437120#comment-17437120 ] Terry Kim commented on SPARK-37192: --- Yes, go for it! Thanks! > Migrate SHOW TBLPROPERTIES to use V2

[jira] [Updated] (SPARK-37191) Allow merging DecimalTypes with different precision values

2021-11-01 Thread Ivan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan updated SPARK-37191: - Description: When merging DecimalTypes with different precision but the same scale, one would get the

[jira] [Created] (SPARK-37193) DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not apply to outer joins

2021-11-01 Thread Eugene Koifman (Jira)
Eugene Koifman created SPARK-37193: -- Summary: DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not apply to outer joins Key: SPARK-37193 URL: https://issues.apache.org/jira/browse/SPARK-37193

[jira] [Commented] (SPARK-37172) Push down filters having both partitioning and non-partitioning columns

2021-11-01 Thread Chungmin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437135#comment-17437135 ] Chungmin commented on SPARK-37172: -- I can work on this if the rationale seems okay. > Push down

[jira] [Assigned] (SPARK-37190) Improve error messages for casting under ANSI mode

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37190: Assignee: (was: Apache Spark) > Improve error messages for casting under ANSI mode >

[jira] [Commented] (SPARK-37190) Improve error messages for casting under ANSI mode

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437068#comment-17437068 ] Apache Spark commented on SPARK-37190: -- User 'allisonwang-db' has created a pull request for this

[jira] [Assigned] (SPARK-37190) Improve error messages for casting under ANSI mode

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37190: Assignee: Apache Spark > Improve error messages for casting under ANSI mode >

[jira] [Commented] (SPARK-37192) Migrate SHOW TBLPROPERTIES to use V2 command by default

2021-11-01 Thread PengLei (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437113#comment-17437113 ] PengLei commented on SPARK-37192: - [~imback82] [~wenchen] I want to try to fix it, okay? > Migrate SHOW

[jira] [Commented] (SPARK-37051) The filter operator gets wrong results in char type

2021-11-01 Thread frankli (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437127#comment-17437127 ] frankli commented on SPARK-37051: - I know this SQL can work, but this behavior is different from MySQL

[jira] [Assigned] (SPARK-37193) DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not apply to outer joins

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37193: Assignee: Apache Spark > DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not

[jira] [Assigned] (SPARK-37193) DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not apply to outer joins

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37193: Assignee: (was: Apache Spark) > DynamicJoinSelection.shouldDemoteBroadcastHashJoin

[jira] [Commented] (SPARK-37193) DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not apply to outer joins

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437125#comment-17437125 ] Apache Spark commented on SPARK-37193: -- User 'ekoifman' has created a pull request for this issue:

[jira] [Commented] (SPARK-37193) DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not apply to outer joins

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437126#comment-17437126 ] Apache Spark commented on SPARK-37193: -- User 'ekoifman' has created a pull request for this issue:

[jira] [Updated] (SPARK-35496) Upgrade Scala 2.13 to 2.13.7

2021-11-01 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-35496: - Affects Version/s: 3.3.0 > Upgrade Scala 2.13 to 2.13.7 > > >

[jira] [Resolved] (SPARK-37161) RowToColumnConverter support AnsiIntervalType

2021-11-01 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-37161. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34446

[jira] [Assigned] (SPARK-37161) RowToColumnConverter support AnsiIntervalType

2021-11-01 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-37161: Assignee: PengLei > RowToColumnConverter support AnsiIntervalType >

[jira] [Commented] (SPARK-37013) `select format_string('%0$s', 'Hello')` has different behavior when using java 8 and Java 17

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436641#comment-17436641 ] Apache Spark commented on SPARK-37013: -- User 'LuciferYang' has created a pull request for this

[jira] [Commented] (SPARK-36061) Create a PodGroup with user specified minimum resources required

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436725#comment-17436725 ] Apache Spark commented on SPARK-36061: -- User 'Yikun' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36061) Create a PodGroup with user specified minimum resources required

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36061: Assignee: Apache Spark > Create a PodGroup with user specified minimum resources

[jira] [Assigned] (SPARK-36061) Create a PodGroup with user specified minimum resources required

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36061: Assignee: (was: Apache Spark) > Create a PodGroup with user specified minimum

[jira] [Assigned] (SPARK-37176) JsonSource's infer should have the same exception handle logic as JacksonParser's parse logic

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37176: Assignee: Apache Spark > JsonSource's infer should have the same exception handle logic

[jira] [Commented] (SPARK-37176) JsonSource's infer should have the same exception handle logic as JacksonParser's parse logic

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436675#comment-17436675 ] Apache Spark commented on SPARK-37176: -- User 'advancedxy' has created a pull request for this

[jira] [Assigned] (SPARK-37176) JsonSource's infer should have the same exception handle logic as JacksonParser's parse logic

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37176: Assignee: (was: Apache Spark) > JsonSource's infer should have the same exception

[jira] [Created] (SPARK-37176) JsonSource's infer should have the same exception handle logic as JacksonParser's parse logic

2021-11-01 Thread Xianjin YE (Jira)
Xianjin YE created SPARK-37176: -- Summary: JsonSource's infer should have the same exception handle logic as JacksonParser's parse logic Key: SPARK-37176 URL: https://issues.apache.org/jira/browse/SPARK-37176

[jira] [Comment Edited] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2021-11-01 Thread Gustavo Martin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436740#comment-17436740 ] Gustavo Martin edited comment on SPARK-23977 at 11/1/21, 10:35 AM: ---

[jira] [Comment Edited] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2021-11-01 Thread Gustavo Martin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436740#comment-17436740 ] Gustavo Martin edited comment on SPARK-23977 at 11/1/21, 10:48 AM: ---

[jira] [Resolved] (SPARK-37062) Introduce a new data source for providing consistent set of rows per microbatch

2021-11-01 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-37062. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34333

[jira] [Assigned] (SPARK-37062) Introduce a new data source for providing consistent set of rows per microbatch

2021-11-01 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim reassigned SPARK-37062: Assignee: Jungtaek Lim > Introduce a new data source for providing consistent set of

[jira] [Created] (SPARK-37178) Add Target Encoding to ml.feature

2021-11-01 Thread Simon Tao (Jira)
Simon Tao created SPARK-37178: - Summary: Add Target Encoding to ml.feature Key: SPARK-37178 URL: https://issues.apache.org/jira/browse/SPARK-37178 Project: Spark Issue Type: New Feature

[jira] [Comment Edited] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2021-11-01 Thread Gustavo Martin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436740#comment-17436740 ] Gustavo Martin edited comment on SPARK-23977 at 11/1/21, 10:34 AM: ---

[jira] [Commented] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2021-11-01 Thread Gustavo Martin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436740#comment-17436740 ] Gustavo Martin commented on SPARK-23977: Thank you very much [~ste...@apache.org] for your

[jira] [Created] (SPARK-37177) Support LONG argument to the Spark SQL LIMIT clause

2021-11-01 Thread Douglas Moore (Jira)
Douglas Moore created SPARK-37177: - Summary: Support LONG argument to the Spark SQL LIMIT clause Key: SPARK-37177 URL: https://issues.apache.org/jira/browse/SPARK-37177 Project: Spark Issue

[jira] [Created] (SPARK-37180) PySpark.pandas should support __version__

2021-11-01 Thread Chuck Connell (Jira)
Chuck Connell created SPARK-37180: - Summary: PySpark.pandas should support __version__ Key: SPARK-37180 URL: https://issues.apache.org/jira/browse/SPARK-37180 Project: Spark Issue Type:

[jira] [Updated] (SPARK-37180) PySpark.pandas should support __version__

2021-11-01 Thread Chuck Connell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chuck Connell updated SPARK-37180: -- Description: In regular pandas you can say {quote}pd.__version__ {quote} to get the pandas

[jira] [Updated] (SPARK-37180) PySpark.pandas should support __version__

2021-11-01 Thread Chuck Connell (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chuck Connell updated SPARK-37180: -- Description: In regular pandas you can say {quote}{{pd.__version__ }}{quote} to get the

[jira] [Commented] (SPARK-36566) Add Spark appname as a label to the executor pods

2021-11-01 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436885#comment-17436885 ] Yikun Jiang commented on SPARK-36566: - Yep, it's useful for me. Does it make sense if we also set

[jira] [Comment Edited] (SPARK-36566) Add Spark appname as a label to the executor pods

2021-11-01 Thread Yikun Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436885#comment-17436885 ] Yikun Jiang edited comment on SPARK-36566 at 11/1/21, 3:11 PM: --- Yep, it's

[jira] [Commented] (SPARK-36566) Add Spark appname as a label to the executor pods

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436895#comment-17436895 ] Apache Spark commented on SPARK-36566: -- User 'Yikun' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36566) Add Spark appname as a label to the executor pods

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36566: Assignee: Apache Spark > Add Spark appname as a label to the executor pods >

[jira] [Assigned] (SPARK-36566) Add Spark appname as a label to the executor pods

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36566: Assignee: (was: Apache Spark) > Add Spark appname as a label to the executor pods >

[jira] [Commented] (SPARK-36566) Add Spark appname as a label to the executor pods

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436896#comment-17436896 ] Apache Spark commented on SPARK-36566: -- User 'Yikun' has created a pull request for this issue:

[jira] [Commented] (SPARK-37034) What's the progress of vectorized execution for spark?

2021-11-01 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436898#comment-17436898 ] Wenchen Fan commented on SPARK-37034: - This is a question, not a feature request. Please ask it in

[jira] [Comment Edited] (SPARK-37034) What's the progress of vectorized execution for spark?

2021-11-01 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436898#comment-17436898 ] Wenchen Fan edited comment on SPARK-37034 at 11/1/21, 3:23 PM: --- This is a

[jira] [Updated] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-37179: --- Description: The casting between > ANSI mode: Allow casting between Timestamp and Numeric

[jira] [Assigned] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang reassigned SPARK-37179: -- Assignee: Gengliang Wang > ANSI mode: Allow casting between Timestamp and Numeric >

[jira] [Updated] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-37179: --- Description: We should allow casting As we did some data science, we found that many Spark

[jira] [Assigned] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37179: Assignee: Gengliang Wang (was: Apache Spark) > ANSI mode: Allow casting between

[jira] [Commented] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436858#comment-17436858 ] Apache Spark commented on SPARK-37179: -- User 'gengliangwang' has created a pull request for this

[jira] [Assigned] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37179: Assignee: Apache Spark (was: Gengliang Wang) > ANSI mode: Allow casting between

[jira] [Commented] (SPARK-37034) What's the progress of vectorized execution for spark?

2021-11-01 Thread xiaoli (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436825#comment-17436825 ] xiaoli commented on SPARK-37034: [~dongjoon] [~yumwang] [~cloud_fan] Sorry to ping you, as there is

[jira] [Assigned] (SPARK-37178) Add Target Encoding to ml.feature

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37178: Assignee: (was: Apache Spark) > Add Target Encoding to ml.feature >

[jira] [Assigned] (SPARK-37178) Add Target Encoding to ml.feature

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37178: Assignee: Apache Spark > Add Target Encoding to ml.feature >

[jira] [Created] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-37179: -- Summary: ANSI mode: Allow casting between Timestamp and Numeric Key: SPARK-37179 URL: https://issues.apache.org/jira/browse/SPARK-37179 Project: Spark

[jira] [Commented] (SPARK-37178) Add Target Encoding to ml.feature

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436844#comment-17436844 ] Apache Spark commented on SPARK-37178: -- User 'taosiyuan163' has created a pull request for this

[jira] [Updated] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-37179: --- Description: We should allow the casting between Timestamp and Numeric types: * As we did

[jira] [Commented] (SPARK-37179) ANSI mode: Allow casting between Timestamp and Numeric

2021-11-01 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436861#comment-17436861 ] Apache Spark commented on SPARK-37179: -- User 'gengliangwang' has created a pull request for this