Chuck Connell created SPARK-37182:
-
Summary: pyspark.pandas.to_numeric() should support the errors
option
Key: SPARK-37182
URL: https://issues.apache.org/jira/browse/SPARK-37182
Project: Spark
Chuck Connell created SPARK-37183:
-
Summary: pyspark.pandas.DataFrame.map() should support .fillna()
Key: SPARK-37183
URL: https://issues.apache.org/jira/browse/SPARK-37183
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-37182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436992#comment-17436992
]
Chuck Connell commented on SPARK-37182:
---
Duplicate of
[
https://issues.apache.org/jira/browse/SPARK-37189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chuck Connell updated SPARK-37189:
--
Description:
In pyspark.pandas if you write a line like this
{quote}DF.plot.hist(bins=30,
mathieu longtin created SPARK-37185:
---
Summary: DataFrame.take() only uses one worker
Key: SPARK-37185
URL: https://issues.apache.org/jira/browse/SPARK-37185
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-37185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437009#comment-17437009
]
mathieu longtin commented on SPARK-37185:
-
Additional note: if there's a "group by" in the
Chuck Connell created SPARK-37187:
-
Summary: pyspark.pandas fails to create a histogram of one column
from a large DataFrame
Key: SPARK-37187
URL: https://issues.apache.org/jira/browse/SPARK-37187
Chuck Connell created SPARK-37184:
-
Summary: pyspark.pandas should support
DF["column"].str.split("some_suffix").str[0]
Key: SPARK-37184
URL: https://issues.apache.org/jira/browse/SPARK-37184
[
https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436963#comment-17436963
]
Chao Sun commented on SPARK-37166:
--
[~xkrogen] sure just linked.
> SPIP: Storage Partitioned Join
>
Chuck Connell created SPARK-37181:
-
Summary: pyspark.pandas.read_csv() should support latin-1 encoding
Key: SPARK-37181
URL: https://issues.apache.org/jira/browse/SPARK-37181
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-37182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chuck Connell resolved SPARK-37182.
---
Resolution: Duplicate
https://issues.apache.org/jira/browse/SPARK-36609
>
[
https://issues.apache.org/jira/browse/SPARK-37182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chuck Connell updated SPARK-37182:
--
Comment: was deleted
(was: https://issues.apache.org/jira/browse/SPARK-36609)
>
Chuck Connell created SPARK-37189:
-
Summary: CLONE - pyspark.pandas histogram accepts the title option
but does not add a title to the plot
Key: SPARK-37189
URL: https://issues.apache.org/jira/browse/SPARK-37189
[
https://issues.apache.org/jira/browse/SPARK-37189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chuck Connell updated SPARK-37189:
--
Summary: pyspark.pandas histogram accepts the range option but does not use
it (was: CLONE -
Allison Wang created SPARK-37190:
Summary: Improve error messages for casting under ANSI mode
Key: SPARK-37190
URL: https://issues.apache.org/jira/browse/SPARK-37190
Project: Spark
Issue
Chuck Connell created SPARK-37186:
-
Summary: pyspark.pandas should support tseries.offsets
Key: SPARK-37186
URL: https://issues.apache.org/jira/browse/SPARK-37186
Project: Spark
Issue Type:
Chuck Connell created SPARK-37188:
-
Summary: pyspark.pandas histogram accepts the title option but
does not add a title to the plot
Key: SPARK-37188
URL: https://issues.apache.org/jira/browse/SPARK-37188
[
https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436959#comment-17436959
]
Erik Krogen commented on SPARK-37166:
-
[~csun] can you link the doc here?
> SPIP: Storage
[
https://issues.apache.org/jira/browse/SPARK-37023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37023:
Assignee: Apache Spark
> Avoid fetching merge status when shuffleMergeEnabled is false
[
https://issues.apache.org/jira/browse/SPARK-37023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37023:
Assignee: (was: Apache Spark)
> Avoid fetching merge status when shuffleMergeEnabled
[
https://issues.apache.org/jira/browse/SPARK-37023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437042#comment-17437042
]
Apache Spark commented on SPARK-37023:
--
User 'rmcyang' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-37023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437041#comment-17437041
]
Apache Spark commented on SPARK-37023:
--
User 'rmcyang' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437061#comment-17437061
]
Ivan commented on SPARK-37191:
--
This is somewhat related to
[
https://issues.apache.org/jira/browse/SPARK-37159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kousuke Saruta resolved SPARK-37159.
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved in
Ivan created SPARK-37191:
Summary: Allow merging DecimalTypes with different precision
values
Key: SPARK-37191
URL: https://issues.apache.org/jira/browse/SPARK-37191
Project: Spark
Issue Type: Bug
PengLei created SPARK-37192:
---
Summary: Migrate SHOW TBLPROPERTIES to use V2 command by default
Key: SPARK-37192
URL: https://issues.apache.org/jira/browse/SPARK-37192
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-37124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chendi.Xue updated SPARK-37124:
---
Description:
This Jira aims to support the Arrow format in RowToColumnarExec
Current
[
https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37191:
Assignee: (was: Apache Spark)
> Allow merging DecimalTypes with different precision
[
https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37191:
Assignee: Apache Spark
> Allow merging DecimalTypes with different precision values
>
[
https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437064#comment-17437064
]
Apache Spark commented on SPARK-37191:
--
User 'sadikovi' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-37051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437122#comment-17437122
]
Yang Jie commented on SPARK-37051:
--
Can you test
{code:java}
select * from t2_orc where
[
https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ivan updated SPARK-37191:
-
Description:
When merging DecimalTypes with different precision but the same scale, one
would get the
[
https://issues.apache.org/jira/browse/SPARK-37190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437070#comment-17437070
]
Apache Spark commented on SPARK-37190:
--
User 'allisonwang-db' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-36554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kousuke Saruta resolved SPARK-36554.
Fix Version/s: 3.3.0
Assignee: Nicolas Azrak
Resolution: Fixed
Issue
[
https://issues.apache.org/jira/browse/SPARK-37192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437120#comment-17437120
]
Terry Kim commented on SPARK-37192:
---
Yes, go for it! Thanks!
> Migrate SHOW TBLPROPERTIES to use V2
[
https://issues.apache.org/jira/browse/SPARK-37191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ivan updated SPARK-37191:
-
Description:
When merging DecimalTypes with different precision but the same scale, one
would get the
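For readers unfamiliar with the case SPARK-37191 describes (two decimal types with the same scale but different precision), the following is a minimal sketch of one plausible merge rule: keep the shared scale and take the larger precision. The `DecimalType` tuple and `merge_decimal` function are hypothetical names used only for illustration. They are not Spark's classes, and the rule is an assumption rather than the behavior of the actual patch.

```python
from typing import NamedTuple


class DecimalType(NamedTuple):
    """Hypothetical stand-in for a decimal type; not Spark's DecimalType."""
    precision: int  # total number of digits
    scale: int      # digits to the right of the decimal point


def merge_decimal(left: DecimalType, right: DecimalType) -> DecimalType:
    """Assumed rule: with equal scales, keep the wider precision."""
    if left.scale != right.scale:
        # This sketch only covers the same-scale case named in the issue.
        raise ValueError(
            f"cannot merge decimals with scales {left.scale} and {right.scale}"
        )
    return DecimalType(max(left.precision, right.precision), left.scale)


merged = merge_decimal(DecimalType(10, 2), DecimalType(12, 2))
print(merged)
```

Taking the larger precision means every value representable in either input type is still representable in the merged type, since the scale is unchanged.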
Eugene Koifman created SPARK-37193:
--
Summary: DynamicJoinSelection.shouldDemoteBroadcastHashJoin should
not apply to outer joins
Key: SPARK-37193
URL: https://issues.apache.org/jira/browse/SPARK-37193
[
https://issues.apache.org/jira/browse/SPARK-37172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437135#comment-17437135
]
Chungmin commented on SPARK-37172:
--
I can work on this if the rationale seems okay.
> Push down
[
https://issues.apache.org/jira/browse/SPARK-37190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37190:
Assignee: (was: Apache Spark)
> Improve error messages for casting under ANSI mode
>
[
https://issues.apache.org/jira/browse/SPARK-37190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437068#comment-17437068
]
Apache Spark commented on SPARK-37190:
--
User 'allisonwang-db' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-37190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37190:
Assignee: Apache Spark
> Improve error messages for casting under ANSI mode
>
[
https://issues.apache.org/jira/browse/SPARK-37192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437113#comment-17437113
]
PengLei commented on SPARK-37192:
-
[~imback82] [~wenchen] I want to try to fix it, okay?
> Migrate SHOW
[
https://issues.apache.org/jira/browse/SPARK-37051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437127#comment-17437127
]
frankli commented on SPARK-37051:
-
I know this SQL can work, but this behavior is different from MySQL
[
https://issues.apache.org/jira/browse/SPARK-37193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37193:
Assignee: Apache Spark
> DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not
[
https://issues.apache.org/jira/browse/SPARK-37193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37193:
Assignee: (was: Apache Spark)
> DynamicJoinSelection.shouldDemoteBroadcastHashJoin
[
https://issues.apache.org/jira/browse/SPARK-37193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437125#comment-17437125
]
Apache Spark commented on SPARK-37193:
--
User 'ekoifman' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-37193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17437126#comment-17437126
]
Apache Spark commented on SPARK-37193:
--
User 'ekoifman' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-35496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yang Jie updated SPARK-35496:
-
Affects Version/s: 3.3.0
> Upgrade Scala 2.13 to 2.13.7
>
>
>
[
https://issues.apache.org/jira/browse/SPARK-37161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Max Gekk resolved SPARK-37161.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34446
[
https://issues.apache.org/jira/browse/SPARK-37161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Max Gekk reassigned SPARK-37161:
Assignee: PengLei
> RowToColumnConverter support AnsiIntervalType
>
[
https://issues.apache.org/jira/browse/SPARK-37013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436641#comment-17436641
]
Apache Spark commented on SPARK-37013:
--
User 'LuciferYang' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-36061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436725#comment-17436725
]
Apache Spark commented on SPARK-36061:
--
User 'Yikun' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-36061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-36061:
Assignee: Apache Spark
> Create a PodGroup with user specified minimum resources
[
https://issues.apache.org/jira/browse/SPARK-36061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-36061:
Assignee: (was: Apache Spark)
> Create a PodGroup with user specified minimum
[
https://issues.apache.org/jira/browse/SPARK-37176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37176:
Assignee: Apache Spark
> JsonSource's infer should have the same exception handle logic
[
https://issues.apache.org/jira/browse/SPARK-37176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436675#comment-17436675
]
Apache Spark commented on SPARK-37176:
--
User 'advancedxy' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-37176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37176:
Assignee: (was: Apache Spark)
> JsonSource's infer should have the same exception
Xianjin YE created SPARK-37176:
--
Summary: JsonSource's infer should have the same exception handle
logic as JacksonParser's parse logic
Key: SPARK-37176
URL: https://issues.apache.org/jira/browse/SPARK-37176
[
https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436740#comment-17436740
]
Gustavo Martin edited comment on SPARK-23977 at 11/1/21, 10:35 AM:
---
[
https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436740#comment-17436740
]
Gustavo Martin edited comment on SPARK-23977 at 11/1/21, 10:48 AM:
---
[
https://issues.apache.org/jira/browse/SPARK-37062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jungtaek Lim resolved SPARK-37062.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34333
[
https://issues.apache.org/jira/browse/SPARK-37062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jungtaek Lim reassigned SPARK-37062:
Assignee: Jungtaek Lim
> Introduce a new data source for providing consistent set of
Simon Tao created SPARK-37178:
-
Summary: Add Target Encoding to ml.feature
Key: SPARK-37178
URL: https://issues.apache.org/jira/browse/SPARK-37178
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436740#comment-17436740
]
Gustavo Martin edited comment on SPARK-23977 at 11/1/21, 10:34 AM:
---
[
https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436740#comment-17436740
]
Gustavo Martin commented on SPARK-23977:
Thank you very much [~ste...@apache.org] for your
Douglas Moore created SPARK-37177:
-
Summary: Support LONG argument to the Spark SQL LIMIT clause
Key: SPARK-37177
URL: https://issues.apache.org/jira/browse/SPARK-37177
Project: Spark
Issue
Chuck Connell created SPARK-37180:
-
Summary: PySpark.pandas should support __version__
Key: SPARK-37180
URL: https://issues.apache.org/jira/browse/SPARK-37180
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-37180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chuck Connell updated SPARK-37180:
--
Description:
In regular pandas you can say
{quote}pd.___version___
{quote}
to get the pandas
[
https://issues.apache.org/jira/browse/SPARK-37180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chuck Connell updated SPARK-37180:
--
Description:
In regular pandas you can say
{quote}{{pd.__version__ }}{quote}
to get the
[
https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436885#comment-17436885
]
Yikun Jiang commented on SPARK-36566:
-
Yep, it's useful for me. Does it make sense if we also set
[
https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436885#comment-17436885
]
Yikun Jiang edited comment on SPARK-36566 at 11/1/21, 3:11 PM:
---
Yep, it's
[
https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436895#comment-17436895
]
Apache Spark commented on SPARK-36566:
--
User 'Yikun' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-36566:
Assignee: Apache Spark
> Add Spark appname as a label to the executor pods
>
[
https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-36566:
Assignee: (was: Apache Spark)
> Add Spark appname as a label to the executor pods
>
[
https://issues.apache.org/jira/browse/SPARK-36566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436896#comment-17436896
]
Apache Spark commented on SPARK-36566:
--
User 'Yikun' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-37034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436898#comment-17436898
]
Wenchen Fan commented on SPARK-37034:
-
This is a question, not a feature request. Please ask it in
[
https://issues.apache.org/jira/browse/SPARK-37034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436898#comment-17436898
]
Wenchen Fan edited comment on SPARK-37034 at 11/1/21, 3:23 PM:
---
This is a
[
https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gengliang Wang updated SPARK-37179:
---
Description: The casting between
> ANSI mode: Allow casting between Timestamp and Numeric
[
https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gengliang Wang reassigned SPARK-37179:
--
Assignee: Gengliang Wang
> ANSI mode: Allow casting between Timestamp and Numeric
>
[
https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gengliang Wang updated SPARK-37179:
---
Description:
We should allow casting
As we did some data science, we found that many Spark
[
https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37179:
Assignee: Gengliang Wang (was: Apache Spark)
> ANSI mode: Allow casting between
[
https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436858#comment-17436858
]
Apache Spark commented on SPARK-37179:
--
User 'gengliangwang' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37179:
Assignee: Apache Spark (was: Gengliang Wang)
> ANSI mode: Allow casting between
[
https://issues.apache.org/jira/browse/SPARK-37034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436825#comment-17436825
]
xiaoli commented on SPARK-37034:
[~dongjoon] [~yumwang] [~cloud_fan] Sorry to ping you, as there is
[
https://issues.apache.org/jira/browse/SPARK-37178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37178:
Assignee: (was: Apache Spark)
> Add Target Encoding to ml.feature
>
[
https://issues.apache.org/jira/browse/SPARK-37178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37178:
Assignee: Apache Spark
> Add Target Encoding to ml.feature
>
Gengliang Wang created SPARK-37179:
--
Summary: ANSI mode: Allow casting between Timestamp and Numeric
Key: SPARK-37179
URL: https://issues.apache.org/jira/browse/SPARK-37179
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-37178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436844#comment-17436844
]
Apache Spark commented on SPARK-37178:
--
User 'taosiyuan163' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gengliang Wang updated SPARK-37179:
---
Description:
We should allow the casting between Timestamp and Numeric types:
* As we did
[
https://issues.apache.org/jira/browse/SPARK-37179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436861#comment-17436861
]
Apache Spark commented on SPARK-37179:
--
User 'gengliangwang' has created a pull request for this