[
https://issues.apache.org/jira/browse/SPARK-30861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17041268#comment-17041268
]
Bryan Cutler commented on SPARK-30861:
--
Issue resolved by pull request 27614
[
https://issues.apache.org/jira/browse/SPARK-30861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-30861:
-
Fix Version/s: 3.0.0
> Deprecate constructor of SQLContext and getOrCreate in SQLContext at
[
https://issues.apache.org/jira/browse/SPARK-30861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-30861.
--
Resolution: Fixed
> Deprecate constructor of SQLContext and getOrCreate in SQLContext at
Bryan Cutler created SPARK-30834:
Summary: Add note for recommended versions of Pandas and PyArrow
for 2.4.x
Key: SPARK-30834
URL: https://issues.apache.org/jira/browse/SPARK-30834
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-30834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-30834:
-
Description:
CI testing for branch 2.4 has been with the versions below. These are
recommened
[
https://issues.apache.org/jira/browse/SPARK-30834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-30834:
-
Description:
CI testing for branch 2.4 has been with the versions below. These are
recommened
[
https://issues.apache.org/jira/browse/SPARK-30834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-30834:
-
Component/s: PySpark
> Add note for recommended versions of Pandas and PyArrow for 2.4.x
>
Bryan Cutler created SPARK-30777:
Summary: PySpark test_arrow tests fail with Pandas > 1.0.0
Key: SPARK-30777
URL: https://issues.apache.org/jira/browse/SPARK-30777
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-29748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-29748.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26496
[
https://issues.apache.org/jira/browse/SPARK-29748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-29748:
Assignee: Bryan Cutler
> Remove sorting of fields in PySpark SQL Row creation
>
[
https://issues.apache.org/jira/browse/SPARK-22232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-22232.
--
Resolution: Won't Fix
Closing in favor for fix in SPARK-29748
> Row objects in pyspark
[
https://issues.apache.org/jira/browse/SPARK-24915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-24915.
--
Resolution: Won't Fix
Closing in favor of fix in SPARK-29748
> Calling
[
https://issues.apache.org/jira/browse/SPARK-24915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014719#comment-17014719
]
Bryan Cutler commented on SPARK-24915:
--
[~jhereth] apologies for closing prematurely, I didn't know
[
https://issues.apache.org/jira/browse/SPARK-24915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reopened SPARK-24915:
--
> Calling SparkSession.createDataFrame with schema can throw exception
>
[
https://issues.apache.org/jira/browse/SPARK-24915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17019770#comment-17019770
]
Bryan Cutler commented on SPARK-24915:
--
[~jhereth] since there is already a lot of discussion on
[
https://issues.apache.org/jira/browse/SPARK-30961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-30961.
--
Resolution: Won't Fix
Thanks [~KevinAppel] and [~nicornk] for the info, I'll go ahead and
[
https://issues.apache.org/jira/browse/SPARK-30961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17053717#comment-17053717
]
Bryan Cutler commented on SPARK-30961:
--
Just to be clear, this is only an issue with Spark 2.4.x.
[
https://issues.apache.org/jira/browse/SPARK-31306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-31306:
Assignee: Ben
> rand() function documentation suggests an inclusive upper bound of 1.0
>
[
https://issues.apache.org/jira/browse/SPARK-31306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-31306.
--
Resolution: Fixed
Issue resolved by pull request 28071
[
https://issues.apache.org/jira/browse/SPARK-31306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-31306:
Assignee: Bryan Cutler
> rand() function documentation suggests an inclusive upper bound
[
https://issues.apache.org/jira/browse/SPARK-31306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-31306:
Assignee: (was: Bryan Cutler)
> rand() function documentation suggests an inclusive
[
https://issues.apache.org/jira/browse/SPARK-31299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17073027#comment-17073027
]
Bryan Cutler commented on SPARK-31299:
--
It looks like you are using {{DenseVector}} from
[
https://issues.apache.org/jira/browse/SPARK-31299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-31299.
--
Resolution: Not A Problem
> Pyspark.ml.clustering illegalArgumentException with dataframe
[
https://issues.apache.org/jira/browse/SPARK-31299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-31299:
-
Description:
I hope this is the right place and way to report a bug in (at least) the
PySpark
[
https://issues.apache.org/jira/browse/SPARK-31704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17106500#comment-17106500
]
Bryan Cutler commented on SPARK-31704:
--
This is due to a Netty API that Arrow uses and
[
https://issues.apache.org/jira/browse/SPARK-31629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100154#comment-17100154
]
Bryan Cutler commented on SPARK-31629:
--
[~appleyuchi] are you able to try out a more recent version
[
https://issues.apache.org/jira/browse/SPARK-32312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17190553#comment-17190553
]
Bryan Cutler commented on SPARK-32312:
--
Sorry for the delay, I was holding off for a couple of
Bryan Cutler created SPARK-33073:
Summary: Improve error handling on Pandas to Arrow conversion
failures
Key: SPARK-33073
URL: https://issues.apache.org/jira/browse/SPARK-33073
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-33073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-33073:
-
Description:
Currently, when converting from Pandas to Arrow for Pandas UDF return values or
[
https://issues.apache.org/jira/browse/SPARK-32686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-32686:
Assignee: Nicholas Chammas
> Un-deprecate inferring DataFrame schema from list of
[
https://issues.apache.org/jira/browse/SPARK-32686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-32686.
--
Fix Version/s: 3.1.0
Resolution: Fixed
Issue resolved by pull request 29510
[
https://issues.apache.org/jira/browse/SPARK-24554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17205719#comment-17205719
]
Bryan Cutler commented on SPARK-24554:
--
I started working on this, but ran into an issue at
[
https://issues.apache.org/jira/browse/SPARK-25351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-25351:
Assignee: Jalpan Randeri
> Handle Pandas category type when converting from Python with
[
https://issues.apache.org/jira/browse/SPARK-25351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-25351.
--
Fix Version/s: 3.1.0
Resolution: Fixed
Issue resolved by pull request 26585
[
https://issues.apache.org/jira/browse/SPARK-33189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217779#comment-17217779
]
Bryan Cutler commented on SPARK-33189:
--
There is an env var we can set that will use the old
[
https://issues.apache.org/jira/browse/SPARK-33213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17219840#comment-17219840
]
Bryan Cutler commented on SPARK-33213:
--
Just a couple notes:
The library and format versions are
[
https://issues.apache.org/jira/browse/SPARK-32174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-32174.
--
Resolution: Not A Problem
Great, I will mark this as resolved then. We should add the
[
https://issues.apache.org/jira/browse/SPARK-32312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157543#comment-17157543
]
Bryan Cutler commented on SPARK-32312:
--
I've been doing local testing and will submit a WIP PR
Bryan Cutler created SPARK-32312:
Summary: Upgrade Apache Arrow to 1.0.0
Key: SPARK-32312
URL: https://issues.apache.org/jira/browse/SPARK-32312
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-32300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-32300:
Assignee: Hyukjin Kwon
> toPandas with no partitions should work
>
[
https://issues.apache.org/jira/browse/SPARK-32300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-32300.
--
Fix Version/s: 2.4.7
Resolution: Fixed
Issue resolved by pull request 29098
Bryan Cutler created SPARK-32162:
Summary: Improve Pandas Grouped Map with Window test output
Key: SPARK-32162
URL: https://issues.apache.org/jira/browse/SPARK-32162
Project: Spark
Issue
Bryan Cutler created SPARK-32285:
Summary: Add PySpark support for nested timestamps with arrow
Key: SPARK-32285
URL: https://issues.apache.org/jira/browse/SPARK-32285
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-21187:
-
Description:
This is to track adding the remaining type support in Arrow Converters.
[
https://issues.apache.org/jira/browse/SPARK-32174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152958#comment-17152958
]
Bryan Cutler commented on SPARK-32174:
--
>From the stacktrace, it looks like you are using JDK9 or
[
https://issues.apache.org/jira/browse/SPARK-32098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-32098.
--
Fix Version/s: 3.1.0
2.4.7
3.0.1
Resolution:
[
https://issues.apache.org/jira/browse/SPARK-32098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-32098:
Assignee: Hyukjin Kwon
> Use iloc for positional slicing instead of direct slicing in
[
https://issues.apache.org/jira/browse/SPARK-31998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-31998:
-
Component/s: (was: Spark Core)
SQL
> Change package references for
[
https://issues.apache.org/jira/browse/SPARK-31998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-31998:
-
Issue Type: Improvement (was: Bug)
> Change package references for ArrowBuf
>
Bryan Cutler created SPARK-32080:
Summary: Simplify ArrowColumnVector ListArray accessor
Key: SPARK-32080
URL: https://issues.apache.org/jira/browse/SPARK-32080
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-32080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-32080:
-
Priority: Trivial (was: Major)
> Simplify ArrowColumnVector ListArray accessor
>
Bryan Cutler created SPARK-31964:
Summary: Avoid Pandas import for CategoricalDtype with Arrow
conversion
Key: SPARK-31964
URL: https://issues.apache.org/jira/browse/SPARK-31964
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-31915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-31915.
--
Fix Version/s: 3.1.0
Resolution: Fixed
Issue resolved by pull request 28777
[
https://issues.apache.org/jira/browse/SPARK-31915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-31915:
Assignee: Hyukjin Kwon
> Resolve the grouping column properly per the case sensitivity
[
https://issues.apache.org/jira/browse/SPARK-32413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-32413.
--
Resolution: Not A Problem
Hi [~stoksoz] , this type of discussion is more appropriate for the
[
https://issues.apache.org/jira/browse/SPARK-32413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler closed SPARK-32413.
> Guidance for my project
>
>
> Key: SPARK-32413
>
[
https://issues.apache.org/jira/browse/SPARK-33489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17238950#comment-17238950
]
Bryan Cutler commented on SPARK-33489:
--
Yes, Arrow supports null type. Should be pretty
Bryan Cutler created SPARK-33613:
Summary: [Python][Tests] Replace calls to deprecated test APIs
Key: SPARK-33613
URL: https://issues.apache.org/jira/browse/SPARK-33613
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-33576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17241769#comment-17241769
]
Bryan Cutler commented on SPARK-33576:
--
Is this due to the 2GB limit? As in
[
https://issues.apache.org/jira/browse/SPARK-33489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17241092#comment-17241092
]
Bryan Cutler commented on SPARK-33489:
--
Great, thanks [~cactice] ! Please feel free to ping me if
[
https://issues.apache.org/jira/browse/SPARK-33576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-33576.
--
Resolution: Duplicate
Going to resolve as a duplicate, but please reopen if you find it is
[
https://issues.apache.org/jira/browse/SPARK-33576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17248064#comment-17248064
]
Bryan Cutler commented on SPARK-33576:
--
[~darshats] I believe the only current workaround is to
[
https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-21187:
-
Description:
This is to track adding the remaining type support in Arrow Converters.
[
https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-21187.
--
Fix Version/s: 3.1.0
Resolution: Fixed
With MapType now added, all basic types are
[
https://issues.apache.org/jira/browse/SPARK-32285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-32285:
-
Parent: (was: SPARK-21187)
Issue Type: Improvement (was: Sub-task)
> Add PySpark
[
https://issues.apache.org/jira/browse/SPARK-33279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17224420#comment-17224420
]
Bryan Cutler edited comment on SPARK-33279 at 11/2/20, 5:21 AM:
[
https://issues.apache.org/jira/browse/SPARK-33279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17224420#comment-17224420
]
Bryan Cutler commented on SPARK-33279:
--
[~fan_li_ya] we should change the Arrow-Spark integration
[
https://issues.apache.org/jira/browse/SPARK-24632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17255712#comment-17255712
]
Bryan Cutler commented on SPARK-24632:
--
Ping [~huaxingao] in case you have some time to look into
[
https://issues.apache.org/jira/browse/SPARK-32953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-32953.
--
Fix Version/s: 3.2.0
Resolution: Fixed
Issue resolved by pull request 29818
[
https://issues.apache.org/jira/browse/SPARK-32953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-32953:
Assignee: David Li
> Lower memory usage in toPandas with Arrow self_destruct
>
[
https://issues.apache.org/jira/browse/SPARK-34463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17293893#comment-17293893
]
Bryan Cutler commented on SPARK-34463:
--
As David said, it depends on what is done in Pandas that
[
https://issues.apache.org/jira/browse/SPARK-34463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17293893#comment-17293893
]
Bryan Cutler edited comment on SPARK-34463 at 3/2/21, 6:11 PM:
---
As David
[
https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler updated SPARK-21187:
-
Attachment: (was: 0--1172099527-254246775-1412485878)
> Complete support for remaining
[
https://issues.apache.org/jira/browse/SPARK-34521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-34521.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34509
[
https://issues.apache.org/jira/browse/SPARK-34521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-34521:
Assignee: Nicolas Azrak
> spark.createDataFrame does not support Pandas StringDtype
[
https://issues.apache.org/jira/browse/SPARK-39160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-39160.
--
Fix Version/s: 3.4.0
Resolution: Fixed
Issue resolved by pull request 36518
[
https://issues.apache.org/jira/browse/SPARK-39160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-39160:
Assignee: Cheng Pan
> Remove workaround for ARROW-1948
>
[
https://issues.apache.org/jira/browse/SPARK-38098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler reassigned SPARK-38098:
Assignee: Luca Canali
> Add support for ArrayType of nested StructType to arrow-based
[
https://issues.apache.org/jira/browse/SPARK-38098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Cutler resolved SPARK-38098.
--
Fix Version/s: 3.4.0
Resolution: Fixed
Issue resolved by pull request 35391
701 - 779 of 779 matches
Mail list logo