[ https://issues.apache.org/jira/browse/SPARK-36143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xinrong Meng updated SPARK-36143: --------------------------------- Summary: Adjust astype of Series with missing values to follow pandas (was: Adjust astype of Series of ExtensionDtype to follow pandas) > Adjust astype of Series with missing values to follow pandas > ------------------------------------------------------------ > > Key: SPARK-36143 > URL: https://issues.apache.org/jira/browse/SPARK-36143 > Project: Spark > Issue Type: Sub-task > Components: PySpark > Affects Versions: 3.2.0 > Reporter: Xinrong Meng > Priority: Major > > {code:java} > >>> pser = pd.Series([1, 2, np.nan], dtype=float) > >>> psser = ps.from_pandas(pser) > >>> pser.astype(int) > ... > ValueError: Cannot convert non-finite values (NA or inf) to integer > >>> psser.astype(int) > 0 1.0 > 1 2.0 > 2 NaN > dtype: float64 > {code} > As shown above, astype of Series of ExtensionDtype doesn't behave the same as > pandas for ExtensionDtype Series, we ought to adjust that. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org