Philip Kahn created SPARK-47997:
-----------------------------------

             Summary: Pandas-on-Spark incompletely implements DataFrame.drop
                 Key: SPARK-47997
                 URL: https://issues.apache.org/jira/browse/SPARK-47997
             Project: Spark
          Issue Type: Bug
          Components: Pandas API on Spark
    Affects Versions: 3.4.3
            Reporter: Philip Kahn


For Pandas v1.0+, `drop` supports the `errors` kwarg:

[https://pandas.pydata.org/pandas-docs/version/1.0/reference/api/pandas.DataFrame.drop.html]

 

Pandas-on-Spark does not implement it. This is especially glaring since the 
pyspark drop is a no-op on absent columns, behaving like `errors='ignore'`, so 
_extra_ work needed to be done to implement the raise behaviour.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to