Artem Kupchinskiy created SPARK-38116:
-----------------------------------------

             Summary: Ability to turn off auto commit in JDBC source for read 
only operations
                 Key: SPARK-38116
                 URL: https://issues.apache.org/jira/browse/SPARK-38116
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.2.1
            Reporter: Artem Kupchinskiy


Currently, all the jdbc connections on executors side work always with auto 
commit option set to true.

However, there are cases where this mode makes hard to use JdbcRelationProvider 
at all, i.e. reading huge datasets from Postgres (a whole result set is 
collected regardless of a fetch size when autocommit is set to true 
https://jdbc.postgresql.org/documentation/91/query.html#query-with-cursor )

So the proposal is following:
 # Add a boolean option "autocommit" to JDBC Source allowing a user to turn off 
autocommit mode for read only operations.
 # Add guards which prevent using this option in DML operations.  

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to