[ 
https://issues.apache.org/jira/browse/SPARK-24330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gengliang Wang updated SPARK-24330:
-----------------------------------
    Description: 
Refactor ExecuteWriteTask in FileFormatWriter to reduce common logic and 
improve readability.
After the change, callers only need to call {{commit()}} or {{abort}} at the 
end of task.
Also there is less code in {{SingleDirectoryWriteTask}} and 
{{DynamicPartitionWriteTask}}.

Definitions of related classes are moved to a new file, and 
{{ExecuteWriteTask}} is renamed to {{FileFormatDataWriter}}.

  was:
As I am working on File data source V2 write path in my repo 
[https://github.com/gengliangwang/spark/tree/orcWriter] , I find it essential 
to refactor ExecuteWriteTask in FileFormatWriter with DataWriter of Data source 
V2:
 # Reuse the code in both `FileFormat` and Data Source V2
 # Better abstraction, callers only need to call `commit()` or `abort` at the 
end of task.

 

 


> Refactor ExecuteWriteTask in FileFormatWriter with DataWriter(V2)
> -----------------------------------------------------------------
>
>                 Key: SPARK-24330
>                 URL: https://issues.apache.org/jira/browse/SPARK-24330
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.3.1
>            Reporter: Gengliang Wang
>            Priority: Major
>
> Refactor ExecuteWriteTask in FileFormatWriter to reduce common logic and 
> improve readability.
> After the change, callers only need to call {{commit()}} or {{abort}} at the 
> end of task.
> Also there is less code in {{SingleDirectoryWriteTask}} and 
> {{DynamicPartitionWriteTask}}.
> Definitions of related classes are moved to a new file, and 
> {{ExecuteWriteTask}} is renamed to {{FileFormatDataWriter}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to