[
https://issues.apache.org/jira/browse/TAJO-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14740752#comment-14740752
]
ASF GitHub Bot commented on TAJO-1340:
--------------------------------------
Github user hyunsik commented on a diff in the pull request:
https://github.com/apache/tajo/pull/671#discussion_r39268081
--- Diff: tajo-common/src/main/java/org/apache/tajo/conf/TajoConf.java ---
@@ -215,6 +216,9 @@ public static int setDateOrder(int dateOrder) {
SHUFFLE_HASH_APPENDER_PAGE_VOLUME("tajo.shuffle.hash.appender.page.volumn-mb",
30),
HASH_SHUFFLE_PARENT_DIRS("tajo.hash.shuffle.parent.dirs.count", 10),
+ // Final output Configuration
--------------------------------------------------
+ FINAL_OUTPUT_FILE_FORMAT("tajo.final.output.file-format",
BuiltinStorages.TEXT, Validators.javaString()),
--- End diff --
Each key level should be meaningful, and left keys should be more general
than those of right ones.
I'd like to recommend ``DEFAULT_OUTPUT_FILE_FORMAT`` and
``tajo.output.file-format``.
> Change the default output file format.
> --------------------------------------
>
> Key: TAJO-1340
> URL: https://issues.apache.org/jira/browse/TAJO-1340
> Project: Tajo
> Issue Type: Improvement
> Components: Java Client, JDBC Driver, Offheap, Storage
> Reporter: Hyunsik Choi
> Assignee: Jinho Kim
> Fix For: 0.11.0, 0.12.0
>
> Attachments: TAJO-1340.patch, TAJO-1340_2.patch
>
>
> Currently, the default output file is CSV. Due to its nature, CSV has mainly
> three problems:
> * Its line or field delimiter can be duplicated to some character included
> in the result data.
> * Plan text file is likely to be larger than other file formats.
> * Its read and write performance is slow.
> We need to change the default output file format into other file formats. We
> also need to investigate which file format is the best for it.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)