[ 
https://issues.apache.org/jira/browse/TAJO-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14733364#comment-14733364
 ] 

ASF GitHub Bot commented on TAJO-1340:
--------------------------------------

Github user jinossy commented on the pull request:

    https://github.com/apache/tajo/pull/671#issuecomment-138226623
  
    Changed to pre-fetch in both TajoMaster and TajoClient.
    
    Pre-Fetch, Network: Loopback, Data: lineitem 3GB
    ```
    TEXT format
    52 sec (Deserialize), 52 sec (Non-Deserialize)
    
    TEXT format + snappy compression
    54 sec (Deserialize), 54 sec (Non-Deserialize)
    
    DRAW format
    32 sec (Deserialize), 22 sec (Non-Deserialize)
    
    DRAW format + snappy compression
    47 sec (Deserialize), 47 sec (Non-Deserialize)
    ```


> Change the default output file format.
> --------------------------------------
>
>                 Key: TAJO-1340
>                 URL: https://issues.apache.org/jira/browse/TAJO-1340
>             Project: Tajo
>          Issue Type: Improvement
>          Components: Java Client, JDBC Driver, Offheap, Storage
>            Reporter: Hyunsik Choi
>            Assignee: Jinho Kim
>             Fix For: 0.11.0, 0.12.0
>
>         Attachments: TAJO-1340.patch, TAJO-1340_2.patch
>
>
> Currently, the default output file format is CSV. By its nature, CSV has 
> three main problems:
>  * Its line or field delimiter can collide with characters that appear 
> in the result data.
>  * Plain text files are likely to be larger than other file formats.
>  * Its read and write performance is slow.
> We need to change the default output file format to another file format. We 
> also need to investigate which file format is best suited for this.
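
To illustrate the first problem in the description above (a delimiter colliding with the result data), here is a minimal sketch. It is not Tajo code: the class `DelimiterCollision` and its methods `writeLine` and `parseLine` are hypothetical names, and the sketch assumes a naive text writer that joins fields without quoting or escaping.

```java
import java.util.regex.Pattern;

public class DelimiterCollision {
    // Hypothetical naive writer: joins fields with the delimiter, no escaping.
    public static String writeLine(String[] fields, char delimiter) {
        return String.join(String.valueOf(delimiter), fields);
    }

    // Hypothetical naive reader: splits the line on every delimiter occurrence.
    // The -1 limit keeps trailing empty fields instead of dropping them.
    public static String[] parseLine(String line, char delimiter) {
        return line.split(Pattern.quote(String.valueOf(delimiter)), -1);
    }

    public static void main(String[] args) {
        // The second field contains the field delimiter itself.
        String[] row = {"1", "deliver, then sign", "N"};
        String line = writeLine(row, ',');
        String[] parsed = parseLine(line, ',');
        // Prints: 3 fields written, 4 fields read back
        System.out.println(row.length + " fields written, "
                + parsed.length + " fields read back");
    }
}
```

A binary row format that stores field lengths or offsets instead of delimiters would not have this ambiguity, which is one reason a non-text default may be attractive.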



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
