Nong Li created PARQUET-17:
------------------------------

             Summary: Parquet OutputFormat should allow controlling the file 
size
                 Key: PARQUET-17
                 URL: https://issues.apache.org/jira/browse/PARQUET-17
             Project: Parquet
          Issue Type: Bug
            Reporter: Nong Li


To generate the most efficient on disk file, the size of the file is important 
to control. It would be nice if we could configure the ouputformat to roll over 
new files when it reaches the right size and start a new file.

There's currently no easy way to tune this and requires indirect tuning (number 
of reduces, map input size).



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to