Nong Li created PARQUET-17:
------------------------------
Summary: Parquet OutputFormat should allow controlling the file size
Key: PARQUET-17
URL: https://issues.apache.org/jira/browse/PARQUET-17
Project: Parquet
Issue Type: Bug
Reporter: Nong Li
To generate the most efficient on-disk files, it is important to control the
file size. It would be nice if we could configure the OutputFormat to roll over
to a new file when the current one reaches the target size.
There's currently no easy way to tune this; it requires indirect tuning (number
of reduces, map input size).
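
To make the request concrete, here is a rough sketch of size-based rollover.
It is not Parquet's API: the class RollingWriterSketch, the targetBytes
parameter, and the in-memory "files" are all hypothetical, and the size of a
record is approximated by its UTF-8 byte length. A real Parquet file's size
also depends on encoding and compression, so treat the byte count as a stand-in.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of size-based file rollover; not part of Parquet. */
public class RollingWriterSketch {
    private final long targetBytes;                       // assumed size threshold per file
    private final List<List<String>> files = new ArrayList<>();
    private List<String> current = new ArrayList<>();     // records in the open "file"
    private long currentBytes = 0;                        // bytes written to the open "file"

    public RollingWriterSketch(long targetBytes) {
        if (targetBytes <= 0) {
            throw new IllegalArgumentException("targetBytes must be positive");
        }
        this.targetBytes = targetBytes;
    }

    /** Appends a record, rolling to a new file first if it would exceed the target. */
    public void write(String record) {
        long size = record.getBytes(StandardCharsets.UTF_8).length;
        // Never roll an empty file, so an oversized record still gets written somewhere.
        if (currentBytes > 0 && currentBytes + size > targetBytes) {
            roll();
        }
        current.add(record);
        currentBytes += size;
    }

    /** Closes the open file and starts a fresh one. */
    private void roll() {
        files.add(current);
        current = new ArrayList<>();
        currentBytes = 0;
    }

    /** Flushes the last non-empty file and returns everything written. */
    public List<List<String>> close() {
        if (!current.isEmpty()) {
            roll();
        }
        return files;
    }

    public static void main(String[] args) {
        RollingWriterSketch writer = new RollingWriterSketch(10);
        for (String r : new String[] {"aaaa", "bbbb", "cccc", "dddd", "eeee"}) {
            writer.write(r);
        }
        // prints [[aaaa, bbbb], [cccc, dddd], [eeee]]
        System.out.println(writer.close());
    }
}
```

With a setting like this, output file size would be tuned directly rather than
through the number of reduces or the map input size.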