Daniel Weeks created PARQUET-177:
------------------------------------
Summary: MemoryManager ensure minimum Row Group size
Key: PARQUET-177
URL: https://issues.apache.org/jira/browse/PARQUET-177
Project: Parquet
Issue Type: Improvement
Components: parquet-mr
Affects Versions: 1.6.0rc2
Reporter: Daniel Weeks
Assignee: Daniel Weeks
Priority: Minor
The memory manager currently has no limit to how small it will make row groups.
This is problematic because jobs that have a large number of writers can
result in tiny row groups that hurt performance.
The following patch will allow a configurable minimum size before killing the
job. Default is currently no limit.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)