[
https://issues.apache.org/jira/browse/ORC-21?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14994351#comment-14994351
]
ASF GitHub Bot commented on ORC-21:
-----------------------------------
GitHub user asandryh opened a pull request:
https://github.com/apache/orc/pull/12
Fixed ORC-21: Add dynamic memory usage estimation.
An upper bound on memory requirements is provided by two components:
* Reader::memoryUse() returns an upper bound on its memory needs. It
depends on the file and columns read.
* ColumnBatch::memoryUse() returns an upper bound on its memory needs. It
depends on the file, columns, and number of rows read.
The new utility FileMemory.cc compares estimated and actual memory usage.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/asandryh/orc orc-21
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/orc/pull/12.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #12
----
commit 01ea2074b413d7f95664338dc6df7d20224370f3
Author: Aliaksei Sandryhaila <[email protected]>
Date: 2015-11-06T19:59:08Z
Fixed ORC-21: Add dynamic memory usage estimation.
----
> Add functionality to estimate memory footprint
> ----------------------------------------------
>
> Key: ORC-21
> URL: https://issues.apache.org/jira/browse/ORC-21
> Project: Orc
> Issue Type: Task
> Reporter: Aliaksei Sandryhaila
> Assignee: Aliaksei Sandryhaila
>
> ORC library allocates multiple large buffers to read and materialize ORC
> files. For stability of applications that use the library, it may be
> desirable to have an estimate (preferably, a tight upper bound) of a memory
> footprint.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)