[ 
https://issues.apache.org/jira/browse/ORC-21?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14639347#comment-14639347
 ] 

Aliaksei Sandryhaila commented on ORC-21:
-----------------------------------------

We use two components for reading ORC files: a reader and a reusable batch that 
we fill with data. Since the end user should not have to know or worry about 
the internal workings of these components, both should be able to report how 
much memory they need.

Proposed solution: add methods uint64_t Reader::memoryUse() and uint64_t 
ColumnVectorBatch::memoryUse() that return the exact memory footprint, or at 
least an upper bound on it, for the respective classes and their subclasses.
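
A rough sketch of how the batch side of this could look is given here. Only 
the name and return type of memoryUse() come from the proposal; the classes 
BatchSketch, LongBatchSketch and StructBatchSketch, their members, and the 
choice to count reserved capacity are assumptions made for illustration and 
are not the library's actual definitions.

```cpp
#include <cstdint>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical stand-in for ColumnVectorBatch (illustration only).
class BatchSketch {
public:
  explicit BatchSketch(uint64_t capacity) : notNull(capacity) {}
  virtual ~BatchSketch() = default;

  // Counts reserved capacity rather than rows in use, so the result is an
  // upper bound on what this batch holds in its own buffers.
  virtual uint64_t memoryUse() const {
    return static_cast<uint64_t>(notNull.capacity()) * sizeof(char);
  }

protected:
  std::vector<char> notNull;  // one null flag per row
};

// Hypothetical batch of 64-bit integers: base buffers plus its data buffer.
class LongBatchSketch : public BatchSketch {
public:
  explicit LongBatchSketch(uint64_t capacity)
      : BatchSketch(capacity), data(capacity) {}

  uint64_t memoryUse() const override {
    return BatchSketch::memoryUse() +
           static_cast<uint64_t>(data.capacity()) * sizeof(int64_t);
  }

private:
  std::vector<int64_t> data;
};

// Hypothetical struct batch: its own buffers plus those of every child.
class StructBatchSketch : public BatchSketch {
public:
  explicit StructBatchSketch(uint64_t capacity) : BatchSketch(capacity) {}

  void addField(std::unique_ptr<BatchSketch> field) {
    fields.push_back(std::move(field));
  }

  uint64_t memoryUse() const override {
    uint64_t total = BatchSketch::memoryUse();
    for (const auto& field : fields) {
      total += field->memoryUse();
    }
    return total;
  }

private:
  std::vector<std::unique_ptr<BatchSketch>> fields;
};
```

With this shape, a caller asks only the top-level batch for its footprint and 
each subclass accounts for its own buffers. A Reader::memoryUse() could follow 
the same pattern by summing the buffers the reader owns.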

> Add functionality to estimate memory footprint
> ----------------------------------------------
>
>                 Key: ORC-21
>                 URL: https://issues.apache.org/jira/browse/ORC-21
>             Project: Orc
>          Issue Type: Task
>            Reporter: Aliaksei Sandryhaila
>            Assignee: Aliaksei Sandryhaila
>
> The ORC library allocates multiple large buffers to read and materialize ORC 
> files. For stability of applications that use the library, it may be 
> desirable to have an estimate (preferably, a tight upper bound) of its memory 
> footprint.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
