Lawrence Chan created ARROW-2296:
------------------------------------

             Summary: Add num_rows to file footer
                 Key: ARROW-2296
                 URL: https://issues.apache.org/jira/browse/ARROW-2296
             Project: Apache Arrow
          Issue Type: Improvement
            Reporter: Lawrence Chan


Maybe I'm overlooking something, but I don't see something on the API surface 
to get the number of rows in a arrow file without reading all the record 
batches.

I'd like to propose that we add `num_rows` as a field to the footer so it's 
easy to query without reading the whole file.

Meanwhile, before we get that added to the official format fbs, it would be 
nice to haveĀ a method that iterates over the record batch headers and sums up 
the lengths without reading the actual record batch body.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to