westonpace commented on code in PR #14663: URL: https://github.com/apache/arrow/pull/14663#discussion_r1029895126
##########
cpp/src/arrow/dataset/file_base.h:
##########
@@ -108,6 +115,9 @@ class ARROW_DS_EXPORT FileSource : public util::EqualityComparable<FileSource> {
   /// \brief Get a RandomAccessFile which views this file source
   Result<std::shared_ptr<io::RandomAccessFile>> Open() const;
+  /// \brief Get the size (in bytes) of the file or buffer

Review Comment:
   Hmmm...the same question exists for "encoded size?" or "decoded size?". I believe the answer is "uncompressed and decoded". The goal is to limit how much data is loaded after a scanner is paused. I will update this comment.