[GitHub] [arrow] thisisnic commented on pull request #10269: ARROW-11705: [R] Support scalar value recycling in RecordBatch/Table$create()

GitBox Fri, 14 May 2021 06:34:44 -0700


thisisnic commented on pull request #10269:
URL: https://github.com/apache/arrow/pull/10269#issuecomment-841248102



   > There are some file-read benchmarks that are >5% slower, interestingly it 
is all (and only) the fanniemae dataset that is slower (both reading from 
parquet and from feather) and _only_ when it is being converted to a 
data.frame, not when it is being left as a table. This seems a little suspect 
to me since the only places that I'm seeing you've meaningfully changed the 
code is `RecordBatch$create`, `Table$create`, and `MakeArrayFromScalar`. Do any 
of those get called when reading parquet or feather files?
   
   They do not, which does make it strange; completely overlooked the fact that 
those shouldn't be relevant here.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow] thisisnic commented on pull request #10269: ARROW-11705: [R] Support scalar value recycling in RecordBatch/Table$create()

Reply via email to