[ https://issues.apache.org/jira/browse/ARROW-12428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17324968#comment-17324968 ]
David Li commented on ARROW-12428: ---------------------------------- D'oh, and you already explained this in the SO question :) I'll re-run the benchmarks to make sure they're fair. > [Python] pyarrow.parquet.read_* should use pre_buffer=True > ---------------------------------------------------------- > > Key: ARROW-12428 > URL: https://issues.apache.org/jira/browse/ARROW-12428 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Reporter: David Li > Assignee: David Li > Priority: Major > Labels: pull-request-available > Fix For: 5.0.0 > > Time Spent: 1h 10m > Remaining Estimate: 0h > > If the user is synchronously reading a single file, we should try to read it > as fast as possible. The one sticking point might be whether it's beneficial > to enable this no matter the filesystem or whether we should try to only > enable it on high-latency filesystems. -- This message was sent by Atlassian Jira (v8.3.4#803005)