drin opened a new pull request, #14071:
URL: https://github.com/apache/arrow/pull/14071

   A PR to subsume #13390
   
   I tried to add on to #13390, but then I rebased and now it seems really 
complicated to add to that PR.
   
   This PR primarily uses `Uri` to parse file URIs extracted from 
`substrait::ReadRel`. This occurs when `FromProto` is called on a `Relation` 
that is a read type (`ReadRel`). Additionally, to make `FromProto` a bit more 
readable, I moved the code for this case into a separate function, 
`FromReadRelation`. This function has the following overall flow:
   
   1. Validate ReadRel
   2. Initialize FileFormat from the 1st file URI
   3. foreach file URI:
       1. Validate the proto URI
       2. Parse using the Uri class
       3. Validate the Uri (should be a `file:///` path)
       4. Accumulate `FileInfo` instances using `LocalFileSystem`
   4. return DeclarationInfo for dataset scan over the file URIs
   
   
   Validation functions are prefixed with `Check` to match `CheckRelCommon`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to