damccorm opened a new issue, #21551:
URL: https://github.com/apache/beam/issues/21551

   This tasks involves refactoring and improvements of IO connectors' file 
metadata related methods (GcsIO, S3IO, BlobIO, hadoop).
   
   Currently, we have individual methods like size, last_updated, checksum, and 
others. Each one would make a HTTP request in order to get the specific 
metadata field. If one needs to gather multiple metadata fields, then every 
specific method are called and making multiple requests under the hood. 
Actually, the HTTP response contains multiple file metadata fields but each 
time only one field is collected and others are discarded.
   
   We should have a public method that returns a named tuple which contains 
multiple file metadata fields. In its implementation it only makes one request, 
as existing methods for single metadata field. 
   
    
   
   Imported from Jira 
[BEAM-14393](https://issues.apache.org/jira/browse/BEAM-14393). Original Jira 
may contain additional context.
   Reported by: yihu.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to