Re: [PR] Normalize partitioned and flat object listing [datafusion]

via GitHub Fri, 07 Nov 2025 08:28:33 -0800


alamb commented on code in PR #18146:
URL: https://github.com/apache/datafusion/pull/18146#discussion_r2504451187



##########
datafusion/core/tests/datasource/object_store_access.rs:
##########
@@ -194,17 +183,8 @@ async fn query_partitioned_csv_file() {
     +---------+-------+-------+---+----+-----+
     ------- Object Store Request Summary -------
     RequestCountingObjectStore()
-    Total Requests: 11
-    - LIST (with delimiter) prefix=data
-    - LIST (with delimiter) prefix=data/a=1
-    - LIST (with delimiter) prefix=data/a=2
-    - LIST (with delimiter) prefix=data/a=3
-    - LIST (with delimiter) prefix=data/a=1/b=10
-    - LIST (with delimiter) prefix=data/a=2/b=20
-    - LIST (with delimiter) prefix=data/a=3/b=30
-    - LIST (with delimiter) prefix=data/a=1/b=10/c=100
-    - LIST (with delimiter) prefix=data/a=2/b=20/c=200
-    - LIST (with delimiter) prefix=data/a=3/b=30/c=300
+    Total Requests: 2
+    - LIST prefix=data
     - GET  (opts) path=data/a=2/b=20/c=200/file_2.csv

Review Comment:
   > That being said, what you've raised with your path examples is exactly the 
case I was thinking of when I noted that cache entries for partitioned tables 
might need to be "prefix aware"
   
   It seems like we will have two choices:
   1. Implement the relevant prefix filtering on the client (e.g if we have 
cached `LIST /path/to/foo` and then get a request for `LIST /path/to/foo/bar` 
we could try and filter / prefix match the entry in the cache)
   2. Not handle sub-prefix matches 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] Normalize partitioned and flat object listing [datafusion]

Reply via email to