alamb commented on code in PR #18820:
URL: https://github.com/apache/datafusion/pull/18820#discussion_r2549988898


##########
datafusion/common/src/config.rs:
##########
@@ -699,6 +699,12 @@ config_namespace! {
         /// the filters are applied in the same order as written in the query
         pub reorder_filters: bool, default = false
 
+        /// (reading) Force the use of RowSelections for filter results, when

Review Comment:
   This is an escape valve if we find some issue when using the new adaptive 
filter from @hhhizzz 
   - https://github.com/apache/arrow-rs/pull/8733



##########
datafusion/core/src/physical_planner.rs:
##########
@@ -3106,7 +3106,7 @@ mod tests {
 
         assert_contains!(
             &e,
-            r#"Error during planning: Can not find compatible types to compare 
Boolean with [Struct("foo": Boolean), Utf8]"#
+            r#"Error during planning: Can not find compatible types to compare 
Boolean with [Struct("foo": non-null Boolean), Utf8]"#

Review Comment:
   these are due the changes from 
https://github.com/apache/arrow-rs/issues/8648 to clean up datatype display. It 
is a nice improvement in my mind



##########
datafusion/core/tests/parquet/filter_pushdown.rs:
##########
@@ -636,6 +636,27 @@ async fn predicate_cache_pushdown_default() -> 
datafusion_common::Result<()> {
     config.options_mut().execution.parquet.pushdown_filters = true;
     let ctx = SessionContext::new_with_config(config);
     // The cache is on by default, and used when filter pushdown is enabled
+    PredicateCacheTest {
+        expected_inner_records: 8,
+        expected_records: 7, // reads more than necessary from the cache as 
then another bitmap is applied

Review Comment:
   this behavior changed due to adaptive filtering. I added a new test that 
turns off adaptive filtering to show doing so restores the old behavior



##########
datafusion-cli/src/main.rs:
##########
@@ -592,9 +592,9 @@ mod tests {
         
+-----------------------------------+-----------------+---------------------+------+------------------+
         | filename                          | file_size_bytes | 
metadata_size_bytes | hits | extra            |
         
+-----------------------------------+-----------------+---------------------+------+------------------+
-        | alltypes_plain.parquet            | 1851            | 6957           
     | 2    | page_index=false |
-        | alltypes_tiny_pages.parquet       | 454233          | 267014         
     | 2    | page_index=true  |
-        | lz4_raw_compressed_larger.parquet | 380836          | 996            
     | 2    | page_index=false |
+        | alltypes_plain.parquet            | 1851            | 8882           
     | 2    | page_index=false |

Review Comment:
   Update: the size is correct. As @etseidl says "the truth hurts"



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to