This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/arrow-datafusion.git


The following commit(s) were added to refs/heads/asf-site by this push:
     new 3607d0775b Publish built docs triggered by 
482f32caa9a05b172e723df41a9a1c50e8447b00
3607d0775b is described below

commit 3607d0775b802a63a522e0a8acc93acec15ce6e1
Author: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
AuthorDate: Fri May 12 14:41:02 2023 +0000

    Publish built docs triggered by 482f32caa9a05b172e723df41a9a1c50e8447b00
---
 _sources/user-guide/configs.md.txt | 2 +-
 user-guide/configs.html            | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/_sources/user-guide/configs.md.txt 
b/_sources/user-guide/configs.md.txt
index 77196299ed..d64f327e06 100644
--- a/_sources/user-guide/configs.md.txt
+++ b/_sources/user-guide/configs.md.txt
@@ -49,7 +49,7 @@ Environment variables are read during `SessionConfig` 
initialisation so they mus
 | datafusion.execution.collect_statistics                    | false      | 
Should DataFusion collect statistics after listing files                        
                                                                                
                                                                                
                                                                                
                                                                                
                 [...]
 | datafusion.execution.target_partitions                     | 0          | 
Number of partitions for query execution. Increasing partitions can increase 
concurrency. Defaults to the number of CPU cores on the system                  
                                                                                
                                                                                
                                                                                
                    [...]
 | datafusion.execution.time_zone                             | +00:00     | 
The default time zone Some functions, e.g. `EXTRACT(HOUR from SOME_TIME)`, 
shift the underlying datetime according to this time zone, and then extract the 
hour                                                                            
                                                                                
                                                                                
                      [...]
-| datafusion.execution.parquet.enable_page_index             | false      | If 
true, uses parquet data page level metadata (Page Index) statistics to reduce 
the number of rows decoded.                                                     
                                                                                
                                                                                
                                                                                
                [...]
+| datafusion.execution.parquet.enable_page_index             | true       | If 
true, uses parquet data page level metadata (Page Index) statistics to reduce 
the number of rows decoded.                                                     
                                                                                
                                                                                
                                                                                
                [...]
 | datafusion.execution.parquet.pruning                       | true       | If 
true, the parquet reader attempts to skip entire row groups based on the 
predicate in the query and the metadata (min/max values) stored in the parquet 
file                                                                            
                                                                                
                                                                                
                      [...]
 | datafusion.execution.parquet.skip_metadata                 | true       | If 
true, the parquet reader skip the optional embedded metadata that may be in the 
file Schema. This setting can help avoid schema conflicts when querying 
multiple parquet files with schemas containing compatible types but different 
metadata                                                                        
                                                                                
                        [...]
 | datafusion.execution.parquet.metadata_size_hint            | NULL       | If 
specified, the parquet reader will try and fetch the last `size_hint` bytes of 
the parquet file optimistically. If not specified, two reads are required: One 
read to fetch the 8-byte parquet footer and another to fetch the metadata 
length encoded in the footer                                                    
                                                                                
                      [...]
diff --git a/user-guide/configs.html b/user-guide/configs.html
index 715e8e91dd..6e75801bc8 100644
--- a/user-guide/configs.html
+++ b/user-guide/configs.html
@@ -391,7 +391,7 @@ Environment variables are read during <code class="docutils 
literal notranslate"
 <td><p>The default time zone Some functions, e.g. <code class="docutils 
literal notranslate"><span class="pre">EXTRACT(HOUR</span> <span 
class="pre">from</span> <span class="pre">SOME_TIME)</span></code>, shift the 
underlying datetime according to this time zone, and then extract the 
hour</p></td>
 </tr>
 <tr 
class="row-even"><td><p>datafusion.execution.parquet.enable_page_index</p></td>
-<td><p>false</p></td>
+<td><p>true</p></td>
 <td><p>If true, uses parquet data page level metadata (Page Index) statistics 
to reduce the number of rows decoded.</p></td>
 </tr>
 <tr class="row-odd"><td><p>datafusion.execution.parquet.pruning</p></td>

Reply via email to