[ 
https://issues.apache.org/jira/browse/PARQUET-622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xu Chen updated PARQUET-622:
----------------------------
    Description: 
Does dictionary encoding achieve a higher compression ratio when there are
more duplicate records?

Why not let each RowGroup have a single DictionaryData, so that more duplicate
records go into the dictionary?

  was:
If using dictionary more of duplicate record have more compress rate ?

Why do not let one RowGroup has one DictionaryData to make more duplicate 
record into dictionary 


> Each RowGroup has one DictionaryData
> ------------------------------------
>
>                 Key: PARQUET-622
>                 URL: https://issues.apache.org/jira/browse/PARQUET-622
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-format
>            Reporter: Xu Chen
>
> Does dictionary encoding achieve a higher compression ratio when there are
> more duplicate records?
> Why not let each RowGroup have a single DictionaryData, so that more duplicate
> records go into the dictionary?
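
To make the question concrete, here is a minimal sketch of the idea, assuming a toy in-memory column. It is not Parquet's actual writer code; the helper names `dictionary_encode` and `total_dictionary_entries` and the sample data are hypothetical. It only counts how many dictionary entries get stored when each fixed-size chunk of a column builds its own dictionary, compared with one dictionary shared across the whole span.

```python
def dictionary_encode(values):
    """Return (dictionary, indices): each distinct value is stored once,
    and every occurrence is replaced by its index in the dictionary."""
    dictionary = []
    positions = {}
    indices = []
    for v in values:
        if v not in positions:
            positions[v] = len(dictionary)
            dictionary.append(v)
        indices.append(positions[v])
    return dictionary, indices


def total_dictionary_entries(values, chunk_size):
    """Count dictionary entries stored when every chunk of `chunk_size`
    values builds its own, independent dictionary."""
    total = 0
    for start in range(0, len(values), chunk_size):
        dictionary, _ = dictionary_encode(values[start:start + chunk_size])
        total += len(dictionary)
    return total


# Hypothetical column with many repeated values.
column = ["a", "b", "a", "c", "b", "a", "c", "a"]

# One dictionary shared over the whole span.
shared = total_dictionary_entries(column, chunk_size=len(column))

# Two chunks of four values, each with a separate dictionary.
per_chunk = total_dictionary_entries(column, chunk_size=4)
```

In this toy case the shared dictionary stores each distinct value once, while the per-chunk variant stores repeated values again in every chunk that contains them. Whether this translates into a better on-disk compression ratio also depends on index width and on the general-purpose compression applied afterwards, which the sketch does not model.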



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
