[
https://issues.apache.org/jira/browse/TAJO-714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13950985#comment-13950985
]
David Chen commented on TAJO-714:
---------------------------------
I have modified {{ParquetAppender}} so that the Parquet tuning parameters can
be specified in the {{with}} clause of {{create table}} statements. The Review
Board (RB) request for this is here:
https://reviews.apache.org/r/19786/
I tested this with the following query:
{code}
create table table1_parquet (
  id int,
  name text,
  score float,
  type text)
using parquet with (
  'parquet.page.size'='65536',
  'parquet.block.size'='536870912',
  'parquet.compression'='snappy')
as select * from table1;
{code}
I have a few questions:
* The code to set default parameter values is a bit complex because
{{TableMeta.getOption()}} only supports {{String}} default values. I can
overload this method to support {{int}} and {{boolean}} default values, but I
noticed that this method returns {{null}} if {{p.hasParams()}} is false. Should
this method return the default value in that case as well?
* Enabling compression requires the compression codec jars to be on the
classpath. Do you think we should add gzip, lzo, and snappy to the dependencies
in the pom.xml, or should we leave it up to users to install those jars?
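To make the first question concrete, here is a rough sketch of what the overloads might look like if the default were returned even when no parameters are set. {{TableOptions}} is a hypothetical stand-in written for illustration only; it is not Tajo's actual {{TableMeta}}, and a {{null}} map is assumed to model the case where {{p.hasParams()}} is false.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a table's option store; NOT Tajo's actual TableMeta.
final class TableOptions {
    // Assumption: a null map models the "no params set" case.
    private final Map<String, String> params;

    TableOptions(Map<String, String> params) {
        this.params = params;
    }

    // Returns the stored value, or the default when the key or the whole map is absent.
    String getOption(String key, String defaultValue) {
        if (params == null) {
            return defaultValue;
        }
        String value = params.get(key);
        return value != null ? value : defaultValue;
    }

    // int overload: parses the stored string, falls back to the default when absent.
    int getOption(String key, int defaultValue) {
        String value = getOption(key, (String) null);
        return value != null ? Integer.parseInt(value.trim()) : defaultValue;
    }

    // boolean overload: parses the stored string, falls back to the default when absent.
    boolean getOption(String key, boolean defaultValue) {
        String value = getOption(key, (String) null);
        return value != null ? Boolean.parseBoolean(value.trim()) : defaultValue;
    }
}
```

With overloads shaped like this, a caller could write {{options.getOption("parquet.page.size", 1048576)}} and get the fallback without first checking whether any parameters exist. The fallback value there is arbitrary. A malformed numeric value would still throw {{NumberFormatException}}, which may or may not be the desired behavior.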
> Enable setting Parquet tuning parameters
> ----------------------------------------
>
> Key: TAJO-714
> URL: https://issues.apache.org/jira/browse/TAJO-714
> Project: Tajo
> Issue Type: Improvement
> Reporter: David Chen
> Assignee: David Chen
> Attachments: TAJO-714.patch
>
>
> The first version of Parquet support does not support setting Parquet's
> tuning configuration parameters, such as compression, row group and page
> size, dictionary encoding, etc.