[ 
https://issues.apache.org/jira/browse/CASSANDRA-18534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17773559#comment-17773559
 ] 

Stefan Miklosovic commented on CASSANDRA-18534:
-----------------------------------------------

I prefer to merge CASSANDRA-18872 first where it will be removed. So it means 
that 5.0 will _not_ have crc_check_chance in compression anymore.

Then you might rebase this work against 5.0 where crc_check_chance will not be 
in compression anymore and you might tweak FileHandler builder to propagate 
sstable format option there to align it.

[~maxwellguo] [~blambov] how do this sound to you?

BTW I think this ticket as a whole needs to have a ML thread. We are changing 
CQL here and it would be great to involve more people into this.

What seems to be a little bit "strange" to me is that we chose these properties:

row_index_granularity
bloom_filter_fp_chance
crc_check_chance
min/max_index_interval

But _why exactly these_? Also, what does have a crc_check_chance, for example, 
to do with _sstable format_. There is no "format" behind that. crc_check_chance 
(similarly bloom_filter_fp_chance), is just a _probability_ with which we do so 
and so operation. That is an operational parameter, we are not _formatting an 
sstable_ as such. Maybe it is just a matter of naming, I just find this to be 
important to mention. 

> Make sstable format configurable per table
> ------------------------------------------
>
>                 Key: CASSANDRA-18534
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-18534
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Cluster/Schema, Local/SSTable
>            Reporter: Branimir Lambov
>            Assignee: Maxwell Guo
>            Priority: Normal
>             Fix For: 5.x
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Some SSTable format settings need to be configurable per table for better 
> efficiency. This includes:
>  - {{row_index_granularity}}
>  - {{bloom_filter_fp_chance}}
>  - {{crc_check_chance}}
>  - {{min/max_index_interval}}
> Some of these are currently configurable using direct properties of tables. 
> Having them as format properties makes better sense and should also support 
> specifying useable combinations of settings, e.g.
> {code:java}
> CREATE TABLE ... WITH sstable_format = "bti-fast";
> CREATE TABLE ... WITH sstable_format = "bti-small";
> {code}
> where {{bti-fast}} and {{bti-small}} can be defined in {{cassandra.yaml}} 
> e.g. as
> {code:java}
> sstable.format.options:
>   - bti-fast:
>       row_index_granularity: 1kiB
>       bloom_filter_fp_chance: 0.01
>   - bti-small:
>       row_index_granularity: 32kiB
>       bloom_filter_fp_chance: 0.1
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org

Reply via email to