GitHub user QiangCai opened a pull request:
https://github.com/apache/incubator-carbondata/pull/757
[CARBONDATA-782] support SORT_COLUMNS
The tasks of SORT_COLUMNS:
Support creating a table with the sort_columns property,
e.g. tblproperties('sort_columns' = 'col7,col3').
A table with the SORT_COLUMNS property will be sorted by the columns listed
in SORT_COLUMNS. The order of the columns is determined by SORT_COLUMNS.
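To illustrate the intended sort behavior, here is a minimal Python sketch. It is not CarbonData code: the helper names parse_sort_columns and sort_rows are hypothetical, and it assumes that the order in which columns are listed in the property is also their sort precedence.

```python
def parse_sort_columns(prop_value):
    """Split a 'sort_columns' property value such as 'col7,col3' into names."""
    return [name.strip() for name in prop_value.split(",") if name.strip()]


def sort_rows(rows, sort_columns):
    """Sort row dicts by the listed columns; earlier columns take precedence
    (an assumption about how SORT_COLUMNS is interpreted)."""
    return sorted(rows, key=lambda row: tuple(row[c] for c in sort_columns))


rows = [
    {"col3": 2, "col7": "b"},
    {"col3": 1, "col7": "b"},
    {"col3": 9, "col7": "a"},
]
sorted_rows = sort_rows(rows, parse_sort_columns("col7,col3"))
```

With 'col7,col3', rows are compared on col7 first and on col3 only to break ties.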
Change the encoding rule of SORT_COLUMNS.
First, the column encoding rules stay consistent with the previous behavior.
Second, if a column in SORT_COLUMNS was previously a measure, it will now be
created as a dimension. This dimension is a no-dictionary column (it would be
better to use some other direct-dictionary encoding).
Third, the dimensions in SORT_COLUMNS have both RLE and ROWID pages; other
dimensions have only RLE (not sorted).
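One possible reading of the third rule, sketched in Python. This is an assumption about the layout, not the actual CarbonData page format: a SORT_COLUMNS dimension is sorted within the page and keeps a ROWID list that maps each sorted position back to its original row, while any other dimension is run-length encoded in its original order. The function names are hypothetical.

```python
from itertools import groupby


def rle(values):
    """Run-length encode a list as (value, run_length) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


def encode_dimension_page(values, is_sort_column):
    """Return (rle_runs, row_ids); row_ids is None for non-sort dimensions."""
    if not is_sort_column:
        # Other dimensions: RLE only, values stay in their original order.
        return rle(values), None
    # row_ids[i] is the original row position of the i-th value after sorting.
    row_ids = sorted(range(len(values)), key=values.__getitem__)
    return rle([values[i] for i in row_ids]), row_ids


page = ["b", "a", "b", "a"]
sorted_runs, row_ids = encode_dimension_page(page, is_sort_column=True)
plain_runs, no_row_ids = encode_dimension_page(page, is_sort_column=False)
```

Sorting before RLE produces longer runs, and the ROWID list is what allows the original row order to be restored.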
The start/end key should be composed of SORT_COLUMNS.
SORT_COLUMNS are used to build the start/end key during data loading and
select queries.
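A rough Python sketch of how a start/end key built from SORT_COLUMNS could be used. It assumes the rows of a block are already sorted by the sort columns, so the first and last rows give the start and end keys. Both build_start_end_key and block_may_match are hypothetical names for illustration, not CarbonData APIs.

```python
def build_start_end_key(sorted_rows, sort_columns):
    """Take the sort-column values of the first and last row as start/end key.
    Assumes sorted_rows is non-empty and already sorted by sort_columns."""
    def key_of(row):
        return tuple(row[c] for c in sort_columns)
    return key_of(sorted_rows[0]), key_of(sorted_rows[-1])


def block_may_match(start_key, end_key, lookup_key):
    """Check whether equality values on the leading sort columns can fall
    inside the block's key range (compared on the same key prefix length)."""
    n = len(lookup_key)
    return start_key[:n] <= lookup_key <= end_key[:n]


block = [
    {"col7": "a", "col3": 9},
    {"col7": "b", "col3": 1},
    {"col7": "b", "col3": 2},
]
start_key, end_key = build_start_end_key(block, ["col7", "col3"])
hit = block_may_match(start_key, end_key, ("b",))
miss = block_may_match(start_key, end_key, ("c",))
```

At load time the keys would be stored with the block; at query time a filter on the leading sort columns can skip any block whose key range cannot contain the value.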
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/QiangCai/incubator-carbondata nosorttables
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-carbondata/pull/757.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #757
----
commit 043ded68a347a61924458f12bcd2be1c784b5cb9
Author: QiangCai <[email protected]>
Date: 2017-03-02T09:48:54Z
sort columns
----