Fixed-length encoding is the least efficient encoding and a cause of the high expansion. Consider using dictionary encoding, or some custom encoding, instead.
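
As a rough illustration of the difference, here is a back-of-the-envelope sketch in Python. It compares the dimension bytes per row for the four fixed-length columns from the original question (9 + 8 + 8 + 33 = 58 bytes) against a dictionary-style encoding. It assumes each dictionary ID takes the smallest whole number of bytes that can address the column's distinct values. The cardinalities below are made up for illustration, and `dict_id_bytes` is a hypothetical helper, not a Kylin API; Kylin's actual sizing may differ.

```python
def dict_id_bytes(cardinality: int) -> int:
    """Smallest whole number of bytes that can address `cardinality` distinct IDs.

    Assumption for this sketch: IDs run from 0 to cardinality - 1 and are
    stored in the minimum number of whole bytes.
    """
    if cardinality <= 1:
        return 1
    bits = (cardinality - 1).bit_length()  # bits needed for the largest ID
    return (bits + 7) // 8                 # round up to whole bytes


# Fixed lengths reported in the original question.
fixed_lengths = [9, 8, 8, 33]

# Hypothetical cardinalities, one per column above (not from the thread).
assumed_cardinalities = [1_000_000, 50_000, 50_000, 10_000_000]

fixed_total = sum(fixed_lengths)
dict_total = sum(dict_id_bytes(c) for c in assumed_cardinalities)

print(f"fixed-length bytes per row: {fixed_total}")
print(f"dictionary bytes per row (assumed cardinalities): {dict_total}")
```

With real cardinalities from your own data substituted in, this gives a quick sense of how much of the expansion comes from the encoding choice alone.
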

On Wed, May 18, 2016 at 12:16 PM, hongbin ma <mahong...@apache.org> wrote:

> I guess there is one or more dimensions with very high cardinality? Have
> you read http://kylin.apache.org/blog/2016/02/18/new-aggregation-group/?
> There might be some inspiration there.
>
> Also, are the four fixed-length dimensions all text? Are there any
> relationships between the dimensions? Why not use dict for them? Are they
> super-high cardinality columns?
>
> On Mon, May 16, 2016 at 10:21 AM, Peng <pengli0...@outlook.com> wrote:
>
> > Hi,
> >
> > Is it normal for the expansion rate to be about 1600%?
> >
> > My cube: about one hundred million rows;
> >
> > the fact table has one lookup table;
> >
> > 5 dimensions, 4 of which use fixed-length encoding, with lengths of
> > 9, 8, 8, and 33 respectively;
> >
> > 2 measures.
> >
> > The final expansion rate is about 1600%.
> >
> > Thanks
> >
> > Peng
>
> --
> Regards,
>
> *Bin Mahone | 马洪宾*
> Apache Kylin: http://kylin.io
> Github: https://github.com/binmahone