Hi

Agree, we don't need to add special constrain for one column be together
existing in partition key and SORT_COLUMNS.

But from actual case, don't suggest giving partition key is same as the
first column of SORT_COLUMNS, maybe we need to add the tips to partition
feature's document.

Regards
Liang


Jacky Li wrote
>> 在 2017年4月15日,下午12:00,Jacky Li <

> jacky.likun@

> > 写道:
>> 
>> Hi Cao Lu,
>> 
>> The overall design likes good to me, I just have following points need to
>> confirm:
>> 1. Is there detele partition DDL?
>> 2. For the data loading part, it needs to do global shuffle before actual
>> data loading? And the partition key should not be included in
>> SORT_COLUMNS
>> option, right? If yes, I think it is better to put this constrain in the
>> document also.
> 
> After second thought, I think it is up to the user whether to put
> partition key in the SORT_COLUMNS. There should be no constrain.
> 
>> 3. For the query part, I suggest to add more description for index, like
>> how
>> B tree will be loaded into driver and many B tree will be there?
>> 4. As a further optimization, is it possible that we map the partition to
>> DataNode such that we do not need to communicate with NameNode for every
>> query? Can this mapping be considered like a cache?
>> 
>> Regards,
>> Jacky
>> 
>> 
>> --
>> View this message in context:
>> http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/Discussion-Implement-Partition-Table-Feature-tp10938p11063.html
>> Sent from the Apache CarbonData Mailing List archive mailing list archive
>> at Nabble.com.





--
View this message in context: 
http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/Discussion-Implement-Partition-Table-Feature-tp10938p11321.html
Sent from the Apache CarbonData Mailing List archive mailing list archive at 
Nabble.com.

Reply via email to