Re: [Discussion] Implement Partition Table Feature

Jacky Li Sat, 15 Apr 2017 08:32:39 -0700

> 在 2017年4月15日，下午12:00，Jacky Li <jacky.li...@qq.com> 写道：
> 
> Hi Cao Lu,
> 
> The overall design likes good to me, I just have following points need to
> confirm:
> 1. Is there detele partition DDL?
> 2. For the data loading part, it needs to do global shuffle before actual
> data loading? And the partition key should not be included in SORT_COLUMNS
> option, right? If yes, I think it is better to put this constrain in the
> document also.


After second thought, I think it is up to the user whether to put partition key 
in the SORT_COLUMNS. There should be no constrain.

> 3. For the query part, I suggest to add more description for index, like how
> B tree will be loaded into driver and many B tree will be there?
> 4. As a further optimization, is it possible that we map the partition to
> DataNode such that we do not need to communicate with NameNode for every
> query? Can this mapping be considered like a cache?
> 
> Regards,
> Jacky
> 
> 
> --
> View this message in context: 
> http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/Discussion-Implement-Partition-Table-Feature-tp10938p11063.html
> Sent from the Apache CarbonData Mailing List archive mailing list archive at 
> Nabble.com.

Re: [Discussion] Implement Partition Table Feature

Reply via email to