Hi xuchianyin, Thanks for the question
1. Currently implementation is no need to load all the segments, only required will be loaded during filter and all segments will be loaded during query like count *. 2. Cache loading is fired during pruning phase in query, it will go to index server prune and load to cache , if index server is disabled and if distributed pruning is enabled then distributed pruning happens else driver side pruning, please check the index server Design doc for more info on this. For auto compaction, no need to load to index server, because internally one more level of compaction can happen and old loaded segments can become invalid, I will handle this is Design document. 3. Index server is a separate spark application meant for caching , so for SDK , spark session doesn't come into picture, so SDK not applicable, for file format case we will handle. Please get back for any clarifications or inputs. Thanks and Regards Akash R Nilugal From: xuchuanyin <xuchuan...@apache.org> > Date: Sat, 17 Aug, 2019, 11:55 AM > Subject: Re: [DISCUSSION] Cache Pre Priming > To: <dev@carbondata.apache.org> > > > Hi, I've two questions about the current index server implementation: > > 1. Currently do we need to load all the index data of all segments to cache > server while doing filter query OR only load the segments required by this > query? > > 2. When do we trigger the cache loading action during the query? > > As for the proposal in this mail, what will happen if auto-compaction occur > for this loading? > > 3. Since we want to preload the index to cache, maybe we need to handle all > the scenarios that causing data ingestion, so it seems you forget the SDK > scenario. > > > > -- > Sent from: > http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/ >