You can start with the high level architecture [1]. Then the community
might help you if you have any specific questions.
[1] https://drill.apache.org/architecture/
On Sun, Jul 24, 2016 at 11:36 PM, Sanjiv Kumar wrote:
> How drill runs query internally. I want to know how drill execute query fo
How drill runs query internally. I want to know how drill execute query for
different data sources.I want to know internal process of drill.
..
Thanks & Regards
*Sanjiv Kumar*
Even for csv or json format, directory-based Partition pruning [1] could be
leveraged to prune data. You have to use the special dir* field in your
query to filter out un-wanted data, or define a view which uses dir* field
and then query against the view.
1. https://drill.apache.org/docs/partition
Hi Hafiz,
I guess it depends on the query. Generally Drill will try to push any
filter you have in your query to the leaf nodes so they won't send any row
that doesn't pass the filter. Also only the columns that appear in the
query will be loaded from the file.
The file format you are querying al
Hi all!
I want to know about drill working. Suppose i query to data on S3. the
volume of data is huge in GB's. So when I query to that data what happens?
whether drill load whole data on drill nodes? or just query data without
loading whole data?