Re: Presto+CarbonData optimization work discussion

Bhavya Aggarwal Tue, 25 Jul 2017 02:32:00 -0700

I have created a pull request 1190  for Presto Optimization where we have
done following changes to improve the performance

1. Removed unnecessary loops from the integration code to make it more
efficient.
2. Implemented Lazy Blocks as is being used in case of ORC.
3. Improved dictionary decoding to have better results.

I have run this on my local machine for 2 GB data and results are attached
with this email, we see an improvement in almost all TPCH queries that we
have run.

Thanks and regards
Bhavya

On Thu, Jul 20, 2017 at 12:21 PM, rui qin <[email protected]> wrote:

> For -- 6) spark has the vectorized feature,but not in presto.How to
> implement
> it？
>
>
>
> --
> View this message in context: http://apache-carbondata-dev-
> mailing-list-archive.1130556.n5.nabble.com/Presto-
> CarbonData-optimization-work-discussion-tp18509p18548.html
> Sent from the Apache CarbonData Dev Mailing List archive mailing list
> archive at Nabble.com.
>

PrestoQueryResults.xlsx
Description: MS-Excel 2007 spreadsheet

Re: Presto+CarbonData optimization work discussion

Reply via email to