Re: GroupBy with index is really slow.

Andrey Mashenkov Wed, 10 May 2017 10:51:06 -0700

Hi,

I've just think you can achive same result with query without groupBy.
Select distinct age from ..


Does it workable for you?

10 мая 2017 г. 17:56 пользователь "Guillermo Ortiz" <konstt2...@gmail.com>
написал:

> It's my laptop. 16gb, i7.. The collection size it's 5Mill objects (1,2GB)
> and it takes about 30sec.
> I have tried to execute the query with the same collection with and
> without indices with the same time results.
>
> This is a test before to run in the real cluster with 6 nodes of 512Gb and
> 48cores each one. We tried to execute there with a real dataset(22M rows,
> 7gb) but it takes so long that query is aborted.
>
>
> 2017-05-08 12:12 GMT+02:00 Andrey Mashenkov <andrey.mashen...@gmail.com>:
>
>> Hi,
>>
>> Why do you think query is slow?
>> What its execution time? What is expected time?
>> How many nodes do you use? How much data does cache contains?
>>
>> On Mon, May 8, 2017 at 1:47 AM, Guillermo Ortiz <konstt2...@gmail.com>
>> wrote:
>>
>>> I have a simple cache where I have Person object (id, age and so on..)
>>>
>>> I have created indeces by id and age to try some examples but queries
>>> with "group by" go really slow.
>>>
>>> I'm trying this query:
>>> SELECT age
>>> FROM
>>> PersonWithindex
>>> group by age
>>>
>>>
>>> SELECT
>>>     AGE AS __C0
>>> FROM "personCacheWithIndex".PERSONWITHINDEX
>>>     /* "personCacheWithIndex"."age_idx" */
>>> GROUP BY AGE
>>> /* group sorted */
>>>
>>> SELECT
>>>     __C0 AS AGE
>>> FROM PUBLIC.__T0
>>>     /* "personCacheWithIndex"."merge_scan" */
>>> GROUP BY __C0
>>>
>>> Although it seems that it uses index, why is it going so slow? I think
>>> that it should be pretty fast with an index.
>>>
>>
>>
>>
>> --
>> Best regards,
>> Andrey V. Mashenkov
>>
>
>

Re: GroupBy with index is really slow.

Reply via email to