Re: Best way to partition the data

2017-09-01 Thread Jinfeng Ni
If you have a small cardinality for the partitioning column, yet still end up with 50k different small files, it's possible that you have many parallel writer minor fragments (threads). By default, each writer minor fragment works independently. If you have cardinality C and N writer minor fragments,

Re: Best way to partition the data

2017-09-01 Thread Padma Penumarthy
Have you tried building the metadata cache file using the "refresh table metadata" command? That will help reduce the planning time. Is most of the time spent in planning or execution? Pruning is done at the rowgroup level, i.e. at the file level (we create one file per rowgroup). We do not support pruning a

Best way to partition the data

2017-09-01 Thread Damien Profeta
Hello, I have a dataset that I always query on 2 columns that don't have a high cardinality. So, to benefit from pruning, I tried to partition the file on these keys, but I end up with 50k different small files (30 MB) and queries on them spend most of their time in the planning phase, to decode the m

Re: Problems using Postgres datasource

2017-09-01 Thread Gonzalo Ortiz Jaureguizar
Thanks for your fast answer. I bet it is a misconfiguration on my side, but I'm pretty stuck and it seems like the typical thing that is easily spotted by a third observer ;) 2017-09-01 9:33 GMT+02:00 Kunal Khatua : > I'm not very familiar with the details of Postgres, but I do see people > occ

RE: Problems using Postgres datasource

2017-09-01 Thread Kunal Khatua
I'm not very familiar with the details of Postgres, but I do see people occasionally asking about it. Have you checked the mailing list archives? You might find your answers there. -----Original Message----- From: Gonzalo Ortiz Jaureguizar [mailto:golthir...@gmail.com] Sent: Friday, Septembe

Re: Problems using Postgres datasource

2017-09-01 Thread William Dickenson
Gonzalo, we use Drill against Postgres extensively, although almost exclusively with ODBC. That being said, our expert returns on Tuesday, so I will walk your question over. Thanks Sent from my iPhone > On Sep 1, 2017, at 9:28 AM, Gonzalo Ortiz Jaureguizar > wrote: > > I do not want to loo

Re: Problems using Postgres datasource

2017-09-01 Thread Gonzalo Ortiz Jaureguizar
I do not want to look like I'm spamming, but there has been some activity on the list and yet no one has answered me. I don't know if the topic is not very interesting or if it is out of scope. Maybe the silence is due to the fact that the email was forwarded and you may not have read the interesting pa