Re: Directory partitions slower than scan all events

2015-02-18 Thread Adam Gilmore
Just a follow up on this. After reviewing the physical plans, the difference appears to be that dir0 is now included as a column in the parquet-scan and then it is passed to a project for price. It would be better if even when dir0 is specified as a filter, if it is not specified in the projectio

Directory partitions slower than scan all events

2015-02-18 Thread Adam Gilmore
Hi guys, I'm trying to understand something about directory partitions and how they're implemented. For sake of basic argument, I have ~3 mil rows in 3 separate Parquet files. Each one has a "groupId" of 1, 2 and 3 respectively. I then place them in separate directories named 1, 2 and 3. The f