Re: [sqlite] Covering index scan optimization

Elefterios Stamatogiannakis Sat, 15 Sep 2012 04:08:25 -0700

On 14/09/12 22:56, Clemens Ladisch wrote:

Elefterios Stamatogiannakis wrote:

On 13/09/12 23:02, Clemens Ladisch wrote:
For my main workload (OLAP) this can make an enormous difference!


OLAP isn't quite the typical SQLite use case.  But do you have any
numbers (which would help deciding whether to accept this patch)?


Concerning the "numbers", i'll run some timings.

Concerning OLAP being the typical SQLite use case. I haven't chosenSQLite for OLAP lightly. When the choice was made (some years ago) wehad benchmarked other DBs too (MySQL, Postgres).

Things may have changed from when we did the benchmarks, but i suspectthat if we ran the same tests again the final choice would remain thesame. Some more details concerning the things that we had found follows.

MySQL's query execution engine was very fast for simple selects, countsetc. When the queries became more complex, MySQL tended to "explode" dueto materializing nested queries to the disk or other query optimizerstupidities.

Postgres, which i hold dear in my heart, was a lot closer to SQLite'sbehaviour (it also runs single threaded for single OLAP queries).Nevertheless it was slower than SQLite for queries involving UDFs and onsome complex queries it could take many times more to finish a querythan SQLite.

In the end we didn't care that much about raw performance, but about UDFflexibility/speed, and to have a reliable and predictable queryplanner/optimizer.

Concerning the query optimizer, in my opinion and by watching other moreexperienced people than me writing queries. What counts more is not thecleverness of the query optimizer but being able to predict what areliable optimizer will do.

There are a number of things that i miss from Postgres e.g. nestedtables and the new JSON storage in 9.2. But using SQLite's UDF API (viaexceptional APSW [*] ) i have created workarounds for them in madIS [*].So i don't care that much.

What i would really like to have in SQLite concerning OLAP, would bebigger pages, and internal page compression in a similar manner thatcolumn stores do [^]. This would greatly alleviate the storage pain ofusing denormalized DBs which is a must for OLAP.

In addition, this feature would bring SQLite's engine very close to alsobeing able to function as a document store while at the same timespeeding up its relational processing (due to having smaller DBs on thedisk).


Kind regards,

lefteris

[*] APSW: http://code.google.com/p/apsw/
    madIS: http://code.google.com/p/madis/

[^] Like http://en.wikipedia.org/wiki/RCFile . Another idea would be foreach column to have a dictionary at the start and then the column's datawould reference the column's dictionary.

_______________________________________________
sqlite-users mailing list
[email protected]
http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users

Re: [sqlite] Covering index scan optimization

Reply via email to