[ANNOUNCE] IMCS: In Memory Columnar Store for PostgreSQL

knizhnik Fri, 03 Jan 2014 07:29:58 -0800

I want to announce implementation of In-Memory Columnar Store extensionfor PostgreSQL.

Vertical representation of data is stored in PostgreSQL shared memory.

Various basic and sophisticated analytic operators are provided formanipulation with timeseries.


      GitHub repository: https://github.com/knizhnik/imcs/
      Documentation: http://www.garret.ru/imcs/user_guide.html
      Sources: http://www.garret.ru/imcs-1.02.tar.gz

Columnar store manager stores data tables as sections of columns of datarather than as rows of data.Most of traditional DBMS-es store data in rows ("horizontally"): allrecord attributes are stored together.Such approach allows to load the whole record using one read operationwhich usually leads to better performance for OLTPqueries (which access or update single records). But OLAP queries aremostly performing operations on individual columns,for example calculating sum or average of some column. In this casevertical data representation, when data for each columnis stored independently, is more efficient. There are several DBMS-es inmarker which are based on vertical model: Vertica,SciDB,... Also most of mainstream commercial databases also provide OLAPextensions based on vertical storage:Blue Acceleration for DB2, Oracle Database In-Memory Option, MicrosoftSQL server column store...

Columnar store or vertical representation of data allows to achievebetter performance in comparison with classical horizontalrepresentation due to three factors:* Reducing size of fetched data: only columns involved in query areaccessed.* Vector operations. Applying an operator to set of values (tile) makesit possible to minimize interpretation cost.Also SIMD instructions of modern processors accelerate execution ofvector operations.* Compression of data. Certainly compression can also be used for allthe records, but independent compression of each column can give muchbetter results without significant extra CPU overhead. For example suchsimple compression algorithm like RLE(run-length-encoding) allows not only to reduce used space, but alsominimize number of performed operations.




--
Sent via pgsql-announce mailing list (pgsql-announce@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-announce

[ANNOUNCE] IMCS: In Memory Columnar Store for PostgreSQL

Reply via email to