[GENERAL] Best way to handle multi-billion row read-only table?

Asher Tue, 09 Feb 2010 10:10:20 -0800

Hello.

I'm putting together a database to store the readings from various measurement devices for later processing. Since these things (water pressure monitors attached to very large water pipes) take readings at 200Hz and are typically deployed over multiple sites for several months at a time, I've got many billions of rows of data, each (at the moment) with the following simple format:


        value REAL NOT NULL,
        sample_time TIMESTAMP WITH TIME ZONE NOT NULL,
        channel INTEGER REFERENCES channel(id) NOT NULL

(Where the "channel" table contains metadata to identify the particular sensor, data logger, etc. used to obtain the data and the combination of channel and sample_time is unique.)

Once loaded into the database the data will never be deleted or modified and will typically be accessed over a particular date range for a particular channel (e.g. "sample_time >= X AND sample_time <= Y AND channel=Z"). A typical query won't return more than a few million rows and speed is not desperately important (as long as the time is measured in minutes rather than hours).
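
To put those numbers in perspective, here is a rough sketch of the arithmetic in Python. Only the 200Hz rate comes from the figures above; the function names and the 3-million-row example (standing in for "a few million") are illustrative assumptions.

```python
# Back-of-the-envelope sizing for sensors sampled at 200 Hz.
SAMPLE_RATE_HZ = 200              # sampling rate stated in the post
SECONDS_PER_DAY = 24 * 60 * 60

# Rows generated by one channel in one day.
rows_per_channel_per_day = SAMPLE_RATE_HZ * SECONDS_PER_DAY


def rows_in_range(seconds: float, channels: int = 1) -> int:
    """Rows produced by `channels` sensors over a window of `seconds`."""
    return int(SAMPLE_RATE_HZ * seconds * channels)


def window_seconds_for(rows: int) -> float:
    """Length of a single-channel time window that yields `rows` rows."""
    return rows / SAMPLE_RATE_HZ


print(rows_per_channel_per_day)               # 17280000
# Hours of single-channel data covered by an assumed 3-million-row result.
print(window_seconds_for(3_000_000) / 3600)
```

In other words, one channel produces roughly 17 million rows per day, so a query returning a few million rows spans only a few hours of data for a single channel.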

Are there any recommended ways to organise this? Should I partition my big table into multiple smaller ones which will always fit in memory (this would result in several hundreds or thousands of sub-tables)? Are there any ways to keep the index size to a minimum? At the moment I have a few weeks of data, about 180GB, loaded into a single table and indexed on sample_time and channel and the index takes up 180GB too.

Since this is all for a typically budget-restricted PhD project the hardware is just a high-end desktop workstation with (at the moment) 2*2TB drives organised into a single 4TB partition using FreeBSD's vinum system.



Many thanks for any help,
Asher.
