Enrico Sirola wrote:
Typically, the arrays contain 1000 elements, and an operation is either multiplying an array by a scalar or multiplying it element-by-element with another array. Rescaling 1000 arrays, multiplying each by another array, and finally summing the 1000 resulting arrays should be fast enough for an interactive application (let's say 0.5s), assuming no disk access is required. Disk access will obviously degrade performance a bit at the beginning, but the workload is mostly read-only, so after a while the whole table will be cached anyway. The table containing the arrays would be truncated and repopulated every day, and the number of arrays is expected to be roughly 150000 (at least that's what we have now). Today we have a C++ middleware layer between the calculations and the database, with aggressive caching of the table contents (and we don't use arrays, just one row per element), but the application could be refactored (and simplified a lot) if we had a smart way to store the data in the DB.

I don't know if the speed will meet your needs, but you might test to
see if PL/R will work for you:

  http://www.joeconway.com/plr/

You could use pg.spi.exec() from within the R procedure to grab the
arrays, do all of your processing inside R (which uses whatever BLAS
you've set it up to use), and then return the result out to Postgres.
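For what it's worth, here is a rough, untested sketch of what that could look like. It assumes a table arrays(id int4, vals float8[]) holding the 1000-element arrays and a scales(id int4, factor float8) table with the per-array scale factors; those names are made up. Also, exactly how pg.spi.exec() hands an array column back to R can vary by PL/R version, so the rs$vals[[i]] access below is an assumption you'd want to verify:

  CREATE OR REPLACE FUNCTION scale_mult_sum(other float8[])
  RETURNS float8[] AS '
    # pull every array and its scale factor in one round trip
    rs <- pg.spi.exec("SELECT s.factor, a.vals
                         FROM arrays a JOIN scales s USING (id)")
    acc <- numeric(length(other))
    for (i in seq_len(nrow(rs))) {
      # rescale, multiply element-by-element with the passed-in
      # array, and accumulate the running sum -- plain vectorized
      # R arithmetic; assumes vals arrives as a list of numeric
      # vectors, which depends on the PL/R version
      acc <- acc + rs$factor[i] * rs$vals[[i]] * other
    }
    return(acc)
  ' LANGUAGE plr;

You'd presumably add a WHERE clause to pick out only the ~1000 arrays a given calculation needs, and then call it with something like SELECT scale_mult_sum(ARRAY[...]). Whether pulling that much data through SPI fits in your 0.5s budget is something only a quick benchmark will tell you.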

Joe

