Enrico Sirola wrote:
Hello,
I'd like to perform linear algebra operations on float4/8 arrays. These tasks are tipically carried on using ad hoc optimized libraries (e.g. BLAS). In order to do this, I studied a bit how arrays are stored internally by the DB: from what I understood, arrays are basically a vector of Datum, and floating point numbers are stored by reference into Datums. At a first glance, this seem to close the discussion because in order to perform fast linear algebra operations, you need to store array items in consecutive memory cells. What are the alternatives? Create a new specialized data type for floating point vectors? Basically, the use-case is to be able to rescale, add and multiply (element-by-element)
vectors.

I'm not sure about the internals of PostgreSQL (eg. the Datum object(?) you mention), but if you're just scaling vectors, consecutive memory addresses shouldn't be absolutely necessary. Add and multiply operations within a linked list (which is how I'm naively assuming Datum storage for arrays in memory is implemented) will be "roughly" just as fast.

How many scaling operations are you planning to execute per second, and how many elements do you scale per operation?

Colin

---------------------------(end of broadcast)---------------------------
TIP 2: Don't 'kill -9' the postmaster

Reply via email to