Re: Parallel threads in query

Konstantin Knizhnik Thu, 01 Nov 2018 00:20:28 -0700



On 31.10.2018 22:07, Darafei "Komяpa" Praliaskouski wrote:

Hi,
I've tried porting some of PostGIS algorithms to utilize multiplecores via OpenMP to return faster.
Question is, what's the best policy to allocate cores so we can playnice with rest of postgres?
What I'd like to see is some function that I can call and get a numberof threads I'm allowed to run, that will also advise rest of postgresto not use them, and a function to return the cores back (or do itautomatically at the end of query). Is there an infrastructure for that?

I do not completely understand which PostGIS algorithms you are goingto make parallel.

So may be you should first clarify it.

There are three options to perform parallel execution of the singlequery in Postgres now:

1. Use existed Postgres parallel capabilities. For example if there issome expensive function f() which you are going to execute concurrently,then you do not need to do anything: parallel seq scan will do it foryou. You can configure arbitrary number of parallel workers and socontrol level of concurrency.The restriction of the current Postgres parallel query processingimplementation is that

- parallel workers are started for each query

- it is necessary to serialize and pass to parallel workers a lot ofthings from coordinator- in case of seqscan, workers will compete for pages to scan, soeffective number of workers should be < 10, while most powerful modernservers have hundreds of COU cores.

2. Implement you own parallel plan nodes using existed Postgres parallelinfrastructure. Such approach has most chances to be committed inPostgres core.But disadvantages are mostly the same as in 1) Exchange of data betweendifferent process is much more complex and expensive than access tocommon memory in case of threads. Mostly likely you will have to useshared message queue and dynamic shared memory, implemented in Postgresspecially for interaction of parallel workers .

3. Use multithreading to provide concurrent execution of your particularalgorithm (s[awn threads within backend). You should be very carefulwith this approach, because Postgres code is not thread safe. So youshould not try to execute in thread any subplan or call any postgresfunctions (unless you are 100% sure that them are thread safe).This approach may be easy to implement and provide better performancethan 1). But please notice its limitations. I have used such approach inmy IMCS extension (In-Memory-Columnar-Store).

You can look at pg_strom extension as an example of providing parallelquery execution (in this case using parallel capabilities of video cards).


--

Konstantin Knizhnik
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company

Re: Parallel threads in query

Reply via email to