Let's make PostgreSQL multi-threaded

Heikki Linnakangas Mon, 05 Jun 2023 07:52:23 -0700

I spoke with some folks at PGCon about making PostgreSQL multi-threaded,so that the whole server runs in a single process, with multiplethreads. It has been discussed many times in the past, last thread onpgsql-hackers was back in 2017 when Konstantin made some experiments [0].

I feel that there is now pretty strong consensus that it would be a goodthing, more so than before. Lots of work to get there, and lots ofdetails to be hashed out, but no objections to the idea at a high level.

The purpose of this email is to make that silent consensus explicit. Ifyou have objections to switching from the current multi-processarchitecture to a single-process, multi-threaded architecture, pleasespeak up.

If there are no major objections, I'm going to update the developer FAQ,removing the excuses there for why we don't use threads [1]. And we canstart to talk about the path to get there. Below is a list of somehurdles and proposed high-level solutions. This isn't an exhaustivelist, just some of the most obvious problems:


# Transition period

The transition surely cannot be done fully in one release. Even if wecould pull it off in core, extensions will need more time to adapt.There will be a transition period of at least one release, probablymore, where you can choose multi-process or multi-thread model using aGUC. Depending on how it goes, we can document it as experimental at first.


# Thread per connection

To get started, it's most straightforward to have one thread perconnection, just replacing backend process with a backend thread. In thefuture, we might want to have a thread pool with some kind of ascheduler to assign active queries to worker threads. Or multiplethreads per connection, or spawn additional helper threads for specifictasks. But that's future work.


# Global variables

We have a lot of global and static variables:

$ objdump -t bin/postgres | grep -e "\.data" -e "\.bss" | grep -v"data.rel.ro" | wc -l

Some of them are pointers to shared memory structures and can stay asthey are. But many of them are per-connection state. The moststraightforward conversion for those is to turn them into thread-localvariables, like Konstantin did in [0].

It might be good to have some kind of a Session context struct that wepass everywhere, or maybe have a single thread-local variable to holdit. Many of the global variables would become fields in the Session. Butthat's future work.


# Extensions

A lot of extensions also contain global variables or other things thatbreak in a multi-threaded environment. We need a way to label extensionsthat support multi-threading. And in the future, also extensions that*require* a multi-threaded server.

Let's add flags to the control file to mark if the extension isthread-safe and/or process-safe. If you try to load an extension that'snot compatible with the server's mode, throw an error.

We might need new functions in addition _PG_init, called at connectionstartup and shutdown. And background worker API probably needs some changes.


# Exposed PIDs

We expose backend process PIDs to users in a few places.pg_stat_activity.pid and pg_terminate_backend(), for example. They needto be replaced, or we can assign a fake PID to each connection whenrunning in multi-threaded mode.


# Signals

We use signals for communication between backends. SIGURG in latches,and SIGUSR1 in procsignal, for example. Those primitives need to berewritten with some other signalling mechanism in multi-threaded mode.In principle, it's possible to set per-thread signal handlers, and senda signal to a particular thread (pthread_kill), but I think it's betterto just rewrite them.

We also document that you can send SIGINT, SIGTERM or SIGHUP to anindividual backend process. I think we need to deprecate that, and maybecome up with some convenient replacement. E.g. send a message withbackend ID to a unix domain socket, and a new pg_kill executable to sendthose messages.


# Restart on crash

If a backend process crashes, postmaster terminates all other backendsand restarts the system. That's hard (impossible?) to do safely ifeverything runs in one process. We can continue have a separatepostmaster process that just monitors the main process and restarts iton crash.


# Thread-safe libraries

Need to switch to thread-safe versions of library functions, e.g.uselocale() instead of setlocale().

The Python interpreter has a Global Interpreter Lock. It's not possibleto create two completely independent Python interpreters in the sameprocess, there will be some lock contention on the GIL. Fortunately, thepython community just accepted https://peps.python.org/pep-0684/. That'sexactly what we need: it makes it possible for separate interpreters tohave their own GILs. It's not clear to me if that's in Python 3.12already, or under development for some future version, but by the timewe make the switch in Postgres, there probably will be a solution incpython.

At a quick glance, I think perl and TCL are fine, you can have multipleinterpreters in one process. Need to check any other libraries we use.

[0]https://www.postgresql.org/message-id/flat/9defcb14-a918-13fe-4b80-a0b02ff85527%40postgrespro.ru

[1]https://wiki.postgresql.org/wiki/Developer_FAQ#Why_don.27t_you_use_raw_devices.2C_async-I.2FO.2C_.3Cinsert_your_favorite_wizz-bang_feature_here.3E.3F


--
Heikki Linnakangas
Neon (https://neon.tech)

Let's make PostgreSQL multi-threaded

Reply via email to