On 2021/01/27 14:08, Masahiko Sawada wrote:
On Wed, Jan 27, 2021 at 10:29 AM Fujii Masao
<masao.fu...@oss.nttdata.com> wrote:
You fixed some issues. But maybe you forgot to attach the latest patches?
Yes, I've attached the updated patches.
Thanks for updating the patch! I tried to review 0001 and 0002 as the
self-contained change.
+ * An FDW that implements both commit and rollback APIs can request to register
+ * the foreign transaction by FdwXactRegisterXact() to participate it to a
+ * group of distributed tranasction. The registered foreign transactions are
+ * identified by OIDs of server and user.
I'm afraid that the combination of OIDs of server and user is not unique. IOW,
more than one foreign transactions can have the same combination of OIDs of
server and user. For example, the following two SELECT queries start the
different foreign transactions but their user OID is the same. OID of user
mapping should be used instead of OID of user?
CREATE SERVER loopback FOREIGN DATA WRAPPER postgres_fdw;
CREATE USER MAPPING FOR postgres SERVER loopback OPTIONS (user 'postgres');
CREATE USER MAPPING FOR public SERVER loopback OPTIONS (user 'postgres');
CREATE TABLE t(i int);
CREATE FOREIGN TABLE ft(i int) SERVER loopback OPTIONS (table_name 't');
BEGIN;
SELECT * FROM ft;
DROP USER MAPPING FOR postgres SERVER loopback ;
SELECT * FROM ft;
COMMIT;
+ /* Commit foreign transactions if any */
+ AtEOXact_FdwXact(true);
Don't we need to pass XACT_EVENT_PARALLEL_PRE_COMMIT or XACT_EVENT_PRE_COMMIT
flag? Probably we don't need to do this if postgres_fdw is only user of this
new API. But if we make this new API generic one, such flags seem necessary so
that some foreign data wrappers might have different behaviors for those flags.
Because of the same reason as above, AtEOXact_FdwXact() should also be called
after CallXactCallbacks(is_parallel_worker ? XACT_EVENT_PARALLEL_COMMIT :
XACT_EVENT_COMMIT)?
+ /*
+ * Abort foreign transactions if any. This needs to be done before
marking
+ * this transaction as not running since FDW's transaction callbacks
might
+ * assume this transaction is still in progress.
+ */
+ AtEOXact_FdwXact(false);
Same as above.
+/*
+ * This function is called at PREPARE TRANSACTION. Since we don't support
+ * preparing foreign transactions yet, raise an error if the local transaction
+ * has any foreign transaction.
+ */
+void
+AtPrepare_FdwXact(void)
+{
+ if (FdwXactParticipants != NIL)
+ ereport(ERROR,
+ (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
+ errmsg("cannot PREPARE a transaction that has
operated on foreign tables")));
+}
This means that some foreign data wrappers suppporting the prepare transaction
(though I'm not sure if such wappers actually exist or not) cannot use the new
API? If we want to allow those wrappers to use new API, AtPrepare_FdwXact()
should call the prepare callback and each wrapper should emit an error within
the callback if necessary.
+ foreach(lc, FdwXactParticipants)
+ {
+ FdwXactParticipant *fdw_part = (FdwXactParticipant *)
lfirst(lc);
+
+ if (fdw_part->server->serverid == serverid &&
+ fdw_part->usermapping->userid == userid)
Isn't this ineffecient when starting lots of foreign transactions because we
need to scan all the entries in the list every time?
+static ConnCacheEntry *
+GetConnectionCacheEntry(Oid umid)
+{
+ bool found;
+ ConnCacheEntry *entry;
+ ConnCacheKey key;
+
+ /* First time through, initialize connection cache hashtable */
+ if (ConnectionHash == NULL)
+ {
+ HASHCTL ctl;
+
+ ctl.keysize = sizeof(ConnCacheKey);
+ ctl.entrysize = sizeof(ConnCacheEntry);
+ ConnectionHash = hash_create("postgres_fdw connections", 8,
+ &ctl,
+
HASH_ELEM | HASH_BLOBS);
Currently ConnectionHash is created under TopMemoryContext. With the patch,
since GetConnectionCacheEntry() can be called in other places, ConnectionHash
may be created under the memory context other than TopMemoryContext? If so,
that's safe?
- if (PQstatus(entry->conn) != CONNECTION_OK ||
- PQtransactionStatus(entry->conn) != PQTRANS_IDLE ||
- entry->changing_xact_state ||
- entry->invalidated)
...
+ if (PQstatus(entry->conn) != CONNECTION_OK ||
+ PQtransactionStatus(entry->conn) != PQTRANS_IDLE ||
+ entry->changing_xact_state)
Why did you get rid of the condition "entry->invalidated"?
I'm reading 0001 and 0002 patches to pick up the changes for postgres_fdw that
worth applying independent from 2PC feature. If there are such changes, IMO we
can apply them in advance, and which would make the patches simpler.
Thank you for reviewing the patches!
+ if (PQresultStatus(res) != PGRES_COMMAND_OK)
+ ereport(ERROR, (errmsg("could not commit transaction on server
%s",
+
frstate->server->servername)));
You changed the code this way because you want to include the server name in
the error message? I agree that it's helpful to report also the server name
that caused an error. OTOH, since this change gets rid of call to
pgfdw_rerport_error() for the returned PGresult, the reported error message
contains less information. If this understanding is right, I don't think that
this change is an improvement.
Right. It's better to use do_sql_command() instead.
Instead, if the server name should be included in the error message,
pgfdw_report_error() should be changed so that it also reports the server name?
If we do that, the server name is reported not only when COMMIT fails but also
when other commands fail.
Of course, if this change is not essential, we can skip doing this in the first
version.
Yes, I think it's not essential for now. We can improve it later if we want.
- /*
- * Regardless of the event type, we can now mark ourselves as out of the
- * transaction. (Note: if we are here during PRE_COMMIT or PRE_PREPARE,
- * this saves a useless scan of the hashtable during COMMIT or PREPARE.)
- */
- xact_got_connection = false;
With this change, xact_got_connection seems to never be set to false. Doesn't
this break pgfdw_subxact_callback() using xact_got_connection?
I think xact_got_connection is set to false in
pgfdw_cleanup_after_transaction() that is called at the end of each
foreign transaction (i.g., in postgresCommitForeignTransaction() and
postgresRollbackForeignTransaction()).
But as you're concerned below, it's reset for each foreign transaction
end rather than the parent's transaction end.
+ /* Also reset cursor numbering for next transaction */
+ cursor_number = 0;
Originally this variable is reset to 0 once per transaction end. But with the
patch, it's reset to 0 every time when a foreign transaction ends at each
connection. This change would be harmless fortunately in practice, but seems
not right theoretically.
This makes me wonder if new FDW API is not good at handling the case where some
operations need to be performed once per transaction end.
I think that the problem comes from the fact that FDW needs to use
both SubXactCallback and new FDW API.
If we want to perform some operations at the end of the top
transaction per FDW, not per foreign transaction, we will either still
need to use XactCallback or need to rethink the FDW API design. But
given that we call commit and rollback FDW API for only foreign
servers that actually started a transaction, I’m not sure if there are
such operations in practice. IIUC there is not at least from the
normal (not-sub) transaction termination perspective.
One feature in my mind that may not match with this new API is to perform
transaction commits on multiple servers in parallel. That's something like the
following. As far as I can recall, another proposed version of 2pc on
postgres_fdw patch included that feature. If we want to implement this to
increase the performance of transaction commit in the future, I'm afraid that
new API will prevent that.
foreach(foreign transactions)
send commit command
foreach(foreign transactions)
wait for reply of commit
On second thought, new per-transaction commit/rollback callback is essential
when users or the resolver process want to resolve the specifed foreign
transaction, but not essential when backends commit/rollback foreign
transactions. That is, even if we add per-transaction new API for users and
resolver process, backends can still use CallXactCallbacks() when they
commit/rollback foreign transactions. Is this understanding right?
IIUC xact_got_transaction is used to skip iterating over all cached
connections to find open remote (sub) transactions. This is not
necessary anymore at least from the normal transaction termination
perspective. So maybe we can improve it so that it tracks whether any
of the cached connections opened a subtransaction. That is, we set it
true when we created a savepoint on any connections and set it false
at the end of pgfdw_subxact_callback() if we see that xact_depth of
all cached entry is less than or equal to 1 after iterating over all
entries.
OK.
Regarding cursor_number, it essentially needs to be unique at least
within a transaction so we can manage it per transaction or per
connection. But the current postgres_fdw rather ensure uniqueness
across all connections. So it seems to me that this can be fixed by
making individual connection have cursor_number and resetting it in
pgfdw_cleanup_after_transaction(). I think this can be in a separate
patch.
Maybe, so let's work on this later, at least after we confirm that
this change is really necessary.
Regards,
--
Fujii Masao
Advanced Computing Technology Center
Research and Development Headquarters
NTT DATA CORPORATION