Re: pgsql: Refactor libpq state machine for negotiating encryption

2024-04-25 Thread Heikki Linnakangas

On 24/04/2024 15:57, Peter Eisentraut wrote:

On 08.04.24 03:25, Heikki Linnakangas wrote:

Refactor libpq state machine for negotiating encryption

This fixes the few corner cases noted in commit 705843d294, as shown
by the changes in the test.


Either this or something nearby appears to have broken the error
reporting from psql or libpq when failing to get an SSL connection:

PG16:

psql 'sslmode=require host=localhost'
psql: error: connection to server at "localhost" (::1), port 65432
failed: server does not support SSL, but SSL was required

master:

psql 'sslmode=require host=localhost'
psql: error: connection to server at "localhost" (::1), port 65432 failed:

(sic, the output ends after the colon).

This commit removed that detail error message string from the code, but
I don't see any similar message that would take its place.


Thanks, attached patch puts back those messages.

This makes one change in the error message behavior compared to v16, in 
the case that the server responds to GSSRequest with an error instead of 
rejecting it with 'N'. Previously, libpq would hide the error that the 
server sent in that case, assuming that it was because the server is an 
old pre-v12 version that doesn't support GSS and doesn't understand the 
GSSRequest message. A v11 server responds with a "FATAL: unsupported 
frontend protocol 1234.5680: server supports 2.0 to 3.0" error if you 
try to connect to it with GSS. That was a reasonable assumption when the 
feature was introduced, but v12 was released a long time ago and I don't 
think it's the most probable cause anymore. The attached patch changes 
things so that libpq prints the error message that the server sent in 
that case, making the "server responds with error to GSSRequest" case 
behave the same as the "server responds with error to SSLRequest" case. 
We could keep the old behavior if we wanted to, but this is the most 
convenient way to handle this in the new libpq code, and makes sense 
anyway IMHO.



While testing this some more, I noticed this existing case with stable 
versions:


psql "host=enc-test-localhost.postgresql.example.com hostaddr=127.0.0.1 
sslmode=disable gssencmode=require"

psql: error: connection to server at "127.0.0.1", port 5432 failed:

In the server log:

024-04-25 16:38:59.372 EEST [3939163] LOG:  could not accept GSSAPI 
security context
2024-04-25 16:38:59.372 EEST [3939163] DETAIL:  No credentials were 
supplied, or the credentials were unavailable or inaccessible: Key table 
entry not found


I didn't have everything configured correctly, which is why it failed. 
That's fine. But the psql error message is missing here too.


The attached patch does not fix that case. I think the correct fix would 
go somewhere in pqsecure_open_gss(). Whenever pqsecure_open_gss() 
returns PGRES_POLLING_FAILED, it should also put an error in the error 
buffer with libpq_append_conn_error(). The corresponding 
pgtls_open_client() function for TLS does that, and so does 
pqsecure_open_gss() for many other cases, but apparently not that one.


--
Heikki Linnakangas
Neon (https://neon.tech)
From fd3261a5b78dae6bcfdafe86d14736da43befbbe Mon Sep 17 00:00:00 2001
From: Heikki Linnakangas 
Date: Thu, 25 Apr 2024 14:53:29 +0300
Subject: [PATCH 1/1] libpq: Fix error messages when server rejects SSL or GSS

These messages were lost in commit 05fd30c0e7. Put them back.

This makes one change in the error message behavior compared to v16,
in the case that the server responds to GSSRequest with an error
instead of rejecting it with 'N'. Previously, libpq would hide the
error that the server sent, assuming that you got the error because
the server is an old pre-v12 version that doesn't support GSS and
doesn't understand the GSSRequest message. A v11 server sends a
"FATAL: unsupported frontend protocol 1234.5680: server supports 2.0
to 3.0" error if you try to connect to it with GSS. That was a
reasonable assumption when the feature was introduced, but v12 was
released a long time ago and I don't think it's the most probable
cause anymore. The attached patch changes things so that libpq prints
the error message that the server sent in that case, making the
"server responds with error to GSSRequest" case behave the same as the
"server responds with error to SSLRequest" case.

Reported-by: Peter Eisentraut
Discussion: https://www.postgresql.org/message-id/bb3b94da-afc7-438d-8940-cb946e553...@eisentraut.org
---
 src/interfaces/libpq/fe-connect.c | 49 ++-
 1 file changed, 35 insertions(+), 14 deletions(-)

diff --git a/src/interfaces/libpq/fe-connect.c b/src/interfaces/libpq/fe-connect.c
index ec20e3f3a9..4a4d0b9e80 100644
--- a/src/interfaces/libpq/fe-connect.c
+++ b/src/interfaces/libpq/fe-connect.c
@@ -2861,12 +2861,17 @@ keep_going:		/* We will come back to here until there is
 		need_new_connection = false;
 	}
 
-	/* Decide what to do next, if SSL or GSS negotiation fails */
-#define ENCRYPTION_NEGOTIATION_FAILED() \

Re: pgsql: Refactor libpq state machine for negotiating encryption

2024-04-24 Thread Peter Eisentraut

On 08.04.24 03:25, Heikki Linnakangas wrote:

Refactor libpq state machine for negotiating encryption

This fixes the few corner cases noted in commit 705843d294, as shown
by the changes in the test.


Either this or something nearby appears to have broken the error 
reporting from psql or libpq when failing to get an SSL connection:


PG16:

psql 'sslmode=require host=localhost'
psql: error: connection to server at "localhost" (::1), port 65432 
failed: server does not support SSL, but SSL was required


master:

psql 'sslmode=require host=localhost'
psql: error: connection to server at "localhost" (::1), port 65432 failed:

(sic, the output ends after the colon).

This commit removed that detail error message string from the code, but 
I don't see any similar message that would take its place.






Re: pgsql: Refactor libpq state machine for negotiating encryption

2024-04-12 Thread Heikki Linnakangas

On 11/04/2024 20:07, Heikki Linnakangas wrote:

On 11/04/2024 02:33, Thomas Munro wrote:

On Thu, Apr 11, 2024 at 11:25 AM Tom Lane  wrote:

Thomas Munro  writes:

If -Dssl=none and -Dgssapi=disabled, compilation of fe-connect.c
fails: call to undeclared function 'encryption_negotiation_failed'.  I
didn't look too hard, but maybe ENABLE_GSS and USE_GSS are confused?


For me, configure --with-gssapi fails like that, but the other three
combinations of --with-openssl and --with-gssapi compile OK.  I don't
find it terribly surprising that the buildfarm isn't covering that
combination ...


Oops, right, correction to my report: it is indeed -Dssl=none
-Dgssapi=enabled that is broken, not the other combinations.


Yes, I misspelled ENABLE_GSS as USE_GSS.

After fixing that, the new tests are failing; the expected output for
many of the cases is different when GSSAPI support is not compiled in. I
think the test tables need to be rearranged some more to take that into
account, or we will end up with a ridiculous amount of different
expected outputs.

I will take a closer look at that tomorrow. As a bandaid fix, we could
temporarily disable the new tests with that combination of configure
options, it's still better test coverage than not having the tests at
all. But given that no buildfarm members are testing that combination I
think it can wait a day.


Fixed the compilation with that combination, and the expected test 
output. Thanks for the report!


--
Heikki Linnakangas
Neon (https://neon.tech)





Re: pgsql: Refactor libpq state machine for negotiating encryption

2024-04-11 Thread Heikki Linnakangas

Thanks for the report!

On 11/04/2024 02:33, Thomas Munro wrote:

On Thu, Apr 11, 2024 at 11:25 AM Tom Lane  wrote:

Thomas Munro  writes:

If -Dssl=none and -Dgssapi=disabled, compilation of fe-connect.c
fails: call to undeclared function 'encryption_negotiation_failed'.  I
didn't look too hard, but maybe ENABLE_GSS and USE_GSS are confused?


For me, configure --with-gssapi fails like that, but the other three
combinations of --with-openssl and --with-gssapi compile OK.  I don't
find it terribly surprising that the buildfarm isn't covering that
combination ...


Oops, right, correction to my report: it is indeed -Dssl=none
-Dgssapi=enabled that is broken, not the other combinations.


Yes, I misspelled ENABLE_GSS as USE_GSS.

After fixing that, the new tests are failing; the expected output for 
many of the cases is different when GSSAPI support is not compiled in. I 
think the test tables need to be rearranged some more to take that into 
account, or we will end up with a ridiculous amount of different 
expected outputs.


I will take a closer look at that tomorrow. As a bandaid fix, we could 
temporarily disable the new tests with that combination of configure 
options, it's still better test coverage than not having the tests at 
all. But given that no buildfarm members are testing that combination I 
think it can wait a day.


--
Heikki Linnakangas
Neon (https://neon.tech)





Re: pgsql: Refactor libpq state machine for negotiating encryption

2024-04-10 Thread Thomas Munro
On Thu, Apr 11, 2024 at 11:25 AM Tom Lane  wrote:
> Thomas Munro  writes:
> > If -Dssl=none and -Dgssapi=disabled, compilation of fe-connect.c
> > fails: call to undeclared function 'encryption_negotiation_failed'.  I
> > didn't look too hard, but maybe ENABLE_GSS and USE_GSS are confused?
>
> For me, configure --with-gssapi fails like that, but the other three
> combinations of --with-openssl and --with-gssapi compile OK.  I don't
> find it terribly surprising that the buildfarm isn't covering that
> combination ...

Oops, right, correction to my report: it is indeed -Dssl=none
-Dgssapi=enabled that is broken, not the other combinations.




Re: pgsql: Refactor libpq state machine for negotiating encryption

2024-04-10 Thread Tom Lane
Thomas Munro  writes:
> If -Dssl=none and -Dgssapi=disabled, compilation of fe-connect.c
> fails: call to undeclared function 'encryption_negotiation_failed'.  I
> didn't look too hard, but maybe ENABLE_GSS and USE_GSS are confused?

For me, configure --with-gssapi fails like that, but the other three
combinations of --with-openssl and --with-gssapi compile OK.  I don't
find it terribly surprising that the buildfarm isn't covering that
combination ...

regards, tom lane




Re: pgsql: Refactor libpq state machine for negotiating encryption

2024-04-10 Thread Thomas Munro
On Mon, Apr 8, 2024 at 1:25 PM Heikki Linnakangas
 wrote:
> Refactor libpq state machine for negotiating encryption
>
> This fixes the few corner cases noted in commit 705843d294, as shown
> by the changes in the test.
>
> Author: Heikki Linnakangas, Matthias van de Meent
> Reviewed-by: Jacob Champion

If -Dssl=none and -Dgssapi=disabled, compilation of fe-connect.c
fails: call to undeclared function 'encryption_negotiation_failed'.  I
didn't look too hard, but maybe ENABLE_GSS and USE_GSS are confused?




pgsql: Refactor libpq state machine for negotiating encryption

2024-04-07 Thread Heikki Linnakangas
Refactor libpq state machine for negotiating encryption

This fixes the few corner cases noted in commit 705843d294, as shown
by the changes in the test.

Author: Heikki Linnakangas, Matthias van de Meent
Reviewed-by: Jacob Champion

Branch
--
master

Details
---
https://git.postgresql.org/pg/commitdiff/05fd30c0e730bd5238f62d2fdfdcfaf28b16b225

Modified Files
--
src/interfaces/libpq/fe-connect.c  | 427 -
src/interfaces/libpq/libpq-int.h   |  14 +-
.../libpq_encryption/t/001_negotiate_encryption.pl |  26 +-
3 files changed, 265 insertions(+), 202 deletions(-)