> On Feb 10, 2026, at 08:03, Rahila Syed <[email protected]> wrote:
> 
> Hi,
> 
> Please find attached rebased and updated patches that include a few fixes
> suggested by Daniel.
> 
> Fixes include
> 1. Changed LW_SHARED to LW_EXCLUSIVE when resetting client_keys.
> 2. Replaced calls to MemoryContextReset with MemoryContextDelete since a
> new memory context is created each time.
> 3. Removed leftover code from earlier versions in ipci.c 
> 4. Fix the TAP test to compare strings correctly.
> 5. Added more comments in the TAP test.
> 
> Thank you,
> Rahila Syed
> <v50-0001-Add-function-to-report-memory-context-statistics.patch><v50-0002-Test-module-to-test-memory-context-reporting-wit.patch>

Hi Rahila,

Thanks for the patch. I applied v50 locally and played with it a little bit 
then I reviewed v50. Basically I think it’s convenient to query a server 
process’ memory usage via a SQL statement.

Here comes my review comments.

1 - 0001- mcxt.c
```
+/*
+ * MemoryContextStatsCounter
+ *
+ * Accumulate statistics counts into *totals. totals should not be NULL.
+ * This involves a non-recursive tree traversal.
+ */
+void
+MemoryContextStatsCounter(MemoryContext context, MemoryContextCounters *totals,
+                                                 int *num_contexts)
+{
+       int                     ichild = 1;
+
+       *num_contexts = 0;
+       context->methods->stats(context, NULL, NULL, totals, false);
```

As the header comment says that “totals should not be NULL”, maybe add 
Assert(total!=NULL) to ensure that.

2 - 0001- mcxt.c
```
+       *num_contexts = 0;
+       context->methods->stats(context, NULL, NULL, totals, false);
+
+       for (MemoryContext curr = context->firstchild;
+                curr != NULL;
+                curr = MemoryContextTraverseNext(curr, context))
+       {
+               curr->methods->stats(curr, NULL, NULL, totals, false);
+               ichild++;
+       }
+
+       /*
+        * Add the count of all the children contexts which are traversed
+        * including the parent.
+        */
+       *num_contexts = *num_contexts + ichild;
+}
```

*num_contexts is only initialized to 0, then *num_contexts = *num_contexts + 
ichild. Looks like the the initialization is unnecessary, and the assignment 
can just be *num_contexts = ichild.

3 - 0001 - mcxtfuncs.c
```
+typedef struct MemoryStatsEntry
+{
+       char            name[MEMORY_CONTEXT_NAME_SHMEM_SIZE];
+       char            ident[MEMORY_CONTEXT_IDENT_SHMEM_SIZE];
+       int                     path[100];
```

You already defined a constant MAX_PATH_DISPLAY_LENGTH below, would it make 
sense to pull up the macro definition and use it for “path” definition?

4 - 0001 - mctxfuncs.c
```
+/*
+ * Per backend dynamic shared hash entry for memory context statistics
+ * reporting.
+ */
+typedef struct MemoryStatsDSHashEntry
+{
+       char            key[64];
+       ConditionVariable memcxt_cv;
+       bool            stats_written;
+       int                     target_server_id;
+       int                     total_stats;
+       bool            summary;
+       dsa_pointer memstats_dsa_pointer;
+} MemoryStatsDSHashEntry;
+
+static const dshash_parameters memctx_dsh_params = {
+       offsetof(MemoryStatsDSHashEntry, memcxt_cv),
+       sizeof(MemoryStatsDSHashEntry),
+       dshash_strcmp,
+       dshash_strhash,
+       dshash_strcpy
+};
```

I wonder why we cannot just use the integer ProcNumber as the hash key? I think 
dshash fully supports fixed-size binary keys. We can use dshash_memcmp and 
dshash_memhash for compare and hash functions.

5 - 0001 - mctxfuncs.c
```
+static List *
+compute_context_path(MemoryContext c, HTAB *context_id_lookup)
+{
+       bool            found;
+       List       *path = NIL;
+       MemoryContext cur_context;
+
+       for (cur_context = c; cur_context != NULL; cur_context = 
cur_context->parent)
+       {
+               MemoryContextId *cur_entry;
+
+               cur_entry = hash_search(context_id_lookup, &cur_context, 
HASH_FIND, &found);
+
+               if (!found)
+               {
+                       elog(NOTICE, "hash table corrupted, can't construct 
path value");
+                       return NIL;
+               }
+               path = lcons_int(cur_entry->context_id, path);
+       }
+       return path;
+}
```

As in the hash entry, path is a 100-elements array, would it make sense to also 
limit the path list to not append more than 100 items in this function?

6 - 0001 - mctxfuncs.c
```
+       entry->stats_written = true;
+       dshash_release_lock(MemoryStatsDsHash, entry);
+       hash_destroy(context_id_lookup);
+
+       MemoryContextSwitchTo(oldcontext);
+       MemoryContextDelete(memstats_ctx);
+       /* Notify waiting client backend and return */
+       ConditionVariableSignal(&entry->memcxt_cv);
+}
```

This looks like a race condition. ConditionVariableSignal(&entry->memcxt_cv); 
is called after the lock is released. So, there is a chance that a process has 
been terminated, and its before_shmem_exit callback acquired the entry lock and 
deleted the entry from the hash. So, I think we should send the signal before 
releasing the lock.

7 - 0001 - mctxfuncs.c
```
+               /*
+                * Wait for MEMORY_STATS_MAX_TIMEOUT. If no statistics are 
available
+                * within the allowed time then return NULL. The timer is 
defined in
+                * milliseconds since that's what the condition variable sleep 
uses.
+                */
+               if (ConditionVariableTimedSleep(&entry->memcxt_cv,
+                                                                               
(MEMORY_STATS_MAX_TIMEOUT * 1000),
+                                                                               
WAIT_EVENT_MEM_CXT_PUBLISH))
```

For the wait loop, if the condition wakes up for some other reason, it will 
loop back and wait for the other 5 seconds, then total wait period will exceed 
5 seconds, maybe 9 seconds. This is not a big deal, but the doc explicitly says 
“If the process does not respond with memory contexts statistics in 5 seconds”, 
so that behavior might be inconsistent with the doc.

I think we can calculate remaining time and pass remaining milliseconds into 
ConditionVariableTimedSleep.

8 - 0001 - mctxfuncs.c
```
+               /*
+                * Wait for MEMORY_STATS_MAX_TIMEOUT. If no statistics are 
available
+                * within the allowed time then return NULL. The timer is 
defined in
+                * milliseconds since that's what the condition variable sleep 
uses.
+                */
+               if (ConditionVariableTimedSleep(&entry->memcxt_cv,
+                                                                               
(MEMORY_STATS_MAX_TIMEOUT * 1000),
+                                                                               
WAIT_EVENT_MEM_CXT_PUBLISH))
+               {
+                       /* Timeout has expired, return NULL */
+                       memstats_dsa_cleanup(key);
+                       memstats_client_key_reset(procNumber);
+                       ConditionVariableCancelSleep();
+                       ereport(NOTICE,
+                                       errmsg("request for memory context 
statistics for PID %d timed out",
+                                                  pid));
+                       PG_RETURN_NULL();
+               }
+               entry = dshash_find_or_insert(MemoryStatsDsHash, key, &found);
+               Assert(found);
```

After ConditionVariableTimedSleep, rather than Assert(found), I think we should 
check if (!found). Because it waits for 5 seconds without holding the lock, the 
process may terminate and delete the entry during the period.

9 - 0001 - proc.c
```
diff --git a/src/backend/storage/lmgr/proc.c b/src/backend/storage/lmgr/proc.c
index 8560a903bc8..f68583aa820 100644
--- a/src/backend/storage/lmgr/proc.c
+++ b/src/backend/storage/lmgr/proc.c
@@ -51,6 +51,7 @@
 #include "storage/procsignal.h"
 #include "storage/spin.h"
 #include "storage/standby.h"
+#include "utils/memutils.h"
 #include "utils/timeout.h"
 #include "utils/timestamp.h"
```

This file is sololy added an include without any other change, that seems 
unneeded. I tried to remove this include, and the build still passed.

Best regards,
--
Chao Li (Evan)
HighGo Software Co., Ltd.
https://www.highgo.com/






Reply via email to