> JFYI, the patch does not apply to the head. There is a conflict in
> multiple files.
For review purposes, I applied v8 to the March 6 code-base. I have yet
to review in detail, please find my initial thoughts:
1)
I found that 'inactive_replication_slot_timeout' works only if there
was any walsender ever started for that slot . The logic is under
'am_walsender' check. Is this intentional?
If I create a slot and use only pg_logical_slot_get_changes or
pg_replication_slot_advance on it, it never gets invalidated due to
timeout. While, when I set 'max_slot_xid_age' or say
'max_slot_wal_keep_size' to a lower value, the said slot is
invalidated correctly with 'xid_aged' and 'wal_removed' reasons
respectively.
Example:
With inactive_replication_slot_timeout=1min, test1_3 is the slot for
which there is no walsender and only advance and get_changes SQL
functions were called; test1_4 is the one for which pg_recvlogical was
run for a second.
test1_3 | 785 | | reserved | | t
| |
test1_4 | 798 | | lost | inactive_timeout | t |
2024-03-13 11:52:41.58446+05:30 |
And when inactive_replication_slot_timeout=0 and max_slot_xid_age=10
test1_3 | 785 | | lost | xid_aged | t
| |
test1_4 | 798 | | lost | inactive_timeout | t |
2024-03-13 11:52:41.58446+05:30 |
2)
The msg for patch 3 says:
--------------
a) when replication slots is lying inactive for a day or so using
last_inactive_at metric,
b) when a replication slot is becoming inactive too frequently using
last_inactive_at metric.
--------------
I think in b, you want to refer to inactive_count instead of last_inactive_at?
3)
I do not see invalidation_reason updated for 2 new reasons in system-views.sgml
thanks
Shveta