Re: Changing the state of data checksums in a running cluster
От | Tomas Vondra |
---|---|
Тема | Re: Changing the state of data checksums in a running cluster |
Дата | |
Msg-id | dfe57980-f594-46c5-af39-852ff30d34fa@vondra.me обсуждение исходный текст |
Ответ на | Re: Changing the state of data checksums in a running cluster (Tomas Vondra <tomas@vondra.me>) |
Ответы |
Re: Changing the state of data checksums in a running cluster
|
Список | pgsql-hackers |
On 8/27/25 14:39, Tomas Vondra wrote: > ... > > And this happened on Friday: > > commit c13070a27b63d9ce4850d88a63bf889a6fde26f0 > Author: Alexander Korotkov <akorotkov@postgresql.org> > Date: Fri Aug 22 18:44:39 2025 +0300 > > Revert "Get rid of WALBufMappingLock" > > This reverts commit bc22dc0e0ddc2dcb6043a732415019cc6b6bf683. > It appears that conditional variables are not suitable for use > inside critical sections. If WaitLatch()/WaitEventSetWaitBlock() > face postmaster death, they exit, releasing all locks instead of > PANIC. In certain situations, this leads to data corruption. > > ... > > I think it's very likely the checksums were broken by this. After all, > that linked thread has subject "VM corruption on standby" and I've only > ever seen checksum failures on standby on the _vm fork. > Forgot to mention - I did try with c13070a27b reverted, and with that I can reproduce the checksum failures again (using the fixed TAP test). It's not a definitive proof, but it's a hint c13070a27b63 was causing the checksum failures. regards -- Tomas Vondra
В списке pgsql-hackers по дате отправления: