Spurious standby query cancellations

Поиск

Список

Период

Сортировка

От	Jeff Janes
Тема	Spurious standby query cancellations
Дата	27 августа 2015 г. 21:55:23
Msg-id	CAMkU=1xJUe3DHmtEN6OUNb7-V5hm3nm_9qzCBRDCDVCSHmwWLg@mail.gmail.com обсуждение исходный текст
Ответы	Re: Spurious standby query cancellations
Список	pgsql-hackers

Дерево обсуждения

In ResolveRecoveryConflictWithLock, there is the comment:

* If blowing away everybody with conflicting locks doesn't work, after

* the first two attempts then we just start blowing everybody away until

* it does work.

But what it does is something different than that.

At least under some conditions, it will wait patiently for all initially conflicting locks to go away. If that doesn't work twice (because new lockers joined while we were waiting for the old ones to go away), then it will wait patiently for all transactions throughout the system to go away even if they don't conflict, perhaps not even realizing that the lock has become free in the mean time.

Then when its patience runs out, it kills everything on the system. But it never tried to blow away just the conflicting locks, instead it just tried to wait them out.

The fact that trying to wait them out didn't work (twice) doesn't mean that killing them wouldn't have worked.

I think that it was intended that num_attempts would get incremented only once WaitExceedsMaxStandbyDelay becomes true, but that is not what happens with the current code.

Currently WaitExceedsMaxStandbyDelay only has one caller. I think it we should take the sleep code out of that function and move it into the existing call site, and then have ResolveRecoveryConflictWithLock check with WaitExceedsMaxStandbyDelay before incrementing num_attempts.

Cheers,

Jeff

Вложения

standby_lock_cancel_v1.patch

В списке pgsql-hackers по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Spurious standby query cancellations

Вложения