Re: Raising the checkpoint_timeout limit

Поиск

Список

Период

Сортировка

От	Andres Freund
Тема	Re: Raising the checkpoint_timeout limit
Дата	2 февраля 2016 г. 14:32:08
Msg-id	20160202113200.GU8743@awork2.anarazel.de обсуждение исходный текст
Ответ на	Re: Raising the checkpoint_timeout limit (Simon Riggs <simon@2ndQuadrant.com>)
Список	pgsql-hackers

Дерево обсуждения

On 2016-02-02 11:37:15 +0100, Simon Riggs wrote:
> If people wish to turn off crash recovery, they can already set fsync=off.
> I don't wish to see us support a setting that causes problems for people
> that don't understand what checkpoints are and why everybody needs them.

I don't think fsync=off and very long checkpoints are really
comparable. Many large modern machines, especially with directly
attached storage and/or large amounts of memory, take a *long* while to
boot. So any outage will be dealth with a failover anyway.  But at the
same time, a database in the 10TB+ range can't easily be copied again.
Thus running with fsync=off isn't something that you'd want in those
scenarios - it'd prevent the previous master/other standbys from failing
back/catching up; the databases could be arbitrarily corrupted after
all.

Additionally a significant portion of the cost of checkpoints are full
page writes - you easily can get into the situation where you have
~20MB/sec normal WAL without FPWs, but with them 300MB/s. That rate is
rather expensive, regardless fsync=off.

> The current code needs to act differently with regard to very low settings,
> so when we are a small number of blocks remaining we don't spend hours
> performing them. Allowing very large values would make that even more
> strange.

Why is that a good thing? Every checkpoint triggers a new round of full
page writes. I don't see why you want to accellerate a checkpoint, just
because there's few writes remaining? Yes, the current code partially
behaves that way, but that's imo more an implementation artifact or even
a bug.

> Some systems offer a recovery_time_objective setting that is used to
> control how frequently checkpoints occur. That might be a more usable
> approach.

While desirable, I have no idea to realistically calculate that :(. It's
also a lot bigger than just adjusting a pointlessly low GUC limit.

Regards,

Andres

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Masahiko Sawada
Дата: 02 февраля 2016 г., 14:26:06
Сообщение: Re: Freeze avoidance of very large table.

Следующее

От: Magnus Hagander
Дата: 02 февраля 2016 г., 14:47:37
Сообщение: Re: Add links to commit fests to patch summary page

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Raising the checkpoint_timeout limit

Предыдущее

Следующее