Re: Deriving Recovery Snapshots

Поиск

Список

Период

Сортировка

От	Simon Riggs
Тема	Re: Deriving Recovery Snapshots
Дата	22 октября 2008 г. 08:03:43
Msg-id	1224673546.27145.218.camel@ebony.2ndQuadrant обсуждение исходный текст
Ответ на	Re: Deriving Recovery Snapshots (Heikki Linnakangas <heikki.linnakangas@enterprisedb.com>)
Ответы	Re: Deriving Recovery Snapshots
Список	pgsql-hackers

Дерево обсуждения

On Wed, 2008-10-22 at 12:29 +0300, Heikki Linnakangas wrote:
> Simon Riggs wrote:
> > On Thu, 2008-10-16 at 18:52 +0300, Heikki Linnakangas wrote:
> >> Simon Riggs wrote:
> >>> * The backend slot may not be reused for some time, so we should take
> >>> additional actions to keep state current and true. So we choose to log a
> >>> snapshot from the master into WAL after each checkpoint. This can then
> >>> be used to cleanup any unobserved xids. It also provides us with our
> >>> initial state data, see later.
> >> We don't need to log a complete snapshot, do we? Just oldestxmin should 
> >> be enough.
> > 
> > Possibly, but you're thinking that once we're up and running we can use
> > less info.
> > 
> > Trouble is, you don't know when/if the standby will crash/be shutdown.
> > So we need regular full snapshots to allow it to re-establish full
> > information at regular points. So we may as well drop the whole snapshot
> > to WAL every checkpoint. To do otherwise would mean more code and less
> > flexibility.
> 
> Surely it's less code to write the OldestXmin to the checkpoint record, 
> rather than a full snapshot, no? And to read it off the checkpoint record.

You may be missing my point.

We need an initial state to work from.

I am proposing we write a full snapshot after each checkpoint because it
allows us to start recovery again from that point. If we wrote only
OldestXmin as you suggest it would optimise the size of the WAL record
but it would prevent us from restarting at that point.

Also, passing OldestXmin only would not work in the presence of long
running statements. Passing the snapshot allows us to see that FATAL
errors have occurred much sooner.

BTW, the way I have coded it means that if we skip writing a checkpoint
on a quiet system then we would also skip writing the snapshot.

-- Simon Riggs           www.2ndQuadrant.comPostgreSQL Training, Services and Support

В списке pgsql-hackers по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Deriving Recovery Snapshots