Fsync request queue

From	Andres Freund
Subject	Fsync request queue
Date	April 25, 2018, 00:00:54
Msg-id	20180424180054.inih6bxfspgowjuc@alap3.anarazel.de
Replies	Re: Fsync request queue (Heikki Linnakangas <hlinnaka@iki.fi>)
List	pgsql-hackers

Hi,

While thinking about the fsync mess, I started looking at the
fsync request queue. I was primarily wondering whether we can keep FDs
open long enough (by forwarding them to the checkpointer) to guarantee
that we see the error. But that's mostly irrelevant for what I'm
wondering about here.

The fsync request queue is often fairly large. 20 bytes for each buffer
in shared_buffers isn't a negligible overhead. One reason it needs to be
fairly large is that we do not deduplicate while inserting; we just add
an entry on every single write.

ISTM that using a hashtable sounds saner, because we'd deduplicate on
insert. While that'd require locking, we can relatively easily reduce
the overhead of that by keeping track of something like mdsync_cycle_ctr
in MdfdVec, and only insert again if the cycle was incremented since.
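
A minimal sketch of that idea, not PostgreSQL code: every name below
(PendingTable, SegHandle, register_dirty_segment, drain_table) is
hypothetical. SegHandle stands in for MdfdVec and PendingTable.cycle
for something like mdsync_cycle_ctr. The "lock" is only a counter so
the effect of the fast path is visible; a real implementation would
also have to decide how the cycle counter can be read safely without
the lock.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define TABLE_SLOTS 8   /* tiny fixed-size table, for illustration only */

typedef struct SegKey {
    uint32_t relnode, forknum, segno;
} SegKey;

typedef struct PendingTable {
    SegKey   keys[TABLE_SLOTS];
    bool     used[TABLE_SLOTS];
    int      nused;
    uint32_t cycle;             /* starts at 1, bumped on every drain */
    int      lock_acquisitions; /* stands in for taking the shared lock */
} PendingTable;

typedef struct SegHandle {      /* plays the role of MdfdVec */
    SegKey   key;
    uint32_t registered_cycle;  /* 0 means "never registered" */
} SegHandle;

uint32_t hash_key(const SegKey *k)
{
    return k->relnode * 2654435761u ^ k->forknum * 40503u
         ^ k->segno * 2246822519u;
}

/* Deduplicating insert (linear probing); false only if the table is full. */
bool table_insert(PendingTable *t, const SegKey *k)
{
    uint32_t start = hash_key(k) % TABLE_SLOTS;

    for (uint32_t i = 0; i < TABLE_SLOTS; i++) {
        uint32_t slot = (start + i) % TABLE_SLOTS;

        if (!t->used[slot]) {
            t->keys[slot] = *k;
            t->used[slot] = true;
            t->nused++;
            assert(t->nused <= TABLE_SLOTS);
            return true;
        }
        if (memcmp(&t->keys[slot], k, sizeof(SegKey)) == 0)
            return true;        /* already present: deduplicated */
    }
    return false;
}

/* Called after every write to a segment. */
bool register_dirty_segment(PendingTable *t, SegHandle *seg)
{
    /* Fast path: already registered in this cycle, skip the lock. */
    if (seg->registered_cycle == t->cycle)
        return true;

    t->lock_acquisitions++;
    if (!table_insert(t, &seg->key))
        return false;           /* full: caller needs a fallback */
    seg->registered_cycle = t->cycle;
    return true;
}

/* Checkpointer side: take all pending requests and start a new cycle. */
int drain_table(PendingTable *t)
{
    int n = t->nused;

    memset(t->used, 0, sizeof(t->used));
    t->nused = 0;
    t->cycle++;
    return n;
}
```

With this shape, repeated writes to the same segment within one cycle
touch the table (and the lock) once; the segment is inserted again
only after the cycle counter has moved on.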

Right now, if the queue is full and can't be compacted, we end up
fsync()ing on every single write rather than once per checkpoint,
afaict. That's fairly horrible.

For the case that there's no space in the map, I'd suggest just doing
10% or so of the pending fsyncs in the poor sod of a process that finds
no space. That's surely better than constantly fsyncing on every single
write. We can also make bgwriter check the size of the hashtable on a
regular basis and do some of the fsyncs if it gets too full.
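
A rough sketch of that fallback, under loud assumptions: PendingMap,
remember_fsync, drain_some and trim_if_crowded are made-up names, the
map is a plain array with a linear-scan duplicate check rather than a
hashtable, fake_fsync only counts calls instead of calling fsync(), and
the 10% figure is simply the number suggested above.

```c
#include <assert.h>
#include <stdint.h>

#define MAP_CAPACITY  20
#define DRAIN_PERCENT 10

typedef struct PendingMap {
    uint32_t segs[MAP_CAPACITY]; /* hypothetical ids of segments to fsync */
    int      count;
    int      fsyncs_done;        /* number of simulated fsync calls */
} PendingMap;

/* Placeholder for a real fsync() on the segment's file descriptor. */
void fake_fsync(PendingMap *m, uint32_t seg)
{
    (void) seg;
    m->fsyncs_done++;
}

/* Sync and drop about DRAIN_PERCENT of the entries (at least one). */
int drain_some(PendingMap *m)
{
    int n = m->count * DRAIN_PERCENT / 100;

    if (n < 1 && m->count > 0)
        n = 1;
    for (int i = 0; i < n; i++) {
        fake_fsync(m, m->segs[m->count - 1]);
        m->count--;
    }
    return n;
}

/* Backend side: remember that seg needs an fsync before the checkpoint. */
void remember_fsync(PendingMap *m, uint32_t seg)
{
    for (int i = 0; i < m->count; i++)
        if (m->segs[i] == seg)
            return;              /* already pending */

    if (m->count == MAP_CAPACITY)
        drain_some(m);           /* the unlucky process pays for a few */

    assert(m->count < MAP_CAPACITY);
    m->segs[m->count++] = seg;
}

/* Background side: drain while the map is above the given fill level. */
int trim_if_crowded(PendingMap *m, int high_water_percent)
{
    int done = 0;

    while (m->count * 100 > MAP_CAPACITY * high_water_percent)
        done += drain_some(m);
    return done;
}
```

The process that hits a full map pays for a handful of fsyncs once and
then has room again, instead of fsyncing on each of its subsequent
writes; a periodic trim_if_crowded-style check makes hitting that path
less likely in the first place.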

I think the hashtable also has some advantages for the future. I've
introduced something very similar in my radix tree based buffer mapping.

Greetings,

Andres Freund
