Re: [EXTERNAL] Re: WIP: WAL prefetch (another approach)

Поиск
Список
Период
Сортировка
От Stephen Frost
Тема Re: [EXTERNAL] Re: WIP: WAL prefetch (another approach)
Дата
Msg-id 20200827182642.GO29590@tamriel.snowman.net
обсуждение исходный текст
Ответ на RE: [EXTERNAL] Re: WIP: WAL prefetch (another approach)  (Sait Talha Nisanci <Sait.Nisanci@microsoft.com>)
Ответы Re: [EXTERNAL] Re: WIP: WAL prefetch (another approach)  (Andres Freund <andres@anarazel.de>)
Список pgsql-hackers
Greetings,

* Sait Talha Nisanci (Sait.Nisanci@microsoft.com) wrote:
> OS version is Ubuntu 18.04.5 LTS.
> Filesystem is ext4 and block size is 4KB.

[...]

* Sait Talha Nisanci (Sait.Nisanci@microsoft.com) wrote:
> I have run some benchmarks for this patch. Overall it seems that there is a good improvement with the patch on
recoverytimes: 
>
> The VMs I used have 32GB RAM, pgbench is initialized with a scale factor 3000(so it doesn’t fit to memory, ~45GB).
>
> In order to avoid checkpoints during benchmark, max_wal_size(200GB) and checkpoint_timeout(200 mins) are set to a
highvalue.  
>
> The run is cancelled when there is a reasonable amount of WAL ( > 25GB). The recovery times are measured from the
REDOlogs. 
>
> I have tried combination of SSD, HDD, full_page_writes = on/off and max_io_concurrency = 10/50, the recovery times
areas follows (in seconds): 
>
>                    No prefetch        |     Default prefetch values  |          Default + max_io_concurrency = 50
> SSD, full_page_writes = on    852        301                197
> SSD, full_page_writes = off    1642        1359                1391
> HDD, full_page_writes = on    6027        6345                6390
> HDD, full_page_writes = off    738        275                192
>
> Default prefetch values:
> -    Max_recovery_prefetch_distance = 256KB
> -    Max_io_concurrency = 10
>
> It probably makes sense to compare each row separately as the size of WAL can be different.

Is WAL FPW compression enabled..?  I'm trying to figure out how, given
what's been shared here, that replaying 25GB of WAL is being helped out
by 2.5x thanks to prefetch in the SSD case.  That prefetch is hurting in
the HDD case entirely makes sense to me- we're spending time reading
pages from the HDD, which is entirely pointless work given that we're
just going to write over those pages entirely with FPWs.

Further, if there's 32GB of RAM, and WAL compression isn't enabled and
the WAL is only 25GB, then it's very likely that every page touched by
the WAL ends up in memory (shared buffers or fs cache), and with FPWs we
shouldn't ever need to actually read from the storage to get those
pages, right?  So how is prefetch helping so much..?

I'm not sure that the 'full_page_writes = off' tests are very
interesting in this case, since you're going to get torn pages and
therefore corruption and hopefully no one is running with that
configuration with this OS/filesystem.

Thanks,

Stephen

Вложения

В списке pgsql-hackers по дате отправления:

Предыдущее
От: Sait Talha Nisanci
Дата:
Сообщение: RE: [EXTERNAL] Re: WIP: WAL prefetch (another approach)
Следующее
От: Andres Freund
Дата:
Сообщение: Re: [EXTERNAL] Re: WIP: WAL prefetch (another approach)