Re: Parallel Seq Scan vs kernel read ahead

Поиск

Список

Период

Сортировка

От	Amit Kapila
Тема	Re: Parallel Seq Scan vs kernel read ahead
Дата	20 мая 2020 г. 05:23:28
Msg-id	CAA4eK1Kxb56ty8CSfcX+9SMA1gzfODQQbpAs_EfKi7ZFj4e0zg@mail.gmail.com обсуждение исходный текст
Ответ на	Parallel Seq Scan vs kernel read ahead (Thomas Munro <thomas.munro@gmail.com>)
Ответы	Re: Parallel Seq Scan vs kernel read ahead Re: Parallel Seq Scan vs kernel read ahead
Список	pgsql-hackers

Дерево обсуждения

On Wed, May 20, 2020 at 7:24 AM Thomas Munro <thomas.munro@gmail.com> wrote:
>
> Hello hackers,
>
> Parallel sequential scan relies on the kernel detecting sequential
> access, but we don't make the job easy.  The resulting striding
> pattern works terribly on strict next-block systems like FreeBSD UFS,
> and degrades rapidly when you add too many workers on sliding window
> systems like Linux.
>
> Demonstration using FreeBSD on UFS on a virtual machine, taking ball
> park figures from iostat:
>
>   create table t as select generate_series(1, 200000000)::int i;
>
>   set max_parallel_workers_per_gather = 0;
>   select count(*) from t;
>   -> execution time 13.3s, average read size = ~128kB, ~500MB/s
>
>   set max_parallel_workers_per_gather = 1;
>   select count(*) from t;
>   -> execution time 24.9s, average read size = ~32kB, ~250MB/s
>
> Note the small read size, which means that there was no read
> clustering happening at all: that's the logical block size of this
> filesystem.
>
> That explains some complaints I've heard about PostgreSQL performance
> on that filesystem: parallel query destroys I/O performance.
>
> As a quick experiment, I tried teaching the block allocated to
> allocate ranges of up 64 blocks at a time, ramping up incrementally,
> and ramping down at the end, and I got:
>

Good experiment.  IIRC, we have discussed a similar idea during the
development of this feature but we haven't seen any better results by
allocating in ranges on the systems we have tried.  So, we want with
the current approach which is more granular and seems to allow better
parallelism.  I feel we need to ensure that we don't regress
parallelism in existing cases, otherwise, the idea sounds promising to
me.

-- 
With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Thomas Munro
Дата: 20 мая 2020 г., 04:53:24
Сообщение: Parallel Seq Scan vs kernel read ahead

Следующее

От: Noah Misch
Дата: 20 мая 2020 г., 06:05:00
Сообщение: Re: Problem with pg_atomic_compare_exchange_u64 at 32-bit platformwd

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Parallel Seq Scan vs kernel read ahead

Предыдущее

Следующее