Re: Avoiding hash join batch explosions with extreme skew and weird stats

Поиск

Список

Период

Сортировка

От	David Kimura
Тема	Re: Avoiding hash join batch explosions with extreme skew and weird stats
Дата	4 мая 2020 г. 23:39:36
Msg-id	CAHnPFjQiYN83NjQ4KvjX19Wti==uzyw8D24va56zJKzOt+B51A@mail.gmail.com обсуждение исходный текст
Ответ на	Re: Avoiding hash join batch explosions with extreme skew and weird stats (David Kimura <david.g.kimura@gmail.com>)
Список	pgsql-hackers

Дерево обсуждения

On Wed, Apr 29, 2020 at 4:44 PM David Kimura <david.g.kimura@gmail.com> wrote:
>
> Following patch adds logic to create a batch 0 file for serial hash join so
> that even in pathalogical case we do not need to exceed work_mem.

Updated the patch to spill batch 0 tuples after it is marked as fallback.

A couple questions from looking more at serial code:

1) Does the current pattern to repartition batches *after* the previous
   hashtable insert exceeds work_mem still make sense?

   In that case we'd allow ourselves to exceed work_mem by one tuple. If that
   doesn't seem correct anymore then I think we can move the space exceeded
   check in ExecHashTableInsert() *before* actual hashtable insert.

2) After batch 0 is marked fallback, does the logic to insert into its batch
   file fit more in MultiExecPrivateHash() or ExecHashTableInsert()?

   The latter already has logic to decide whether to insert into hashtable or
   batchfile

Thanks,
David

Вложения

v6-0002-Implement-fallback-of-batch-0-for-serial-adaptive.patch

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Andres Freund
Дата: 04 мая 2020 г., 22:41:45
Сообщение: Re: design for parallel backup

Следующее

От: Tom Lane
Дата: 05 мая 2020 г., 00:22:01
Сообщение: Re: Poll: are people okay with function/operator table redesign?

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Avoiding hash join batch explosions with extreme skew and weird stats

Вложения

Предыдущее

Следующее