Re: Default setting for enable_hashagg_disk

Поиск

Список

Период

Сортировка

От	Jeff Davis
Тема	Re: Default setting for enable_hashagg_disk
Дата	25 июня 2020 г. 17:15:52
Msg-id	2401e86059624be1917fd0fbcd08700f52f2ba12.camel@j-davis.com обсуждение исходный текст
Ответ на	Re: Default setting for enable_hashagg_disk (Robert Haas <robertmhaas@gmail.com>)
Ответы	Re: Default setting for enable_hashagg_disk
Список	pgsql-hackers

Дерево обсуждения

On Thu, 2020-06-25 at 11:46 -0400, Robert Haas wrote:
> Because the first problem is so bad, most people
> set the value relatively conservatively and just live with the
> performance consequences. But this also means that they have memory
> left over most of the time, so the idea of letting a node burst above
> its work_mem allocation when something unexpected happens isn't
> crazy:
> as long as only a few nodes do that here and there, rather than, say,
> all the nodes doing it all at the same time, it's actually fine.

Unexpected things (meaning underestimates) are not independent. All the
queries are based on the same stats, so if you have a lot of similar
queries, they will all get the same underestimate at once, and all be
surprised when they need to spill at once, and then all decide they are
entitled to ignore work_mem at once.

>  If we
> had a smarter system that could dole out more work_mem to nodes that
> would really benefit from it and less to nodes where it isn't likely
> to make much difference, that would be similar in spirit but even
> better.

That sounds more useful and probably not too hard to implement in a
crude form. Just have a shared counter in memory representing GB. If a
node is about to spill, it could try to decrement the counter by N, and
if it succeeds, it gets to exceed work_mem by N more GB.

Regards,
    Jeff Davis

В списке pgsql-hackers по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Default setting for enable_hashagg_disk