Re: Postgresql OOM

Поиск

Список

Период

Сортировка

От	Tom Lane
Тема	Re: Postgresql OOM
Дата	6 июня 23:55:00
Msg-id	1140897.1717707300@sss.pgh.pa.us обсуждение исходный текст
Ответ на	Re: Postgresql OOM (Radu Radutiu <rradutiu@gmail.com>)
Ответы	Re: Postgresql OOM
Список	pgsql-hackers

Дерево обсуждения

Radu Radutiu <rradutiu@gmail.com> writes:
> The query itself runs fine in a reasonable time with enable_parallel_hash =
> 'off'. I see two problems - one is the wrong execution plan (right after
> running analyze), the second and the most important is the huge memory
> usage (far exceeding work_mem and shared buffers) leading to OOM.
> See https://explain.depesz.com/s/yAqS for the explain plan
> with enable_parallel_hash = 'off.

What it looks like to me is that the join key column has very skewed
statistics, such that a large majority of the tuples end up in the
same hash bucket (probably they even all have identical keys).  I think
the memory growth is coming from the executor repeatedly splitting
the buckets in a vain attempt to separate those tuples into multiple
buckets.

The planner should recognize this situation and avoid use of hash
join in such cases, but maybe the statistics aren't reflecting the
problem, or maybe there's something wrong with the logic specific
to parallel hash join.  You've not really provided enough information
to diagnose why the poor choice of plan.

            regards, tom lane

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Peter Eisentraut
Дата: 06 июня, 23:48:36
Сообщение: Re: small fix for llvm build

Следующее

От: Pavel Luzanov
Дата: 07 июня, 00:10:34
Сообщение: Re: Things I don't like about \du's "Attributes" column

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Postgresql OOM

Предыдущее

Следующее