Re: Disk-based hash aggregate's cost model

From: Jeff Davis
Subject: Re: Disk-based hash aggregate's cost model
Date:
Msg-id: 011877614fa1279c97ce6e897ea2f0dc90124483.camel@j-davis.com
In reply to: Re: Disk-based hash aggregate's cost model (Tomas Vondra <tomas.vondra@2ndquadrant.com>)
Responses: Re: Disk-based hash aggregate's cost model
List: pgsql-hackers
On Fri, 2020-09-04 at 14:56 +0200, Tomas Vondra wrote:
> Those charts show that the CP_SMALL_TLIST resulted in smaller temp
> files
> (per EXPLAIN ANALYZE the difference is ~25%) and also lower query
> durations (also in the ~25% range).

I was able to reproduce the problem, thank you.

Only two attributes are needed, so the CP_SMALL_TLIST projected schema
only needs a single-byte null bitmap.

But if we just set the unneeded attributes to NULL rather than
projecting them out, the null bitmap is sized for all 16 attributes,
bumping it to two bytes.

MAXALIGN(23 + 1) = 24
MAXALIGN(23 + 2) = 32

I think that explains it. It's not ideal, but projection has a cost as
well, so I don't think we necessarily need to do something here.
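For anyone following along, here is a tiny standalone sketch of that
arithmetic. The MAXALIGN and BITMAPLEN definitions below are simplified
copies of the PostgreSQL macros (assuming MAXIMUM_ALIGNOF == 8 and the
usual 23-byte tuple header), not the real headers:

#include <stdio.h>

#define MAXIMUM_ALIGNOF  8
#define MAXALIGN(LEN)    (((size_t) (LEN) + (MAXIMUM_ALIGNOF - 1)) & \
                          ~((size_t) (MAXIMUM_ALIGNOF - 1)))
#define BITMAPLEN(NATTS) (((NATTS) + 7) / 8)   /* one bit per attribute */

#define TUPLE_HEADER_SIZE 23   /* assumed header size, as in the numbers above */

int
main(void)
{
    /* projected schema: 2 attributes -> 1-byte null bitmap -> 24 bytes */
    printf("2 atts:  %zu\n", MAXALIGN(TUPLE_HEADER_SIZE + BITMAPLEN(2)));

    /* unprojected schema: 16 attributes -> 2-byte null bitmap -> 32 bytes */
    printf("16 atts: %zu\n", MAXALIGN(TUPLE_HEADER_SIZE + BITMAPLEN(16)));

    return 0;
}

The extra null-bitmap byte pushes the tuple past the 24-byte alignment
boundary, so each spilled tuple rounds up to 32 bytes.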

If we are motivated to improve this in v14, we could potentially have a
different schema for spilled tuples, and perform real projection at
spill time. But I don't know if that's worth the extra complexity.

Regards,
    Jeff Davis




