Re: bad estimation together with large work_mem generates terrible slow hash joins

From: Tomas Vondra
Subject: Re: bad estimation together with large work_mem generates terrible slow hash joins
Date:
Msg-id: 53B1F1A8.8060909@fuzzy.cz
In response to: Re: bad estimation together with large work_mem generates terrible slow hash joins (Tomas Vondra <tv@fuzzy.cz>)
Responses: Re: bad estimation together with large work_mem generates terrible slow hash joins (Atri Sharma <atri.jiit@gmail.com>)
Re: bad estimation together with large work_mem generates terrible slow hash joins (Tomas Vondra <tv@fuzzy.cz>)
List: pgsql-hackers
On 30.6.2014 23:12, Tomas Vondra wrote:
> Hi,
>
> attached is v5 of the patch. The main change is that scaling the number
> of buckets is done only once, after the initial hash table is built. The
> main advantage of this is lower cost. This also allowed some cleanup of
> unnecessary code.
>
> However, this new patch causes warnings like this:
>
>     WARNING:  temporary file leak: File 231 still referenced
>
> I've been eyeballing the code for a while now, but I still fail to see
> where this comes from :-( Any ideas?

Meh, the patches are wrong - I hadn't realized the tight coupling
between buckets/batches in ExecHashGetBucketAndBatch:

  *bucketno = hashvalue & (nbuckets - 1);
  *batchno = (hashvalue >> hashtable->log2_nbuckets) & (nbatch - 1);

The previous patches worked mostly by pure luck (nbuckets was set
only in the first batch), but once I moved the code to the end, it
started failing. And by "worked" I mean "didn't throw an error, but
probably returned bogus results with (nbatch > 1)".

However, ISTM this nbuckets-nbatch coupling is not really necessary;
it's only constructed this way so that batchno and bucketno are taken
from independent bits of the hash. So I went and changed the code like
this:

  *bucketno = hashvalue & (nbuckets - 1);
  *batchno = (hashvalue >> (32 - hashtable->log2_nbatch));

I.e. using the top bits for batchno, low bits for bucketno (as before).

Hopefully I got it right this time. At least it seems to be working for
cases that failed before (no file leaks, proper rowcounts so far).

regards
Tomas

Attachments
