Re: Hash joins vs small-integer join values

Поиск

Список

Период

Сортировка

От	Tom Lane
Тема	Re: Hash joins vs small-integer join values
Дата	1 июня 2007 г. 14:12:45
Msg-id	25013.1180707154@sss.pgh.pa.us обсуждение исходный текст
Ответ на	Re: Hash joins vs small-integer join values (Gregory Stark <stark@enterprisedb.com>)
Список	pgsql-hackers

Дерево обсуждения

Gregory Stark <stark@enterprisedb.com> writes:
> hashany of a 4-byte value degenerates to pretty much just a call to mix().
> Perhaps we should just expose a hash12() that takes three integers and calls
> mix() on them like hash_any does.

I don't see any use for that, but probably there is a use in hash_uint32(x)
that has the same result as hash_any(&x, sizeof(uint32)).

> And that fix you committed a while back to improve the catcache hash function
> made a huge difference. Now I'm wondering if it shouldn't just be invoking
> hash_any() or mix() too.

No, if the hash values are reasonably random it should be fine as-is.
There should not be any need for multiple layers of hash_any calls.
The big problem with the old version of that code (IMHO) was that it
threw away hash bits, which it doesn't anymore.

What I'm wondering is whether we can't actually simplify the callers, if
we put all the burden on the hash functions to produce all-bits-random
results.  In particular hashjoin probably ought to switch to power-of-2
nbuckets to avoid integer divisions ...
        regards, tom lane

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Tom Lane
Дата: 01 июня 2007 г., 13:39:27
Сообщение: Re: Feature freeze status report

Следующее

От: Alvaro Herrera
Дата: 01 июня 2007 г., 15:57:49
Сообщение: Re: Autovacuum versus rolled-back transactions

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Hash joins vs small-integer join values

Предыдущее

Следующее