costing of hash join

Поиск

Список

Период

Сортировка

От	Jeff Janes
Тема	costing of hash join
Дата	4 января 2014 г. 01:18:56
Msg-id	CAMkU=1wMr-Ft8=hFfXDy+KRjPVgrN5LPovaLwkrQH9+YzU277g@mail.gmail.com обсуждение исходный текст
Ответы	Re: costing of hash join
Список	pgsql-hackers

Дерево обсуждения

I'm trying to figure out why hash joins seem to be systematically underused in my hands. In the case I am immediately looking at it prefers a merge join with both inputs getting seq scanned and sorted, despite the hash join being actually 2 to 3 times faster, where inputs and intermediate working sets are all in memory. I normally wouldn't worry about a factor of 3 error, but I see this a lot in many different situations. The row estimates are very close to actual, the errors is only in the cpu estimates.

A hash join is charged cpu_tuple_cost for each inner tuple for inserting it into the hash table:

* charge one cpu_operator_cost for each column's hash function. Also,

* tack on one cpu_tuple_cost per inner row, to model the costs of

* inserting the row into the hashtable.

But a sort is not charged a similar charge to insert a tuple into the sort memory pool:

* Also charge a small amount (arbitrarily set equal to operator cost) per

* extracted tuple. We don't charge cpu_tuple_cost because a Sort node

* doesn't do qual-checking or projection, so it has less overhead than

* most plan nodes. Note it's correct to use tuples not output_tuples

Are these operations different enough to justify this difference? The qual-checking (and I think projection) needed on a hash join should have already been performed by and costed to the seq scan feeding the hashjoin, right?

Cheers,

Jeff

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Omar Kilani
Дата: 04 января 2014 г., 00:41:02
Сообщение: Re: Streaming replication bug in 9.3.2, "WAL contains references to invalid pages"

Следующее

От: Tom Lane
Дата: 04 января 2014 г., 01:51:00
Сообщение: Re: costing of hash join

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

costing of hash join

Предыдущее

Следующее