Re: cost_hashjoin

Поиск

Список

Период

Сортировка

От	Simon Riggs
Тема	Re: cost_hashjoin
Дата	30 августа 2010 г. 10:49:26
Msg-id	1283176151.1800.2360.camel@ebony обсуждение исходный текст
Ответ на	Re: cost_hashjoin (Greg Stark <gsstark@mit.edu>)
Список	pgsql-hackers

Дерево обсуждения

On Mon, 2010-08-30 at 13:34 +0100, Greg Stark wrote:
> On Mon, Aug 30, 2010 at 10:18 AM, Simon Riggs <simon@2ndquadrant.com> wrote:
> > cost_hashjoin() has some treatment of what occurs when numbatches > 1
> > but that additional cost is not proportional to numbatches.
> 
> Because that's not how our hash batching works. We generate two temp
> files for each batch, one for the outer and one for the inner. So if
> we're batching then every tuple of both the inner and outer tables
> (except for ones in the first batch) need to be written once and read
> once regardless of the number of batches.

Thanks for explaining. For some reason I thought we rewound the outer at
the start of each batch, which is better for avoiding cache spoiling.

-- Simon Riggs           www.2ndQuadrant.comPostgreSQL Development, 24x7 Support, Training and Services

В списке pgsql-hackers по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: cost_hashjoin