Re: Parallel tuplesort (for parallel B-Tree index creation)

Поиск

Список

Период

Сортировка

От	Peter Geoghegan
Тема	Re: Parallel tuplesort (for parallel B-Tree index creation)
Дата	4 декабря 2016 г. 05:37:54
Msg-id	CAM3SWZSkg6CN--pZ-tcTVtU-trexvXc03RH3CX7Nx1RJtuBPEA@mail.gmail.com обсуждение исходный текст
Ответ на	Re: Parallel tuplesort (for parallel B-Tree index creation) (Alvaro Herrera <alvherre@2ndquadrant.com>)
Ответы	Re: Parallel tuplesort (for parallel B-Tree index creation) (Tomas Vondra <tomas.vondra@2ndquadrant.com>)
Список	pgsql-hackers

Дерево обсуждения

On Sat, Dec 3, 2016 at 5:45 PM, Alvaro Herrera <alvherre@2ndquadrant.com> wrote:
> I don't think a patch must necessarily consider all possible uses that
> the new feature may have.  If we introduce parallel index creation,
> that's great; if pg_restore doesn't start using it right away, that's
> okay.  You, or somebody else, can still patch it later.  The patch is
> still a step forward.

While I agree, right now pg_restore will tend to use or not use
parallelism for CREATE INDEX more or less by accident, based on
whether or not pg_class.reltuples has already been set by something
else (e.g., an earlier CREATE INDEX against the same table in the
restoration). That seems unacceptable. I haven't just suppressed the
use of parallel CREATE INDEX within pg_restore because that would be
taking a position on something I have a hard time defending any
particular position on. And so, I am slightly concerned about the
entire ecosystem of tools that could implicitly use parallel CREATE
INDEX, with undesirable consequences. Especially pg_restore.

It's not so much a hard question as it is an awkward one. I want to
handle any possible objection about there being future compatibility
issues with going one way or the other ("This paints us into a corner
with..."). And, there is no existing, simple way for pg_restore and
other tools to disable the use of parallelism due to the cost model
automatically kicking in, while still allowing the proposed new index
storage parameter ("parallel_workers") to force the use of
parallelism, which seems like something that should happen. (I might
have to add a new GUC like "enable_maintenance_paralleism", since
"max_parallel_workers_maintenance = 0" disables parallelism no matter
how it might be invoked).

In general, I have a positive outlook on this patch, since it appears
to compete well with similar implementations in other systems
scalability-wise. It does what it's supposed to do.

-- 
Peter Geoghegan

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Alvaro Herrera
Дата: 04 декабря 2016 г., 04:45:54
Сообщение: Re: Parallel tuplesort (for parallel B-Tree index creation)

Следующее

От: Tomas Vondra
Дата: 04 декабря 2016 г., 06:23:56
Сообщение: Re: Parallel tuplesort (for parallel B-Tree index creation)

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Parallel tuplesort (for parallel B-Tree index creation)

Предыдущее

Следующее