Re: [HACKERS] why not parallel seq scan for slow functions

Поиск

Список

Период

Сортировка

От	Amit Kapila
Тема	Re: [HACKERS] why not parallel seq scan for slow functions
Дата	12 августа 2017 г. 19:18:51
Msg-id	CAA4eK1+X89Qk8k3Q9feiOyy5rvbiMjsS55e6pDLA0zYRU+ACMg@mail.gmail.com обсуждение исходный текст
Ответ на	Re: [HACKERS] why not parallel seq scan for slow functions (Robert Haas <robertmhaas@gmail.com>)
Ответы	Re: [HACKERS] why not parallel seq scan for slow functions (Robert Haas <robertmhaas@gmail.com>) Re: [HACKERS] why not parallel seq scan for slow functions (Dilip Kumar <dilipbalaut@gmail.com>)
Список	pgsql-hackers

Дерево обсуждения

On Thu, Aug 10, 2017 at 1:07 AM, Robert Haas <robertmhaas@gmail.com> wrote:
> On Tue, Aug 8, 2017 at 3:50 AM, Amit Kapila <amit.kapila16@gmail.com> wrote:
>> Right.
>>
>> I see two ways to include the cost of the target list for parallel
>> paths before rejecting them (a) Don't reject parallel paths
>> (Gather/GatherMerge) during add_path.  This has the danger of path
>> explosion. (b)  In the case of parallel paths, somehow try to identify
>> that path has a costly target list (maybe just check if the target
>> list has anything other than vars) and use it as a heuristic to decide
>> that whether a parallel path can be retained.
>
> I think the right approach to this problem is to get the cost of the
> GatherPath correct when it's initially created.  The proposed patch
> changes the cost after-the-fact, but that (1) doesn't prevent a
> promising path from being rejected before we reach this point and (2)
> is probably unsafe, because it might confuse code that reaches the
> modified-in-place path through some other pointer (e.g. code which
> expects the RelOptInfo's paths to still be sorted by cost).  Perhaps
> the way to do that is to skip generate_gather_paths() for the toplevel
> scan/join node and do something similar later, after we know what
> target list we want.
>

I think skipping a generation of gather paths for scan node or top
level join node generated via standard_join_search seems straight
forward, but skipping for paths generated via geqo seems to be tricky
(See use of generate_gather_paths in merge_clump).  Assuming, we find
some way to skip it for top level scan/join node, I don't think that
will be sufficient, we have some special way to push target list below
Gather node in apply_projection_to_path, we need to move that part as
well in generate_gather_paths.


-- 
With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Michael Paquier
Дата: 12 августа 2017 г., 17:46:38
Сообщение: [HACKERS] Regressions failures with libxml2 on ArchLinux

Следующее

От: Tom Lane
Дата: 12 августа 2017 г., 21:31:35
Сообщение: Re: [HACKERS] pg_stat_statements query normalization, and the 'in' operator

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: [HACKERS] why not parallel seq scan for slow functions

Предыдущее

Следующее