Re: Parallel Sort

Поиск
Список
Период
Сортировка
От Noah Misch
Тема Re: Parallel Sort
Дата
Msg-id 20130515180437.GA234183@tornado.leadboat.com
обсуждение исходный текст
Ответ на Re: Parallel Sort  (Claudio Freire <klaussfreire@gmail.com>)
Ответы Re: Parallel Sort
Список pgsql-hackers
On Tue, May 14, 2013 at 12:15:24PM -0300, Claudio Freire wrote:
> You know what would be a low-hanging fruit that I've been thinking
> would benefit many of my own queries?
> 
> "Parallel" sequential scan nodes. Even if there's no real parallelism
> involved, when a query has to scan the same table at multiple nodes,
> if it's big, it would be worth parallelizing the scans to transform
> them into synchro scans.
> 
> I have absolutely no idea how this would work easily without forked
> workers, because the scans might be buried in more complex execution
> trees. But still, it's worth considering, that parallelizing may
> benefit more than core usage.
> 
> If execution nodes could be paused at arbitrary points, a "parallel
> scan" node could pause one branch that has consumed the circular
> buffer, letting another branches consume their part, and thus
> "parallelizing" branch execution. But this would be perhaps more
> complex than simply forking.

Execution nodes do pause between every output tuple, at least nominally.
Still, given the architecture of our executor and the planner work to
implement such a thing, I would not classify it as low-hanging fruit.  It
would primarily apply to a plan with independent sequential scans of the same
large (relative to total memory) relation.  I'm sure that comes up, but it
doesn't strike me as typical.

Thanks,
nm

-- 
Noah Misch
EnterpriseDB                                 http://www.enterprisedb.com



В списке pgsql-hackers по дате отправления:

Предыдущее
От: Josh Berkus
Дата:
Сообщение: Re: commit fest schedule for 9.4
Следующее
От: Andrew Dunstan
Дата:
Сообщение: Re: commit fest schedule for 9.4