On 27/09/14 01:36, Alvaro Herrera wrote:
> Amit Kapila wrote:
>
>> Today, while again thinking about the strategy used in the patch to
>> parallelize the operation (vacuum database), I think we can
>> improve it for cases when the number of connections is smaller
>> than the number of tables in the database (which I presume
>> will normally be the case). Currently we send a command to
>> vacuum one table per connection; how about sending multiple
>> commands (for example, VACUUM t1; VACUUM t2) on one connection?
>> It seems to me there is an extra round trip for cases when there
>> are many small tables in the database and few large tables. Do
>> you think we should optimize for any such cases?
> I don't think this is a good idea; at least not in a first cut of this
> patch. It's easy to imagine that a table you initially think is small
> enough turns out to have grown much larger since the last analyze. In that
> case, putting one worker to process that one together with some other
> table could end up being bad for parallelism, if later it turns out that
> some other worker has no table to process. (Table t2 in your example
> could have grown between the time the command is sent and t1 is vacuumed.)
>
> It's simpler to have workers do one thing at a time only.
>
> I don't think it's a very good idea to call pg_relation_size() on every
> table in the database from vacuumdb.
>
Curious: would it be both feasible and useful to have multiple workers
process a 'large' table, without complicating things too much? They
could each start at a different position in the file.
Cheers,
Gavin
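
[Editor's note] The trade-off debated above, batching tables onto connections up front versus handing workers one table at a time, can be sketched as a greedy largest-first scheduler. This is purely illustrative and is not vacuumdb's actual code; the table names and sizes are made up, and Alvaro's objection is exactly that such up-front assignments rely on size estimates that may be stale by the time a worker reaches a table.

```python
# Illustrative sketch only (not vacuumdb code): statically assign tables
# to a fixed pool of worker connections, largest estimated size first,
# always giving the next table to the least-loaded worker so far.
# If an estimate is wrong (a "small" table has grown), the worker stuck
# with it still has to finish its whole batch -- the downside Alvaro
# describes -- whereas one-table-at-a-time dispatch self-balances.
import heapq

def assign_tables(tables, n_workers):
    """tables: list of (name, estimated_size). Returns per-worker name lists."""
    # Min-heap of (total_assigned_size, worker_index).
    heap = [(0, i) for i in range(n_workers)]
    heapq.heapify(heap)
    assignments = [[] for _ in range(n_workers)]
    # Largest first, so big tables are spread across workers before the
    # many small ones are packed in around them.
    for name, size in sorted(tables, key=lambda t: -t[1]):
        total, i = heapq.heappop(heap)
        assignments[i].append(name)
        heapq.heappush(heap, (total + size, i))
    return assignments

# Hypothetical workload: two large tables, two small ones, two workers.
tables = [("t1", 1000), ("t2", 10), ("t3", 10), ("t4", 990)]
print(assign_tables(tables, 2))
```

Each worker would then issue its batch (e.g. VACUUM t1; VACUUM t3) on a single connection, saving one round trip per small table, at the cost of the staleness risk noted above.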