Обсуждение: Table maintenance: order of operations important?

Поиск
Список
Период
Сортировка

Table maintenance: order of operations important?

От
Jeff Boes
Дата:
We have a number of tables from which we expire (delete) rows nightly.
Is there any value to ordering the operations, in terms of either table
physical organization or elapsed time?

E.g.,

   DELETE FROM foo WHERE date_expires < now();
   VACUUM ANALYZE foo;
   CLUSTER foo;
   REINDEX TABLE foo;

How would you choose to order these (under 7.4.x) for fastest
turn-around? Does it matter?


--
It may not always be easy, convenient, or politically correct to stand
for truth and right, but it is the right thing to do. Always.
________                                          --M. Russell Ballard
Jeffery Boes <>< mur@qtm.net

Re: Table maintenance: order of operations important?

От
Tom Lane
Дата:
Jeff Boes <mur@qtm.net> writes:
>    DELETE FROM foo WHERE date_expires < now();
>    VACUUM ANALYZE foo;
>    CLUSTER foo;
>    REINDEX TABLE foo;

> How would you choose to order these (under 7.4.x) for fastest
> turn-around? Does it matter?

If you are going to CLUSTER then the VACUUM and the REINDEX are both
utterly redundant.  The ANALYZE is still useful but should be done after
CLUSTER since its physical-order-correlation stats will be quite wrong
if done beforehand.  In other words there is only one sane way to do
this and it is

    DELETE FROM foo WHERE date_expires < now();
    CLUSTER foo;
    ANALYZE foo;

You could possibly make a case for

    DELETE FROM foo WHERE date_expires < now();
    CLUSTER foo;
    VACUUM ANALYZE foo;

The VACUUM won't do anything useful in terms of reclaiming space (there
being none to reclaim just after a CLUSTER) but it would ensure that all
rows in the table are marked as committed-good, rather than leaving that
work to be done by the first transaction that happens to hit each row.

            regards, tom lane

Re: Table maintenance: order of operations important?

От
Gaetano Mendola
Дата:
Tom Lane wrote:

> Jeff Boes <mur@qtm.net> writes:
>
>>   DELETE FROM foo WHERE date_expires < now();
>>   VACUUM ANALYZE foo;
>>   CLUSTER foo;
>>   REINDEX TABLE foo;
>
>
>>How would you choose to order these (under 7.4.x) for fastest
>>turn-around? Does it matter?
>
>
> If you are going to CLUSTER then the VACUUM and the REINDEX are both
> utterly redundant.

Without cluster with 7.4.2 a REINDEX is redundant after a VACUUM FULL ?



Regards
Gaetano Mendola