Re: Tracing down buildfarm "postmaster does not shut down" failures

Поиск
Список
Период
Сортировка
От Tom Lane
Тема Re: Tracing down buildfarm "postmaster does not shut down" failures
Дата
Msg-id 4948.1455073337@sss.pgh.pa.us
обсуждение исходный текст
Ответ на Re: Tracing down buildfarm "postmaster does not shut down" failures  (Andrew Dunstan <andrew@dunslane.net>)
Ответы Re: Tracing down buildfarm "postmaster does not shut down" failures  (Noah Misch <noah@leadboat.com>)
Список pgsql-hackers
Andrew Dunstan <andrew@dunslane.net> writes:
> anyway, we got a failure pretty quickly:
>     pg_ctl: server does not shut down at 2016-02-09 21:10:11.914 EST
> ...
> LOG:  received fast shutdown request at 2016-02-09 21:09:11.824 EST
> ...
> LOG:  checkpointer dead at 2016-02-09 21:09:14.683 EST
> LOG:  all children dead at 2016-02-09 21:10:11.184 EST
> ...
> LOG:  lock files all released at 2016-02-09 21:10:11.211 EST

Hmm.  Apparently, pg_ctl gave up exactly one second too early.
The way the wait loop in pg_ctl is coded, it waits one second more
after the last get_pgpid() probe before complaining --- so the last
time it looked for the pidfile was approximately 21:10:10.914, just
300ms before the postmaster removed it.  I wonder if that's entirely
coincidence.

Still, it seems clear that the bulk of the shutdown time is indeed the
stats collector taking its time about shutting down, which is doubly
weird because the ecpg tests shouldn't have created very many tables,
so why would there be a lot of data to write?  Even granting that it's
not writing to ramdisk, 57 seconds to shut down seems pretty excessive.

I wonder if it's worth sticking some instrumentation into stats
collector shutdown?
        regards, tom lane



В списке pgsql-hackers по дате отправления:

Предыдущее
От: Jim Nasby
Дата:
Сообщение: Re: Tracing down buildfarm "postmaster does not shut down" failures
Следующее
От: Noah Misch
Дата:
Сообщение: Re: Tracing down buildfarm "postmaster does not shut down" failures