Re: BUG #5452: Server core dumps coming out of recovery mode

From Heikki Linnakangas
Subject Re: BUG #5452: Server core dumps coming out of recovery mode
Date
Msg-id 4BE3D0B8.8040205@enterprisedb.com
In response to BUG #5452: Server core dumps coming out of recovery mode  ("Chris Copeland" <chris@cope360.com>)
Responses Re: BUG #5452: Server core dumps coming out of recovery mode  (Chris Copeland <chris@cope360.com>)
List pgsql-bugs
Chris Copeland wrote:
> I have two servers with the same hardware, OS, and pg binaries.  Log files
> are copied from the master to the standby and the standby is run in recovery
> mode.
>
> When the standby is triggered to come out of recovery mode, it fails and
> generates a core dump.  Upon trying to start it after failure, it starts
> looking for WAL files that it has already recovered.
>...
> 2010-05-06 10:57:30 CDT :LOG:  restored log file "00000001000000AF00000059"
> from archive
>>> >> Now I trigger the restore command to return 1 to stop the recovery
> 2010-05-06 10:59:30 CDT :LOG:  could not open file
> "pg_xlog/00000001000000AF0000005A" (log file 175, segment 90): No such file
> or directory
> 2010-05-06 10:59:30 CDT :LOG:  redo done at AF/59000068
> 2010-05-06 10:59:30 CDT :PANIC:  could not open file
> "pg_xlog/00000001000000AF00000059" (log file 175, segment 89): No such file
> or directory

At startup, the server needs to re-fetch the last checkpoint record.
That means calling restore_command again for a file that was already
restored. It looks like restore_command is failing at the re-fetch,
which causes the PANIC.

To trigger failover, restore_command needs to return 1, once, but it
must return 0 again on any subsequent calls. I suspect your
restore_command keeps returning 1 on the subsequent calls.
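
As a rough illustration of that behaviour, here is a minimal sketch of a restore script in Python. It is not the script from the bug report and not anything shipped with PostgreSQL; the function name, the trigger-file convention, the paths and the polling interval are all assumptions made for the example. The idea is that the trigger only affects requests for files that are not in the archive, so the re-fetch of an already archived segment still succeeds after failover has been triggered.

```python
import os
import shutil
import sys
import time


def restore(archive_dir, wal_name, dest_path, trigger_path, poll_seconds=5.0):
    """Hypothetical restore helper: returns the exit code for restore_command.

    0 -> wal_name was copied from archive_dir to dest_path
    1 -> wal_name is not archived and the trigger file exists (end recovery)
    """
    src = os.path.join(archive_dir, wal_name)
    while True:
        if os.path.exists(src):
            # Serve archived files unconditionally, even after the trigger
            # has fired, so the checkpoint re-fetch can succeed.
            shutil.copy(src, dest_path)
            return 0
        if os.path.exists(trigger_path):
            # The file is missing and failover was requested: report failure
            # for this request only.
            return 1
        # Warm standby: wait for the next segment to arrive.
        time.sleep(float(poll_seconds))


# Assumed invocation (hypothetical script name and paths):
#   restore_command = 'python3 restore_wal.py /archive %f %p /tmp/pg_trigger'
if __name__ == "__main__" and len(sys.argv) == 5:
    sys.exit(restore(*sys.argv[1:5]))
```

With a script along these lines, the request for the missing segment "00000001000000AF0000005A" would return 1 once the trigger file exists, while the later request for "00000001000000AF00000059" would return 0 again because that file is still in the archive.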

--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com
