Re: Replication Cluster Monitoring

Поиск

Список

Период

Сортировка

От	Kevin Grittner
Тема	Re: Replication Cluster Monitoring
Дата	7 августа 2015 г. 19:12:38
Msg-id	638490037.514417.1438963938801.JavaMail.yahoo@mail.yahoo.com обсуждение исходный текст
Ответ на	Replication Cluster Monitoring (HEMPLEMAN Matthew <matthew.hempleman@alstom.com>)
Список	pgsql-admin

Дерево обсуждения

HEMPLEMAN Matthew <matthew.hempleman@alstom.com> wrote:

> I’m writing a Java application to monitor a streaming
> replication cluster (Windows).  I want to monitor the Master and
> initiate failover if necessary (something like a scaled down
> version of pgpool).  I also want to monitor the standby and
> terminate synchronous replication in the event of a failure.  At
> this point, my app is polling the Master every N seconds and
> triggering a failover if the wait is too long or it receives a
> connection error.  I’m worried that this method of assessing
> server health could lead to false-failovers.  Any suggestions as
> to specific health checks I could run or issues I should watch
> out for?

Such an approach has many race conditions that can cause problems.
You may want to do web searches on the terms "split-brain
syndrome", STONITH, fencing, and heartbeat (as they apply to
computing).  It is not trivial to get this right, and if it's not
right it can easily cause more down time than it prevents.  (That's
not unique to PostgreSQL; it's the nature of automating fail-over.)

Be sure to consider what happens for transient network failures on
each machine and combination of machines, or if a machine
temporarily has a load that causes it not to respond for seconds or
minutes.

--
Kevin Grittner
EDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

В списке pgsql-admin по дате отправления:

Предыдущее

От: Alex Ignatov
Дата: 07 августа 2015 г., 18:25:42
Сообщение: Re: Replication Cluster Monitoring

Следующее

От: Jamie Strachan
Дата: 10 августа 2015 г., 23:17:20
Сообщение: History File woes

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Replication Cluster Monitoring

Предыдущее

Следующее