No replication after power failure

I am not sure this would be considered a bug, but it is a problem for
us. If the master goes down, the replicas have no way of detecting it.
When the master comes back up again, all replica servers have to be
restarted before replication resumes. Is there a way to avoid this?

Relying on KEEPALIVE (socket or TCP) is not really an option, since
the default timeout is 2 hours, which is too long.
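
For reference, on Linux the 2 hour figure is only the system-wide
default (net.ipv4.tcp_keepalive_time = 7200); the timers can also be
shortened per socket. Below is a minimal sketch of that, not code from
the server itself. The helper name enable_fast_keepalive and the values
60/10/3 are illustrative assumptions, and whether the replication client
exposes its socket for this kind of tuning is a separate question.

```python
import socket


def enable_fast_keepalive(sock, idle=60, interval=10, count=3):
    """Turn on TCP keepalive and, where the platform allows, shorten its timers.

    Hypothetical helper; the default values are illustrative only.
    """
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE, 1)
    # The TCP_KEEP* options are platform-specific (present on Linux),
    # so skip any that this platform does not define.
    for name, value in (("TCP_KEEPIDLE", idle),      # idle seconds before the first probe
                        ("TCP_KEEPINTVL", interval),  # seconds between probes
                        ("TCP_KEEPCNT", count)):      # unanswered probes before giving up
        opt = getattr(socket, name, None)
        if opt is not None:
            sock.setsockopt(socket.IPPROTO_TCP, opt, value)


sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
enable_fast_keepalive(sock)
```

With these values a dead peer would be noticed after roughly
60 + 10 * 3 = 90 seconds of silence instead of 2 hours.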

Another approach would be to have some kind of timeout in the epoll wait
and check whether the master is responding, but isn't that timeout
already used for the runqueue?
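
To make the timeout idea concrete, here is a rough sketch of waiting on
a connection with an idle limit. It is not the server's actual event
loop; the function name wait_for_master and the socketpair standing in
for the master connection are assumptions for illustration. Python's
selectors module uses epoll on Linux.

```python
import selectors
import socket


def wait_for_master(sock, idle_limit):
    """Return True if sock becomes readable within idle_limit seconds.

    Hypothetical helper. Readable also covers EOF, so a real caller
    would still recv() to tell data from a closed connection.
    """
    sel = selectors.DefaultSelector()  # epoll-backed on Linux
    try:
        sel.register(sock, selectors.EVENT_READ)
        # select() returns an empty list when the timeout expires.
        return bool(sel.select(timeout=idle_limit))
    finally:
        sel.close()


# A local socket pair stands in for the master/replica connection.
master_side, replica_side = socket.socketpair()
silent = wait_for_master(replica_side, 0.1)  # nothing has been sent yet
master_side.sendall(b"ping")
alive = wait_for_master(replica_side, 1.0)   # data is now pending
master_side.close()
replica_side.close()
```

A silent connection does not by itself prove the master is down, so a
timeout like this would only be the trigger for an explicit check or a
reconnect.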

Have you come across this? I was surprised to see that no one has had
any issues with it. Am I missing something?


/Stelios Grigoriadis