"HA selfcheck" mails every hour

Hi,

Since the last graceful takeover and the sync that followed (which took about 2 hours), each of our two systems has been sending a mail with the subject "[systemname][WARN-080] HA selfcheck" every hour.

The mail contains ha_selfcheck.txt with the following text:
HA SELFMON WARN: Confd in wrong mode, switching to master mode...
and from the slave node:
HA SELFMON WARN: Confd in wrong mode, switching to slave mode...

Version 9.109-1

What does it mean?

Thanks
jauer

This thread was automatically locked due to age.

ciscoman, over 12 years ago

Hi,
no real idea. However, I believe that if a sync takes more than 20 minutes, something is wrong.
I'm not sure whether you have a failover setup (active-passive). I would check whether the master stays the same or whether the boxes really switch to slave and back,
and whether the sync has really finished.
The second thing I would do is reboot the slave (if it is active-passive) and see whether the sync now runs faster. It shouldn't hurt.
Is there anything interesting in /var/log/high-availability.log?
Is there real connectivity between the HA sync interfaces?
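
If you want to see how often each node flips, here is a rough Python sketch that counts the "Confd in wrong mode" warnings by target mode. It only matches the message text quoted in the question; the function name count_mode_switches is made up for illustration, and I am assuming you paste in the text from the ha_selfcheck.txt attachments (I don't know whether the same line also shows up in the HA log).

```python
import re

# Matches the warning text quoted in the question and captures the target mode.
WARN_RE = re.compile(
    r"HA SELFMON WARN: Confd in wrong mode, switching to (\w+) mode"
)


def count_mode_switches(lines):
    """Count self-check warnings per target mode (e.g. 'master', 'slave')."""
    counts = {}
    for line in lines:
        match = WARN_RE.search(line)
        if match:
            mode = match.group(1)
            counts[mode] = counts.get(mode, 0) + 1
    return counts


if __name__ == "__main__":
    # The two messages from the question, used as sample input.
    sample = [
        "HA SELFMON WARN: Confd in wrong mode, switching to master mode...",
        "HA SELFMON WARN: Confd in wrong mode, switching to slave mode...",
    ]
    print(count_mode_switches(sample))
```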

On the command line, hs shows the HA status:
:/root # hs
Current mode: HA MASTER with id 1 in state ACTIVE
-- Nodes -----------------------------------------------------------------------
MASTER: 1 ASG01 198.19.250.1 8.311 ACTIVE since Thu Dec 12 13:26:02 2013
SLAVE: 2 ASG02 198.19.250.2 8.311 ACTIVE since Thu Dec 12 13:46:40 2013
-- Load ------------------------------------------------------------------------
Node  1: [1m] 0.00  [5m] 0.01  [15m] 0.00
Node  2: [1m] 0.00  [5m] 0.01  [15m] 0.00
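
If you want to compare that output between the two boxes over time, a small Python sketch like the one below could pull the node lines apart. It is based only on the layout of the sample above; the names parse_nodes and check are hypothetical, and the two checks (exactly one MASTER, same version on all nodes) are my own assumptions about what is worth flagging, not anything the appliance itself reports.

```python
import re

# Sample copied from the hs output above.
SAMPLE = """\
Current mode: HA MASTER with id 1 in state ACTIVE
-- Nodes -----------------------------------------------------------------------
MASTER: 1 ASG01 198.19.250.1 8.311 ACTIVE since Thu Dec 12 13:26:02 2013
SLAVE: 2 ASG02 198.19.250.2 8.311 ACTIVE since Thu Dec 12 13:46:40 2013
-- Load ------------------------------------------------------------------------
Node  1: [1m] 0.00  [5m] 0.01  [15m] 0.00
Node  2: [1m] 0.00  [5m] 0.01  [15m] 0.00
"""

# Assumed layout of a node line: ROLE: id name ip version STATE since <date>
NODE_RE = re.compile(
    r"^(?P<role>[A-Z_]+):\s+(?P<id>\d+)\s+(?P<name>\S+)\s+(?P<ip>\S+)\s+"
    r"(?P<version>\S+)\s+(?P<state>\S+)\s+since\s+(?P<since>.+)$"
)


def parse_nodes(text):
    """Return one dict per node line found in the hs output."""
    nodes = []
    for line in text.splitlines():
        match = NODE_RE.match(line.strip())
        if match:
            nodes.append(match.groupdict())
    return nodes


def check(nodes):
    """Return a list of human-readable findings (empty if nothing stands out)."""
    findings = []
    masters = [n for n in nodes if n["role"] == "MASTER"]
    if len(masters) != 1:
        findings.append(f"expected exactly one MASTER, found {len(masters)}")
    versions = {n["version"] for n in nodes}
    if len(versions) > 1:
        findings.append(f"nodes run different versions: {sorted(versions)}")
    return findings


if __name__ == "__main__":
    nodes = parse_nodes(SAMPLE)
    for node in nodes:
        print(node["role"], node["name"], node["state"], "since", node["since"])
    print(check(nodes) or "nothing unusual")
```

Running it once on each node and comparing the role and "since" columns should show whether the master really stays the same between the hourly mails.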

cheers,