[7.904][BUG][CLOSED] Unclean fail-over between nodes

Hi,

This is probably linked to the other proxy issues reported with the cluster, but I've found the following:

Fail node 2 (slave)
Told the node to reboot via management/high availability.
Graceful fail, no impact.  HTTP/SMTP traffic continued to be processed successfully on node 1.  Node 2 reconnected to the master, synced, and then started to process traffic.

Fail node 1 (master)  > node 2 (slave, promoted to master)
Node 2 promoted to master, webadmin fails (I've always seen this so I believe this is expected behaviour), log in again and see Node 2 as master.
Graceful failover: connections (VoIP, Cisco IPSec router) move across OK.  Node 1 reboots, comes back online as the slave, syncs, then starts processing traffic.

continues to

Astaro automated master promotion - transfer back from node 2 > 1 (1=preferred master)
Ungraceful fail, connections do not move across OK.
Had to manually shut down one of the WAN links (Eth1, Cable DHCP) before it would become usable even though it showed Up/Up status.  In/out both reported 0/0 kbit.

da_merlin over 16 years ago

Can't reproduce here with a DHCP interface.
Can you please try to reproduce? If it happens again:

* Please see if there is a dhcp process running: "ps ax|grep -i dhcp"
* Default GW of eth1 is set: "ip route show default dev eth1 table all|grep ^default"
* Default GW of eth1 is pingable.
* ARP resolution of Default GW is working on eth1: "arp -a -n -i eth1"
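
If it helps, the four checks above could be wrapped in a small script so they can be run in one go right after the fail-back. This is only a rough sketch: the function names (default_gw, arp_resolved, check_wan) are made up for illustration and are not part of any Astaro tooling, eth1 is taken from the report, and the ping options assume a Linux ping.

```shell
#!/bin/sh
# Sketch only: wraps the four manual checks from the list above.
# Function names are hypothetical; eth1 comes from the report.

# Read `ip route` output on stdin and print the first default gateway address.
default_gw() {
    awk '$1 == "default" && $2 == "via" { print $3; exit }'
}

# Read `arp -a -n` output on stdin; succeed only if the gateway ($1)
# has an entry that is not marked "<incomplete>".
arp_resolved() {
    awk -v gw="($1)" 'index($0, gw) && !/incomplete/ { found = 1 }
                      END { exit !found }'
}

check_wan() {
    iface="${1:-eth1}"

    # 1. Is a DHCP process running? ([d]hcp keeps grep from matching itself.)
    if ps ax | grep -i '[d]hcp' >/dev/null; then
        echo "dhcp process: running"
    else
        echo "dhcp process: NOT running"
    fi

    # 2. Is a default gateway set on the interface?
    gw=$(ip route show default dev "$iface" table all 2>/dev/null | default_gw)
    if [ -z "$gw" ]; then
        echo "default gw on $iface: NOT set"
        return 1
    fi
    echo "default gw on $iface: $gw"

    # 3. Is the gateway pingable? (one probe, 2 second timeout)
    if ping -c 1 -W 2 "$gw" >/dev/null 2>&1; then
        echo "ping $gw: ok"
    else
        echo "ping $gw: FAILED"
    fi

    # 4. Does ARP resolve the gateway on this interface?
    if arp -a -n -i "$iface" 2>/dev/null | arp_resolved "$gw"; then
        echo "arp for $gw: resolved"
    else
        echo "arp for $gw: NOT resolved"
    fi
}
```

After sourcing the file, calling `check_wan eth1` on the node that has just become master again prints one line per check, which should be easier to paste back into the thread than four separate outputs.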