SplitBrain with active/passive HA under ASG120 since upgrade to UTM9

Hello,

I can reproduce a split-brain HA condition after upgrading to UTM9. It seems that HA is no longer reliable (at least on the ASG120)?!

I started fresh with a "new" HA setup: the MASTER (previously running standalone) was joined by a factory-reset "SLAVE". One to two hours after the sync, with the HA nodes in ACTIVE state, HA breaks, and afterwards both nodes think they should be master!

Can someone give me a hint about what to do, or is this a bug in the latest UTM9?

I've attached the HA-Logs.

Regards,
rubber

This thread was automatically locked due to age.

0 AngeloC over 13 years ago

Hi Rubber,

Are you also using the local LAN interfaces for a heartbeat safety check? This can help rule out a problem with just your HA interface cable/NIC, since the LAN interface is then also used for the heartbeat as a double-check.

0 rubber over 13 years ago in reply to AngeloC

Hi AngeloC,

"Are you also using the local LAN interfaces for a heartbeat safety check? This can help rule out a problem with just your HA interface cable/NIC, since the LAN interface is then also used for the heartbeat as a double-check."

No, I am only using the HA interfaces, connected with a crossover cable. Up to v8.305 there was no need for a backup HA interface.

0 rubber over 13 years ago in reply to rubber

Now, with only one ASG120 running as master (already rebooted more than once), ONE and ONLY ONE of seven REDs is no longer able to connect:

2012:09:21-08:16:11 abkasg-2 red_server[13269]: A******XX4BC3: connected OK, pushing config
2012:09:21-08:16:19 abkasg-2 red_server[13269]: A******XX4BC3: command 'PING 1'
2012:09:21-08:16:19 abkasg-2 red_server[13269]: A******XX4BC3: PING remote_tx=1 local_rx=0 diff=1
2012:09:21-08:16:19 abkasg-2 red_server[13269]: A******XX4BC3: PONG local_tx=8
2012:09:21-08:16:35 abkasg-2 red_server[13269]: A******XX4BC3: command 'PING 8'
2012:09:21-08:16:35 abkasg-2 red_server[13269]: A******XX4BC3: PING remote_tx=8 local_rx=0 diff=8
2012:09:21-08:16:35 abkasg-2 red_server[13269]: A******XX4BC3: PONG local_tx=25
2012:09:21-08:16:51 abkasg-2 red_server[13269]: A******XX4BC3: command 'PING 13'
2012:09:21-08:16:51 abkasg-2 red_server[13269]: A******XX4BC3: PING remote_tx=13 local_rx=0 diff=13
2012:09:21-08:16:51 abkasg-2 red_server[13269]: A******XX4BC3: PONG local_tx=42
2012:09:21-08:17:12 abkasg-2 red_server[13269]: A******XX4BC3: No in-tunnel frame for 60 seconds, exiting.
2012:09:21-08:17:12 abkasg-2 red_server[4595]: A******XX4BC3: disconnecting

Before UTM9 there was no problem with these either. Should I go back to ASG8 and wait a few releases until UTM9 is stable?
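
The PING lines in the log above can be checked mechanically. The following is a minimal Python sketch that extracts the counters from such lines and flags the pattern seen here, where remote_tx keeps growing while local_rx stays at 0. The regular expression is inferred only from the lines shown in this post, and reading remote_tx as "frames sent by the RED" and local_rx as "frames received by the ASG" is an assumption based on the field names, not documented behaviour. The helper names are hypothetical.

```python
import re

# Sample PING lines copied from the red_server log in this post.
LOG = """\
2012:09:21-08:16:19 abkasg-2 red_server[13269]: A******XX4BC3: PING remote_tx=1 local_rx=0 diff=1
2012:09:21-08:16:35 abkasg-2 red_server[13269]: A******XX4BC3: PING remote_tx=8 local_rx=0 diff=8
2012:09:21-08:16:51 abkasg-2 red_server[13269]: A******XX4BC3: PING remote_tx=13 local_rx=0 diff=13
"""

# Pattern inferred from the lines above (assumption, not an official format).
PING_RE = re.compile(r"PING remote_tx=(\d+) local_rx=(\d+) diff=(\d+)")


def ping_counters(text):
    """Return a list of (remote_tx, local_rx, diff) tuples, one per PING line."""
    return [tuple(int(g) for g in m.groups()) for m in PING_RE.finditer(text)]


def tunnel_receives_nothing(counters):
    """True if remote_tx increased while local_rx stayed at 0 throughout."""
    if not counters:
        return False
    rx_always_zero = all(rx == 0 for _, rx, _ in counters)
    tx_increased = counters[-1][0] > counters[0][0]
    return rx_always_zero and tx_increased


counters = ping_counters(LOG)
print(counters)                           # [(1, 0, 1), (8, 0, 8), (13, 0, 13)]
print(tunnel_receives_nothing(counters))  # True
```

If that reading of the counters is right, the result would fit the final "No in-tunnel frame for 60 seconds" message: the control connection works, but nothing arrives inside the tunnel.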

0 rubber over 13 years ago in reply to rubber

"Now, with only one ASG120 running as master (already rebooted more than once), ONE and ONLY ONE of seven REDs is no longer able to connect"

Forget about the issue with the RED; I've downgraded to ASG8 and still have the same issue with this one RED. It seems to be a local issue.