Guest User!

You are not Sophos Staff.

This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Upgrading to v18 causes HA Pair to stop working and caused 3 minute Disruption

I had the same problem. I have 2 XG750's in HA and it broke HD cluster when doing the update. That caused our company a 3 minute disruption in internet access. I thought HA stands for High Availability? I was running V17.5 MR15
on both XG750's.

The Upgrade to v18 caused a 3min. disruption! Does anyone know which logs to specifically look at in the /log
I am sure there is some logging showing services status ?

All in all, we waited and were dreary of this upgrade, seeing as we read previous reports where major difficulties arise.
So far, after the upgrade to v18.0.5 MR-5-Build586., no problems.

Regards, Scott

P.S. the other guys reporting a similar problem. the fact that the HA stopped working is put mildly, or ? 3 Minute Disruption in HA is a serious matter and I don't find more specifics on how to do a better After Action Review, so at least we could gain more insight ? That would be very helpful.



This thread was automatically locked due to age.
Parents
  • The Upgrade of V17.5 to V18 will cause both appliances to do it simultaneously. 

    See: docs.sophos.com/.../index.html

    SFOS 18.0 moved to SSH tunnel-based secure communication for the HA cluster. If you're upgrading the HA cluster to 18.0 or later for the first time, both the devices in the cluster will reboot simultaneously once. You'll receive an alert on the UI before you can proceed.

    As both appliances take some time to perform the upgrade, a 2-3 Minutes downtime is expected. Especially the bigger appliances take longer time to boot (hardware checks etc.). 

    This upgrade is a one time process and the HA should now upgrade as expected. 

Reply
  • The Upgrade of V17.5 to V18 will cause both appliances to do it simultaneously. 

    See: docs.sophos.com/.../index.html

    SFOS 18.0 moved to SSH tunnel-based secure communication for the HA cluster. If you're upgrading the HA cluster to 18.0 or later for the first time, both the devices in the cluster will reboot simultaneously once. You'll receive an alert on the UI before you can proceed.

    As both appliances take some time to perform the upgrade, a 2-3 Minutes downtime is expected. Especially the bigger appliances take longer time to boot (hardware checks etc.). 

    This upgrade is a one time process and the HA should now upgrade as expected. 

Children