This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Temporary TLS handshake timeouts

We're experiencing temporary TLS handshake timeouts resulting in websites not loading. For example if we want to got to google.com we sometimes see that the browser is trying to do the handshake and after the specified timeout the webpage does not load. Most times it needs 3 or 4 reloads before the page gets loaded. It will work then for a certain time before it happens again. This can be seen with several websites. No specific time, no specific websites, just TLS handshake suddenly failing for some time.

We already had this issue with 17.0.5 and it did not change with 17.1.1.

A few days ago I checked MTU and MSS and changed it from 1500 to 1492 according to our WAN connections. I thought it got better but now it seems that the issue still persists.

Has anybody seen this before?



This thread was automatically locked due to age.
Parents
  • Hi Jelle,

    I think this is about the same subject. I thought it was the imap proxy, but do suffer occasionally on some sites.

    https://community.sophos.com/products/xg-firewall/f/email-protection/105385/there-is-a-bug-in-the-email-imap-proxy

     

    Ian

  • This morning the issue became a major one as a lot of requests were failing at least the first time. So I rebooted the primary appliance and the auxiliary appliance took over. Guess what? The issue is not happening on the auxiliary (now primary) device although it is up since the installation of SFOS 17.1.1 like the other appliance.

    So why do the two device behave different although their uptime was the same? Could be an hardware issue or what is more likely that something is slowing down the appliance over time when it is primary in A-P mode.

    I'm going to check this during the next days. Maybe the issue can be seen on the other appliance too after it has been primary for some time.

  • Hi,

    were both boxes built using the same version of software? Have they both been upgraded in synch?

    Ian

  • Yes, both boxes were built with the same version and have been in HA cluster since then. Updates were always done in HA mode / in synch.

  • This issue seems to be kinda odd. 

    Can you reproduce it by going back to the "faulty" appliance? 

    And the HA sync only information between each other. So basically a module "can" get broken on one appliance and work on the other. 

    But do you see the same issue, if you only use the firewall? So without any protection? 

    If so - would take a deeper look at the Interface which this appliance use. Could be some kind of issue between ISP router and the other appliance WAN. Saw such cases quite often. 

  • I will first check if the issue comes up on the currently active appliance after some time. If not I will investigate further on the faulty device.

  • This morning we had the same issue with the former auxiliary device now being primary. So after around 12 days it happened again that connecting to the internet failed almost at every attempt. During the last days we already had problems with TLS handshakes again. Switched to the other HA-device by rebooting the primary appliance. Now everything works fine, at least for the next couple of days.

    I'm at MR1 but updating to MR3 seems to be no option as bugs from MR2 still aren't fixed.

Reply
  • This morning we had the same issue with the former auxiliary device now being primary. So after around 12 days it happened again that connecting to the internet failed almost at every attempt. During the last days we already had problems with TLS handshakes again. Switched to the other HA-device by rebooting the primary appliance. Now everything works fine, at least for the next couple of days.

    I'm at MR1 but updating to MR3 seems to be no option as bugs from MR2 still aren't fixed.

Children