This discussion has been locked.

Swap goes over the top, possibly bad Up-2-Date

Hello,

Yesterday all the gateways I administer started showing the very same problem. Starting at approximately 18:00 CET, memory usage climbed and swap usage grew until it reached 100% and the gateway just died...

Anyone else seeing this? All gateways I have administrative access to are having the same problem (about 10 boxes, different configs and license states).

Something stinks here and I really need this resolved ASAP because it brings down even HA configs... :(

MBirdCZ over 17 years ago

Small update: it seems only versions 7.201 and above are affected. Gateways running 7.104 have had no problems so far (at least those I have access to).

dpritt over 17 years ago in reply to MBirdCZ

Lost one firewall this morning already.

The primary firewall is now at 99%, and I am trying to find a window to shut it down. It got there in less than 8 hours, since 03:00 BST this morning. We need a fix urgently.

The backup is an ASG220 and has little free memory at the best of times.

If this shuts the business down it will cost a lot of money, and I will be looking for new firewalls tomorrow.

Get it sorted now please.

Dave P

mhofer over 17 years ago in reply to dpritt

Hi,

You don't have to shut down the firewall. Connect to the console via SSH,
go to /etc/mdw/scripts and run "./ctasd restart".

This should resolve your problem.

MBirdCZ over 17 years ago in reply to mhofer

Thanks,

On most boxes this frees the swap and memory. On some boxes, however, the degraded state led the gateway into another type of problem, related to reporting/logging and SQL.

The syslog is now being filled with the following message at a tremendous rate:

2008:07:31-13:04:01 (none) ulogd[6654]: sql1: transaction: cannot rollback - no transaction is active

Regards,
Zdenek

dpritt over 17 years ago in reply to MBirdCZ

The problem on one of my firewalls was that I could not even log on; the console was timing out. I had to hit the big switch.

I managed to get onto one of the other firewalls and issued the command, but I have been monitoring the memory and it is still being used up. At this rate it will be full again in about 6-8 hours.
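For anyone who wants to put a number on "at this rate", here is a rough sketch. It assumes a Linux-style /proc/meminfo is readable from the shell. The function names `swap_used_pct` and `hours_until_full` are made up for illustration, and the estimate is a plain linear extrapolation from two readings, not something the gateway itself reports.

```shell
# Print the percentage of swap currently in use.
# The optional argument (default: /proc/meminfo) exists only so the
# function can be tried against a sample file.
swap_used_pct() {
    meminfo="${1:-/proc/meminfo}"
    awk '
        /^SwapTotal:/ { total = $2 }
        /^SwapFree:/  { free = $2 }
        END {
            if (total > 0) printf "%d\n", (total - free) * 100 / total
            else           print 0
        }
    ' "$meminfo"
}

# Rough linear estimate of the hours left until swap is full.
# Arguments: earlier reading (%), later reading (%), minutes between them.
# Prints "n/a" when usage is not growing.
hours_until_full() {
    awk -v a="$1" -v b="$2" -v m="$3" 'BEGIN {
        if (b <= a) { print "n/a"; exit }
        printf "%.1f\n", (100 - b) / ((b - a) / m) / 60
    }'
}
```

For example, readings of 20% and 30% taken 60 minutes apart give an estimate of 7.0 hours via `hours_until_full 20 30 60`.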

I tried to call support, but nobody was free. I left a voicemail on the support line and am still waiting for someone to contact me.

Dave

number2jcb over 17 years ago in reply to dpritt

Add me to the crowd; I came here to post and see what ctasd did. Glad to see I am not alone. Mine started spiralling out of control at 2pm EST and by 7pm EST it had died. I'll be watching this thread religiously for the solution, since support is just about worthless.

tom_01 over 17 years ago in reply to number2jcb

Just FYI: we are aware of the issue and are working with our vendor (Commtouch) on a solution. I'll post here again when we have more information.

mrainey over 17 years ago in reply to mhofer

Yep, restarting ctasd has an immediate impact on swap usage. However, the correct path is /var/mdw/scripts.
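Put together, the restart could be sketched as a small shell function. This is only an illustration: `restart_ctasd` is a made-up name, the directory defaults to the /var/mdw/scripts path given above, and the optional argument is there purely so the function can be tried against a different directory.

```shell
# Restart ctasd through its control script.
# The optional argument overrides the script directory
# (default: /var/mdw/scripts).
restart_ctasd() {
    script_dir="${1:-/var/mdw/scripts}"
    # Bail out early if the control script is not where we expect it.
    if [ ! -x "$script_dir/ctasd" ]; then
        echo "ctasd control script not found in $script_dir" >&2
        return 1
    fi
    # Run in a subshell so the caller's working directory is unchanged.
    ( cd "$script_dir" && ./ctasd restart )
}
```

Calling `restart_ctasd` with no argument in an SSH session on the gateway should be equivalent to changing into the directory and running "./ctasd restart" by hand.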

BrucekConvergent over 17 years ago in reply to mrainey

It appears that the issue is fixed... the memory usage at 2 customer sites that were having a problem has stayed below 10% for the past 20 minutes or so... it was growing to almost 50% within an hour just a little while ago.

CTO, Convergent Information Security Solutions, LLC

https://www.convergesecurity.com

Advice given as posted on this forum does not construe a support relationship or other relationship with Convergent Information Security Solutions, LLC or its subsidiaries. Use the advice given at your own risk.

Jack Daniel over 17 years ago in reply to BrucekConvergent

Yes, the offending servers are now offline. You should see memory level off; if you restart ctasd now, memory use should return to normal (and stay there).

Jack Daniel over 17 years ago in reply to Jack Daniel

If you are still experiencing problems, please restart ctasd as mentioned previously. The issue has been resolved and should not return.

Here is an excerpt from the vendor's statement on the issue from yesterday:

--------------------------------------------------------------------
As of 12:30 PM Pacific time Commtouch Detection Centers are back to
their full capacity.

Initial investigation of the issue, showed that the problem started at
4:30 AM Pacific time. It was a result of a fault in the Proactive
Pattern system that caused excessive downloads/updates, by some of the
clients. This caused additional load on the Data Centers. In turn some
of the clients experienced intermittency while communicating with the
Commtouch Data Centers.

The fault in the Proactive Pattern update system was fixed.

During the upcoming week we will look into adding additional measures in
order to prevent such issues in the future.
We are sorry for any inconvenience,
-----------------------------------------------------

MBirdCZ over 17 years ago in reply to Jack Daniel

Well, it seems there is still some problem with ctasd or the Commtouch servers, because the memory allocation on my box is slowly going up and over the top into swap again...

Maybe it would be better to let the middleware restart the ctasd daemon on a regular basis in order to prevent excessive memory/swap use?

I just restarted the daemon in question manually from the shell and it freed some 300M of memory.
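A minimal sketch of that idea, run from the shell rather than from the middleware itself. Everything here is an assumption for illustration: the names `THRESHOLD`, `MEMINFO`, `SCRIPT_DIR` and `check_and_restart` are hypothetical, the 80% threshold is arbitrary, and the script path is the /var/mdw/scripts one mentioned earlier in the thread. It restarts ctasd only when swap use crosses the threshold, instead of on a fixed schedule.

```shell
# Illustrative settings, overridable from the environment.
# These are NOT gateway configuration options.
THRESHOLD="${THRESHOLD:-80}"                  # swap use (%) that triggers a restart
MEMINFO="${MEMINFO:-/proc/meminfo}"           # where to read swap figures from
SCRIPT_DIR="${SCRIPT_DIR:-/var/mdw/scripts}"  # location of the ctasd control script

# Print the percentage of swap currently in use.
swap_used_pct() {
    awk '
        /^SwapTotal:/ { total = $2 }
        /^SwapFree:/  { free = $2 }
        END {
            if (total > 0) printf "%d\n", (total - free) * 100 / total
            else           print 0
        }
    ' "$MEMINFO"
}

# Restart ctasd only if swap use is at or above the threshold.
check_and_restart() {
    pct=$(swap_used_pct)
    if [ "$pct" -ge "$THRESHOLD" ]; then
        echo "swap at ${pct}%, restarting ctasd"
        ( cd "$SCRIPT_DIR" && ./ctasd restart )
    else
        echo "swap at ${pct}%, nothing to do"
    fi
}
```

Calling `check_and_restart` periodically (for example from a scheduled job, if the appliance allows one) would be one way to use it. It is a stopgap for the symptom only and does nothing about the cause.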

Zdenek
Attachment: memory.JPG