I tested a two-node cluster of two UTM 320 appliances. The HA conntrack daemon is restarting continuously on the MASTER node:
2013:11:01-09:27:36 utm91-1 selfmonng[4206]: I check Failed increment conntrackd_running counter 1 - 3
2013:11:01-09:27:41 utm91-1 selfmonng[4206]: I check Failed increment conntrackd_running counter 2 - 3
2013:11:01-09:27:46 utm91-1 selfmonng[4206]: W check Failed increment conntrackd_running counter 3 - 3
2013:11:01-09:27:46 utm91-1 selfmonng[4206]: W NOTIFYEVENT Name=conntrackd_running Level=INFO Id=172 suppressed
2013:11:01-09:27:46 utm91-1 selfmonng[4206]: W triggerAction: 'cmd'
2013:11:01-09:27:46 utm91-1 selfmonng[4206]: W actionCmd(-): '/var/mdw/scripts/conntrackd restart'
2013:11:01-09:27:51 utm91-1 selfmonng[4206]: I check Failed increment conntrackd_running counter 1 - 3
2013:11:01-09:27:56 utm91-1 selfmonng[4206]: I check Failed increment conntrackd_running counter 2 - 3
2013:11:01-09:28:01 utm91-1 selfmonng[4206]: W check Failed increment conntrackd_running counter 3 - 3
2013:11:01-09:28:01 utm91-1 selfmonng[4206]: W NOTIFYEVENT Name=conntrackd_running Level=INFO Id=172 suppressed
2013:11:01-09:28:01 utm91-1 selfmonng[4206]: W triggerAction: 'cmd'
2013:11:01-09:28:01 utm91-1 selfmonng[4206]: W actionCmd(+): '/var/mdw/scripts/conntrackd restart'
2013:11:01-09:28:01 utm91-1 selfmonng[4206]: W child returned status: exit='0' signal='0'
Manual restart attempt:
# /var/mdw/scripts/conntrackd restart
Starting Conntrack Sync Daemon lockfile `/var/lock/conntrack.lock' exists, perhaps conntrackd already running?
I deleted the file /var/lock/conntrack.lock, but that did not solve the issue: the file is recreated, yet the daemon never starts.
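For anyone trying to reproduce this, here is a minimal sketch of how the two conditions (lock file present, daemon not running) could be told apart. The function name lock_state is hypothetical, and it assumes the process is named conntrackd and that pgrep is available on the box; I have not confirmed either on the UTM.

```shell
#!/bin/sh
# Hypothetical helper (not part of the UTM tooling): classify the state of a
# daemon and its lock file.
#   $1 = path to the lock file
#   $2 = exact process name to look for (assumed, not confirmed on the UTM)
lock_state() {
    lock_file=$1
    proc_name=$2
    if pgrep -x "$proc_name" >/dev/null 2>&1; then
        # A process with exactly this name exists.
        echo "running"
    elif [ -e "$lock_file" ]; then
        # No such process, but the lock file is still there.
        echo "stale-lock"
    else
        # Neither a process nor a lock file.
        echo "stopped"
    fi
}
```

Calling it as lock_state /var/lock/conntrack.lock conntrackd should print "stale-lock" if the lock file exists while no daemon is running, which would match the behaviour described above.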
The cluster was updated while running, from 9.106 to 9.165.
Regards,
Marco