[8.900][CLOSED] UTM Beta 9.0 falls over after 1day

I got everything installed and working on the beta 9.0 and then today none of my boxes can touch the net. My iphone can though.....

0 RFCat_vk_01 over 14 years ago

Please advise about your configuration.

Do you the have proxy enabled?
Do you have NAT (MASQ) enabled
Do you have any Network protection rules?

Ian
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 Verditious over 14 years ago

Config:

Esxi running astaro as a host, two NICs ( physical and virtual ) one external grabbing comcast ip as it should, the other internal to push data to network. Internal nic plugged into switch which feeds two hosts with ethernet and a line coming out of the switch which feeds a wifi router. Local DNS and DHCP server resident on DC, also a ESXi host. Astaro configured with one masquerading rule to forward internal network to external.

Heres the fun part..... LOL!!!

Suddenly everything is working again. But I cant seem to get ICMP through the firewall. Do I need to specifically set a rule to allow my hosts to ping google.com? I was able to ping before....

To mysteriously go down then back up is weird. The only troubleshooting i did was to blow out the masq. rule and put it back.

Thanks!!!
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 thefuzz4 over 14 years ago in reply to Verditious

For my home use, the first rule in my firewall is to allow all internal traffic out. I am able to ping just fine. Perhaps try creating that rule and see what happens after you do that?
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 StephenWeber over 14 years ago

Did you restore from a 8.301 backup or earlier? I noticed that I nothing but problems when I restored my config. Once I restore to Factory Defaults and then just recreated everything I ran stable.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 thefuzz4 over 14 years ago in reply to thefuzz4

oh and also I'm running in the same setup as you
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 thefuzz4 over 14 years ago in reply to StephenWeber

Did you restore from a 8.301 backup or earlier? I noticed that I nothing but problems when I restored my config. Once I restore to Factory Defaults and then just recreated everything I ran stable.

I did my restore from my 8.301 no issues so far. I'm hoping that it stays that way as I would hate to have to recreate all from scratch lol.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 thefuzz4 over 14 years ago in reply to thefuzz4

Of course after posting this for some reason I just had my first fail over to the slave. I will need to do some research to find out why my master just took a 5 min dump. It didn't reboot or anything so I'm not sure why the slave decided that it needed to kick in. But I sure do love having HA at home lol. At least here on these forums people don't question why you do things at home lol
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 Billybob over 14 years ago

High availability at home is pointless. Specially if you are already virtualizing. This beta has been fairly stable and not saying kernel panics are impossible but probably points to a misconfiguration in your esxi. I personally have never seen any of my systems just "laying down" on ANY version or beta I have ever used physical or virtual.
A daemon misbehaving.... sure. Linux kernel laying down, not likely without faulty hardware.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 thefuzz4 over 14 years ago in reply to Billybob

Yeah the machine didn't reboot or anything the slave just decided that it wanted to be master for about 5 mins then everything went back to normal. I know that the HA at home is over kill but it is nice when I do my up2dates so my internet stays up. Its also nice when I install updates on the ESXi host since you need to put all of the guests into suspend while the updates install.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

0 thefuzz4 over 14 years ago in reply to Billybob

So going through the HA Log looks like the master just didn't send a heartbeat for bit


2012:03:05-14:03:29 phoenix-2 ha_daemon[5358]: id="38C1" severity="info" sys="System" sub="ha" name="Node 1 is dead, received no heart beats!"

2012:03:05-14:03:29 phoenix-2 ha_daemon[5358]: id="38B5" severity="info" sys="System" sub="ha" name="Master is dead, taking over!"

2012:03:05-14:03:29 phoenix-2 ha_daemon[5358]: id="38B0" severity="info" sys="System" sub="ha" name="Switching to Master mode"

2012:03:05-14:04:18 phoenix-1 ha_daemon[12972]: id="38A1" severity="warn" sys="System" sub="ha" name="Another master around!"

2012:03:05-14:04:18 phoenix-1 ha_daemon[12972]: id="38A0" severity="info" sys="System" sub="ha" name="Node 2 changed mode: SLAVE -> MASTER"

2012:03:05-14:04:18 phoenix-1 ha_daemon[12972]: id="38A1" severity="warn" sys="System" sub="ha" name="Another master around!"

2012:03:05-14:04:18 phoenix-1 ha_daemon[12972]: id="38A1" severity="warn" sys="System" sub="ha" name="Another master around!"

2012:03:05-14:04:18 phoenix-1 ha_daemon[12972]: id="38A1" severity="warn" sys="System" sub="ha" name="Another master around!"

2012:03:05-14:04:18 phoenix-1 ha_daemon[12972]: id="38A1" severity="warn" sys="System" sub="ha" name="Another master around!"

2012:03:05-14:03:30 phoenix-2 ha_daemon[5358]: id="38A0" severity="info" sys="System" sub="ha" name="Activating sync process for database on node 1"

Not sure why but whatever its all back to normal now.