[NOTABUG] [7.386] Setting 2 nodes as cluster actif/actif does not work (properly) ?

Hi

I've just installed 2 news asg into 2 differents VMs both with 4 interfaces and bridged at VM level...
both astaro are working fines independantly... then I go in HA to set:

- for asg1:
operation mode: cluster actif/actif
sync interface: eth0
device name: asg1
device node id: 1
encryption key: ...
repeat: ...

so asg1 is becoming MASTER

- then on asg2 (I intentionnaly test that way):
operation mode: cluster actif/actif
sync interface: eth0
device name: asg2
device node id: 2
encryption key: ... (same as on asg1)
repeat: ...

then when connecting on asg1 I get first snapshot with only one line for asg1 MASTER then on system status I get 2 nodes... I think there is something wrong with HA detection/statuses ?

when on the asg1 console I get a prompt and when on asg2 I get a prompt

maybe related to that bug from CraigD ?

[7.385] Did something change for HA, can't get it to work?

thx

Parents

0 BuBU over 17 years ago

I did a reboot of asg2 it did not helped...

then I did reboot asg1 and now I see both nodes.... so probably something wrong and not started/restarted properly ?

thx
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

Reply

0 BuBU over 17 years ago

I did a reboot of asg2 it did not helped...

then I did reboot asg1 and now I see both nodes.... so probably something wrong and not started/restarted properly ?

thx
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

Children

0 BuBU over 17 years ago in reply to BuBU

oh forgot something maybe important...

initial version for asg1 was 7.386 and I intentionnaly installed 7.380 on asg2 to see what happening in that case... then I activated cluster as described...

thx
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

0 BuBU over 17 years ago in reply to BuBU

I just got UP2DATE failed on slave [:(]


2009:01:20-11:34:25 asg2-2 ha_daemon[3047]: id="38A2" severity="error" sys="System" sub="ha" name="Up2date failed!"

2009:01:20-11:34:05 asg1-1 ha_daemon[3048]: id="38A0" severity="info" sys="System" sub="ha" name="Node 2 changed state: UP2DATE -> UP2DATE-FAILED"

2009:01:20-11:34:05 asg1-1 ha_daemon[3048]: id="38A3" severity="error" sys="System" sub="ha" name="Up2date failed on Slave node 2, stopping up2date process!"

see the attached screenshot for up2date.log on asg2

thx

0 da_merlin over 17 years ago in reply to BuBU

Can you enter the command "hs" on both nodes and post the output ?

Is the ASG Master node connected to the Internet, so the Slave can download the necessary up2date packages to update to the master version ?
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 BuBU over 17 years ago in reply to da_merlin

Can you enter the command "hs" on both nodes and post the output ?

Is the ASG Master node connected to the Internet, so the Slave can download the necessary up2date packages to update to the master version ?

unfortunately I rebooted both astaro since... and messages are differents... here are the attached screenshots..

[ATTACH]1178[/ATTACH]

[ATTACH]1179[/ATTACH]
- hs_asg1.jpg
- View
- Hide
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 garnichda over 17 years ago in reply to BuBU

have you tried to disable HA then manually Update the Astaros to the same Firmware Verson (7.386) and then enable HA again if this fix the error?
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 da_merlin over 17 years ago in reply to garnichda

Ok seems there is an connectivity error. Slave can see the Master,
however Master has now knowledge of the Slave.

Slave probably hangs in the state of validating the HA secret key.

Can you try to ping the HA Master from Slave node?
ping 198.19.250.1
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 BuBU over 17 years ago in reply to garnichda

have you tried to disable HA then manually Update the Astaros to the same Firmware Verson (7.386) and then enable HA again if this fix the error?

will do.. but how can we be sure it will be done correctly when 7.400 will be final.. as upgrade is supposedly supported that way ! [:)]

thx
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 da_merlin over 17 years ago in reply to BuBU

Thought about it again and found probably the error:

VMware disables Ethernet interfaces in case of mac address conflicts.
Official ASG VMware images have for each Ethernet interface this line
in the vmware vmx configuration file:

ethernet0.ignoreMACAddressConflict = "TRUE"

Please add this line for each Ethernet interface, replace ethernet0 with ethernet1 etc...

ASG HA/Cluster adds virtual MAC addresses on each interface,
which are equal on all HA/Cluster nodes.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 BuBU over 17 years ago in reply to da_merlin

Thought about it again and found probably the error:

VMware disables Ethernet interfaces in case of mac address conflicts.
Official ASG VMware images have for each Ethernet interface this line
in the vmware vmx configuration file:

ethernet0.ignoreMACAddressConflict = "TRUE"

Please add this line for each Ethernet interface, replace ethernet0 with ethernet1 etc...

ASG HA/Cluster adds virtual MAC addresses on each interface,
which are equal on all HA/Cluster nodes.

oups sorry this is not VMware VMs... but Citrix XenServer VMs (xenserver 5.0 express edition) which is to me far better than vmware [:)]
I think first times I tested cluster active/active with 7.200 or 7.300 (I did not remember actually...) that was working fine on a xenserver 4.1...
but I'm setting up 2 news VMs which will be exact same version 7.386 and will test with 7.305 if it's not working with 7.386 and will let you know..

thx
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 BuBU over 17 years ago in reply to BuBU

ok I've setup 2 new astaro 7.386 with cluster active/active as described earlier...

and I get asg2 as WORKER with state UNINITIALIZED and asg1 ACTIVE when runing hs on asg2.
and only asg1 as MASTER with state ACTIVE when running hs on asg1

what does mean UNINITIALIZED ? can I run anything manually to update the state ?

thx
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel