Hi All, I'm running ASG 7.502 - two servers in active/active cluster mode, master/slave config . The two servers share the workload between them and all appears to operate correctly. My observations however: In a scenario where the slave node is promoted to master due to the master being temporarily unavailable (failure etc.): The slave is correctly steps in and becomes the master without interruption to the services, exactly as it should. I can access reports, logs etc. perfectly from the promoted server for all historical data etc. The promoted server however contains no quarantined email and therefore end users cannot release incorrectly quarantined emails until the original master is restored to service. Any quarantined email that the promoted slave>master node then captures during its promotion is "lost" once the original master is restored. Is this expected behaviour or do I have a sync issue/bug? HA logs are looking good with no errors being reported. Many thanks for any thoughts or insights! Cheers, Darren

This discussion has been locked.

You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Active/Active cluster - expected behaviour for quarantined email?

Hi All,

I'm running ASG 7.502 - two servers in active/active cluster mode, master/slave config.
The two servers share the workload between them and all appears to operate correctly.

My observations however:
In a scenario where the slave node is promoted to master due to the master being temporarily unavailable (failure etc.):

The slave is correctly steps in and becomes the master without interruption to the services, exactly as it should. I can access reports, logs etc. perfectly from the promoted server for all historical data etc.
The promoted server however contains no quarantined email and therefore end users cannot release incorrectly quarantined emails until the original master is restored to service.
Any quarantined email that the promoted slave>master node then captures during its promotion is "lost" once the original master is restored.

Is this expected behaviour or do I have a sync issue/bug?
HA logs are looking good with no errors being reported.

Many thanks for any thoughts or insights!

Cheers,

Darren

This thread was automatically locked due to age.

0 BAlfson over 16 years ago

There was an issue in 7.30x, but there shouldn't be any problem now. My guess is that you have at least one scrambled PostgreSQL database. If you have Astaro support, please see what answer you get when you submit a ticket via MyAstaro.

Cheers - Bob

Sophos UTM Community Moderator
Sophos Certified Architect - UTM
Sophos Certified Engineer - XG
Gold Solution Partner since 2005

MediaSoft, Inc. USA
Cancel
Vote Up 0 Vote Down

Cancel
0 darrenl over 16 years ago in reply to BAlfson

Thanks Bob, I'll try a full rebuild w/config restore on both of the servers and will see what happens. [:D]
Cancel
Vote Up 0 Vote Down

Cancel
0 BAlfson over 16 years ago

If you are comfortable working from the command line, you also might google site:astaro.org postgresql

There are some posts that discuss reindex and some that discuss deleting and recreating the databases.

Rather than a reinstall, a factory reset will be enough. Wait until the "good" Astaro is master, then power-down and disconnect all ethernet connections from the slave. Now you can do a facory reset; when reconnected and powered up, the Master should have a reliable new Slave!

Cheers - Bob

Sophos UTM Community Moderator
Sophos Certified Architect - UTM
Sophos Certified Engineer - XG
Gold Solution Partner since 2005

MediaSoft, Inc. USA
Cancel
Vote Up 0 Vote Down

Cancel
0 darrenl over 16 years ago

Last night I killed the master and rebuilt it, today did the same to the slave node.
Emails are now correctly syncing from node to node as well as all of the reports/data. I did notice when trawling through the logs that there was also an error with "_asg schema_ not created".

Thanks Bob!
Cancel
Vote Up 0 Vote Down

Cancel