SQL - How to remove duplicate entities from SEC

Hi there, it's summertime, so maintenance time. I'm trying to remove deprecated computers from the SEC SOPHOS521 database using an SQL query that can get rid of hundreds of duplicates.

Our naming scheme for computernames is the following:

RoomNumber-Servicetag/Serial-Make

If a computer is renamed (or reimaged, or when it receives an OS upgrade), the computername sometimes changes even though the Servicetag/Serial part stays the same. Sophos doesn't always recognize that this is the same entity, so it creates duplicates that never update, show up as Unknown, and whatnot. Console clutter that distracts from the real issues!

Example:

old name: 506-TKJFREP-D

new name: ROAM-TKJFREP-D

or

old name: ROAM-0445001-H

new name: GH-0445001-H

The only constants are the Servicetag, which is exactly 7 characters (letters or numbers) with a dash on either side of it, and the last part of the computername, which is always exactly one (1) character.
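
The naming rule above can be turned into a T-SQL expression. This is only a sketch: it assumes every name ends in `-<7 characters>-<1 character>` as described, and the column alias `ServiceTag` is chosen here for illustration. It runs against the sample names from this post, not against any SEC table.

```sql
-- Sample names from the examples above; no SEC tables are touched.
SELECT v.Name,
       -- The last 9 characters are '<7-char tag>-<1 char>', so keep the first 7 of those.
       LEFT(RIGHT(v.Name, 9), 7) AS ServiceTag
FROM (VALUES ('506-TKJFREP-D'),
             ('ROAM-TKJFREP-D'),
             ('ROAM-0445001-H'),
             ('GH-0445001-H')) AS v(Name)
-- Only accept names that really follow the scheme: dash, 7 letters/digits, dash, 1 character.
WHERE v.Name LIKE '%-[A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9]-_';
```

Whether `[A-Z0-9]` also matches lowercase letters depends on the collation of the database, so names that do not follow the scheme exactly may need a closer look.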

Right now I have the ability to find duplicates with the exact same name using the following SQL query:

SELECT c.Name,
       c.Description,
       c.DomainName,
       c.OperatingSystem,
       c.Managed,
       c.Deleted,
       c.Connected,
       c.SAVOnAccess,
       c.LastMessageTime,
       c.insertedat,
       c.IdentityTag,
       c.IPAddress,
       c.QuarantineCount,
       c.LastLoggedOnUser,
       c.MessageSystemAddress,
       cgm.GroupID
FROM [SOPHOS521].[dbo].[ComputersAndDeletedComputers] AS c
    INNER JOIN [SOPHOS521].[dbo].[ComputerGroupMapping] AS cgm ON cgm.ID = c.id
WHERE c.Name IN (
    SELECT c.Name
    FROM [SOPHOS521].[dbo].[ComputersAndDeletedComputers] AS c
    WHERE Deleted = 0
    GROUP BY c.name
    HAVING COUNT(c.name) > 1
)
ORDER BY c.name

I run this regularly, and it turns up about 20 duplicates each month, which I then delete manually from SEC.

Now, how would I change this query to find and automatically delete the entities that have different computernames but the same Servicetag/Serial, based on last report date (c.LastMessageTime)? The two LastMessageTime values would need to be compared: the entry with the more recent one stays, while the older one gets deleted.
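
One possible shape for such a query, as a sketch only: extract the Servicetag, rank the rows within each tag by LastMessageTime, and list everything that is not the most recent entry. The CTE names `Tagged` and `Ranked` are made up for this illustration, the `ID` column is assumed from the join in the query above, and the statement only SELECTs candidates. Whether rows can safely be removed in the database rather than through the console is not something this sketch addresses.

```sql
WITH Tagged AS (
    -- Derive the 7-character Servicetag from names that follow the scheme.
    SELECT c.ID,
           c.Name,
           c.LastMessageTime,
           LEFT(RIGHT(c.Name, 9), 7) AS ServiceTag
    FROM [SOPHOS521].[dbo].[ComputersAndDeletedComputers] AS c
    WHERE c.Deleted = 0
      AND c.Name LIKE '%-[A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9]-_'
),
Ranked AS (
    -- Ranking = 1 is the most recently reporting entry for each Servicetag.
    SELECT t.ID,
           t.Name,
           t.ServiceTag,
           t.LastMessageTime,
           ROW_NUMBER() OVER (PARTITION BY t.ServiceTag
                              ORDER BY t.LastMessageTime DESC) AS Ranking
    FROM Tagged AS t
)
-- Everything except the newest entry per Servicetag is a removal candidate.
SELECT r.ID, r.Name, r.ServiceTag, r.LastMessageTime, r.Ranking
FROM Ranked AS r
WHERE r.Ranking <> 1
ORDER BY r.ServiceTag, r.LastMessageTime DESC;
```

In SQL Server, rows with a NULL LastMessageTime sort last under `ORDER BY ... DESC`, so entries that never reported would end up among the candidates rather than being kept.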

Jak to the rescue.

:50158


This thread was automatically locked due to age.
  • Hello RRR,

    the "full" statement (from this post) SELECTs only ServiceTag and LastMessageTime for the rows WHERE Ranking<>1. Thus there should be a few duplicates (if ruckus' screenshot shows "your data" at the latest in rows 64/65).

    You'll get the output shown in the screenshot (assuming you're using the Management Studio) by selecting the inner block exactly as shown and pressing F5 (or clicking Execute).

    The 1610 rows are the n-plicates, significantly fewer than the expected 1800+. The sophisticated statement should not select fewer rows than the q&d one (at least I can't see a reason why it would). Please check whether the "inner" statement spits out the 3700+ rows, and also run the q&d SELECT.

    Christian

    :51600
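
A quick way to cross-check row counts like the ones discussed in this reply, sketched under the assumption that the Servicetag is extracted with `LEFT(RIGHT(Name, 9), 7)` and that names follow the scheme from the original post. The column aliases are hypothetical. The number of rows ranked other than 1 should equal the number of matching rows minus the number of distinct Servicetags.

```sql
SELECT COUNT(*) AS MatchingRows,
       COUNT(DISTINCT LEFT(RIGHT(c.Name, 9), 7)) AS DistinctServiceTags,
       -- Every Servicetag keeps exactly one row with Ranking = 1; the rest are n-plicates.
       COUNT(*) - COUNT(DISTINCT LEFT(RIGHT(c.Name, 9), 7)) AS ExpectedNonTopRows
FROM [SOPHOS521].[dbo].[ComputersAndDeletedComputers] AS c
WHERE c.Deleted = 0
  AND c.Name LIKE '%-[A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9][A-Z0-9]-_';
```

If the three numbers do not add up against the output of the ranking statement, the filters in the two queries probably differ.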