All times are UTC - 5 hours [ DST ]



Forum locked This topic is locked, you cannot edit posts or make further replies.  [ 2 posts ] 
 Post subject: Best approach to handle duplicate contact detection
Posted: Mon Oct 12, 2009 8:43 am 
Newbie

Joined: Mon Nov 10, 2008 8:43 am
Posts: 8
Hi All,

My app needs to find duplicate contacts for a user. Duplicates are detected on the basis of email, name, and mobile number: if any one of these attributes matches that of another contact, the two contacts are considered duplicates of each other.

I would like the group's guidance on the best design for this. I can think of the following options; please suggest which one would give the best performance:

1) Simply query all the contacts and then, for each contact, iterate over the others to find matches
2) Use Hibernate Search and index name, email, and mobile
3) Use a stored procedure that finds all the duplicates, and invoke it from Hibernate
4) Any other approach
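
For reference, option 1 does not have to compare every contact with every other one. Below is a minimal in-memory sketch in plain Java, assuming exact matches only and a hypothetical `Contact` class (the real entity is not shown in this thread). It keeps one hash index per attribute, so each contact is only checked against earlier contacts that share a value with it.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

final class DuplicateFinder {

    /** Hypothetical contact shape, used only for this illustration. */
    static final class Contact {
        final long id;
        final String name;
        final String email;
        final String mobile;

        Contact(long id, String name, String email, String mobile) {
            this.id = id;
            this.name = name;
            this.email = email;
            this.mobile = mobile;
        }
    }

    /** Returns [earlierId, laterId] pairs that share a name, an email, or a mobile number. */
    static Set<List<Long>> findDuplicatePairs(List<Contact> contacts) {
        // One index per attribute: value -> ids of contacts already seen with that value.
        Map<String, List<Long>> byName = new HashMap<>();
        Map<String, List<Long>> byEmail = new HashMap<>();
        Map<String, List<Long>> byMobile = new HashMap<>();
        // A set, so a pair matching on several attributes is reported once.
        Set<List<Long>> pairs = new LinkedHashSet<>();
        for (Contact c : contacts) {
            record(byName, c.name, c.id, pairs);
            record(byEmail, c.email, c.id, pairs);
            record(byMobile, c.mobile, c.id, pairs);
        }
        return pairs;
    }

    private static void record(Map<String, List<Long>> index, String value,
                               long id, Set<List<Long>> pairs) {
        if (value == null || value.isEmpty()) {
            return; // a missing attribute never counts as a match
        }
        List<Long> seen = index.computeIfAbsent(value, k -> new ArrayList<>());
        for (Long other : seen) {
            pairs.add(Arrays.asList(other, id));
        }
        seen.add(id);
    }
}
```

This still loads every contact into memory, so it only suits contact lists of moderate size; the comparison is also case- and format-sensitive as written.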

Regards,
nitin


 Post subject: Re: Best approach to handle duplicate contact detection
Posted: Tue Oct 13, 2009 4:02 am 
Hibernate Team

Joined: Fri Oct 05, 2007 4:47 pm
Posts: 2536
Location: Third rock from the Sun
It depends on your definition of "duplicate": if you need to automatically remove all exact matches, a stored procedure would perform best.
If you need more flexibility, I'd go for Hibernate Search: you could scroll through all your users and, for each one, check the index for other users that are "very similar" to the current one. You can define the "similarity" function in a flexible way (for example, ignoring small typos in address fields, treating telephone numbers that are the same but formatted differently as equal, or matching names with or without titles in them).
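
To make the "similarity" idea a bit more concrete, here is a rough sketch in plain Java of the kind of normalization meant above. It does not use the Hibernate Search API; with Hibernate Search, this sort of normalization would normally be done by the analyzers configured on the indexed fields. The class name `ContactKeys`, its methods, and the list of titles are assumptions made up for the illustration, and typo tolerance (fuzzy matching) is not covered.

```java
import java.util.Locale;
import java.util.regex.Pattern;

final class ContactKeys {

    private ContactKeys() {
    }

    // Hypothetical, deliberately short list of titles to ignore at the start of a name.
    private static final Pattern TITLE =
            Pattern.compile("^(mr|mrs|ms|dr|prof)\\.?\\s+");

    /** Keeps digits only, so differently formatted numbers compare equal. */
    static String phoneKey(String raw) {
        return raw == null ? "" : raw.replaceAll("\\D", "");
    }

    /** Lower-cases, drops a leading title, and collapses runs of whitespace. */
    static String nameKey(String raw) {
        if (raw == null) {
            return "";
        }
        String s = raw.trim().toLowerCase(Locale.ROOT);
        s = TITLE.matcher(s).replaceFirst("");
        return s.replaceAll("\\s+", " ");
    }

    /** Trims and lower-cases the address. */
    static String emailKey(String raw) {
        return raw == null ? "" : raw.trim().toLowerCase(Locale.ROOT);
    }
}
```

Two contacts can then be compared on their keys instead of their raw values, for example `ContactKeys.phoneKey(a).equals(ContactKeys.phoneKey(b))`.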

_________________
Sanne
http://in.relation.to/

