TY  - JOUR
T1  - Synonym Based Duplicate Record Detection
AU  - Amshakala, K.
AU  - Nedunchezhian, R.
JO  - Asian Journal of Information Technology
VL  - 12
IS  - 7
SP  - 236
EP  - 241
PY  - 2013
DA  - 2001/08/19
SN  - 1682-3915
DO  - ajit.2013.236.241
UR  - https://makhillpublications.co/view-article.php?doi=ajit.2013.236.241
KW  - Data integration
KW  - duplicate record detection
KW  - WordNet ontology
KW  - synonyms
KW  - catalog integration
KW  - un-supervised matching
AB  - As the volume of data and the number of data providers grow rapidly, there is high demand for integrating data from heterogeneous data sources. In the real world, entities often have two or more representations, and data are not defined consistently across different data sources. When a user's query is answered by combining data from several databases, the results include duplicate entries. Duplicate detection techniques detect multiple representations of identical real-world entities. Without duplicate record detection techniques, the quality of the extracted data remains low. This study presents an unsupervised duplicate record detection technique that does not require expert knowledge or hand-coded rules to detect duplicate records. WordNet, a large lexical database, is used to match the entities.
ER  - 