Contactous
  • Products
    • Contact Management >
      • Enterprise Contact Manager (ECM)
      • ECM Pricing
    • Data Quality >
      • DeDupe API
      • CRM Data Quality
    • Data Parser >
      • On-Premise Data Parser
      • Cloud-based Data Extractor and Parser
    • AI Content >
      • Personalized Learning
    • RAG-as-a-service >
      • Answerous
      • Free Trial
    • Carbon Estimation API
  • Use Cases
    • Digital Business Cards
    • Customer Golden Record
    • Sales Funnel from Dealers
    • Automated Document Parser
    • Relationship Intelligence
    • Marketing Leads Management
    • Intelligent Data Import
    • CRM Data Consolidation
    • Webinars and Events
    • Physical Business Cards
    • Enterprise Pre-CRM
  • Company
    • Contact Us
    • Terms Of Use
    • Privacy Policy
  • Login
  • Products
    • Contact Management >
      • Enterprise Contact Manager (ECM)
      • ECM Pricing
    • Data Quality >
      • DeDupe API
      • CRM Data Quality
    • Data Parser >
      • On-Premise Data Parser
      • Cloud-based Data Extractor and Parser
    • AI Content >
      • Personalized Learning
    • RAG-as-a-service >
      • Answerous
      • Free Trial
    • Carbon Estimation API
  • Use Cases
    • Digital Business Cards
    • Customer Golden Record
    • Sales Funnel from Dealers
    • Automated Document Parser
    • Relationship Intelligence
    • Marketing Leads Management
    • Intelligent Data Import
    • CRM Data Consolidation
    • Webinars and Events
    • Physical Business Cards
    • Enterprise Pre-CRM
  • Company
    • Contact Us
    • Terms Of Use
    • Privacy Policy
  • Login

customized deduplication

ECM is frequently used for Deduplication. Here we explain the type of data that can be addressed by the application.

First, let's define Deduplication. It is the process of duplicate detection of contact records followed by another process which for each entity (person, company) in the database either merges the identifed duplicate records into one combined record, or remove some records from the database until it only contains a single record for each entity. 

Contactous has options to run customizable pre-processes on the data, before deduplication. Examples of these pre-processes include removal of unwanted characters and words, expansion of abbreviations and correction of misspellings, segmenting attributes into consistent output attributes and more. Advance pre-processes include categorizing email addresses as Good/Bad/Doubtful or validating a data point with an external dataset. 

Merging requires identification of a master record. The data from all other duplicate records merges into the master. The merging process can be manual or can be triggered to be executed by the system automatically. 

Deduplication

Description

Default

These functions are present in every Contactous instance by default - 1) Exact matching by Email, Names and Mobile numbers and 2) Smart matching of mobile numbers. 

Exact match is simple. It is widely used by almost all users of Contactous. Smart matching of mobile numbers can identify different formats of a number as duplicates. ​For example, the following mobile numbers will detected as duplicates by the system:

+91-98336-90611
(9833) 690611
00 91 98336 90611
919833690611
​0-98336-90611
(91)98336.90611

The master record in the default function is the last created record within that set of duplicates. 

Customized

The deduplication function can be customized to fit into an enterprise's requirements. The customization is done by Contactous or its partners. The deduplication process can be as complex as needed by the user. 

Examples of customized deduplication are:

- Smart matching of names. For example, "Albert Einstein", "Dr. Albert Einstein", "Einstein, Albert" and "Dr Einstein, Albert" can be detected as duplicates, if that is the desired rule. It can be extended to include a name with typo error and records like "Alber Einstein" can be included in the duplicate set. 

- Smart matching on Company Names. For example, the records with company names of "Contactous Pte Ltd", "Contact ous Pte. Ltd.", "Contactous (P) Ltd" and "Contactous" will be detected as duplicates. 

- Smart matching on any other field, eg domain name from email address, website URLs, first/last N number of words of a field, etc.

- Pre-processes to segment attributes into consistent output ones can be applied to any field. For example, "101 Cecil Street", "101, Cecil Str" and "101 - Cecil St" can form a duplicate set. 

- Exact matching by any field, eg company name 

- Any combination of the above can be scenarios can be included within the duplicate detection rule set. 

The master record can be selected by rules defined by users. For example, the master record can be selected based on its timestamp, source or some complex criteria based on its values. 

If required, the duplicate records can remain in the system instead of getting deleted. These can be modified to a new set altogether by the program. Analytics are available to see number of duplicates by batches or source of records. 

For discussion on customized deduplication, please contact us.

© 2025 CONTACTOUS PTE LTD | ALL RIGHTS RESERVED

Support

FAQ
Contact Us

Resources

Privacy Policy
Terms of Use

Address

24 Raffles Place, #25-02A
Singapore 048621.
© 2016 CONTACTOUS PTE LTD
ALL RIGHTS RESERVED