Data Analyst / Consultant (Databricks, Data Matching, Entity Resolution)
Smart Information Management Systems · Singapore
Executive SummarySmart IMS Inc provides Digital technology & Cloud transformation services, Application & Infrastructure Management Services, Unified Communications and Insurance implementation services to customers across the Americas, Europe, Middle East, and Asia-Pacific regions. As the trusted technology and business partner of leading MNCs, including Global Investment Banks, Smart IMS is also a Microsoft Gold Certified Partner, Oracle Platinum Partner and AWS MSP Partner.We are seeking a highly skilled Data Analyst / Consultant with expertise in entity resolution and data matching to support a critical data mapping initiative. The primary objective of this role is to accurately map internal entity IDs to external identifier sets, ensuring high-quality 1:1 relationships across datasets.This is an immediate requirement with potential for extension into larger-scale data initiatives.Key ResponsibilitiesMap internal entity identifiers to external identifier systems (e.g., ISINs and other financial entity data)Perform entity matching using incomplete or partially structured data (e.g., entity names)Apply NLP and fuzzy matching techniques to improve match accuracyEnsure clean, reliable 1:1 mappings across datasetsWork on an initial dataset (2,000–3,000 records) with scalability in mindClean, transform, and standardize unstructured or imperfect datasetsCollaborate with stakeholders to validate matching logic and outcomesDocument methodologies and matching rules for future scalingRequired Skills & ExperiencePossess at least 2-5 years of relevant experience in entity resolution / record linkage / data matchingHands-on expertise in fuzzy matching techniques and NLP approachesStrong experience working with imperfect or unstructured datasetsProficiency in Databricks or similar big data platforms (e.g., Spark)Solid data wrangling and data engineering skillsStrong analytical and problem-solving abilitiesPreferred QualificationsExperience working in financial services or with financial datasetsExposure to large-scale data mapping or master data management (MDM)Programming skills in Python (libraries such as pandas, fuzzywuzzy, spaCy, etc.)