Job Title: AI/ML Engineer – Customer Data Platform (CDP)
Location: (Atlanta, GA / Frisco, TX)
Duration: Long Term
Must Have Skills (Mandatory Areas)
- Python (Core programming)
- Entity Resolution / Record Linkage / Deduplication
- Machine Learning (scikit-learn, ML models)
- Fuzzy Matching Techniques (Levenshtein, Jaro-Winkler, Jaccard)
- LLMs & Prompt Engineering (OpenAI / Anthropic)
- SQL + Distributed Processing (Spark / Dask)
- Vector Databases (Pinecone, pgvector, Qdrant)
- Embeddings & Semantic Search
Required Qualifications
- Bachelor's or Master's in Computer Science / Data Science or related field
- 3+ years of experience in AI/ML Engineering
- 1+ year of experience in entity resolution or record linkage
- Strong Python programming skills
- Experience with ML libraries: scikit-learn, HuggingFace Transformers
- Strong SQL skills + distributed systems (Spark / Dask)
- Experience with LLM APIs and vector search systems
- Familiarity with ML lifecycle tools (MLflow)
Preferred Skills
- Experience working with large-scale datasets (millions/billions of records)
- Understanding of precision/recall trade-offs in identity resolution
- Experience with RapidFuzz, jellyfish libraries
- Knowledge of customer data platforms (CDP)
—