Location: Baltimore, MD, Philadelphia PA onsite 
Contract
Contract
Job Description:
- Candidate should have strong 12+ Years of experience in data engineering architecture for large-scale platforms.
- Contribute towards defining platform roadmap/Architecture/ solution, Design, POCs, prototype, technical evaluation for tech stack finalization and guiding principle for the best practices etc.
- Data platform development strategy, Data migration strategy, Data validation strategy, To review code, checklist / coding standards etc.
- Creating data models to reduce system complexities and hence increase efficiency & reduce cost.
- Expertise with data ingestion/orchestration tools and working experience in Real-time processing Framework (Apache Spark), PySpark and in AWS Redshift, Apache Airflow and EMR etc
- Strong coding background in Python, SQL, PySpark. Proficiency in data virtualization (Dremio or similar).
- Experience with data governance and access control frameworks (Privacera, Apache Ranger, etc.).
- Knowledge of search & discovery platforms (Solr, Elasticsearch, Looker).
- Solid understanding of data security, authentication (Okta), and compliance frameworks.
- Familiarity with CI/CD pipelines and DevOps practices (Jenkins, Git, Docker, Kubernetes).
- Prior experience designing enterprise data platforms in healthcare, pharma, or regulated industries.
- Knowledge of machine learning pipelines and integration into data platforms.
—
—
