Title-Data Engineers (Databricks)
Location-Wilmington, DE /plano, TX / Houston, TX / Columbus, OH (Onsite)
Contract
Visa H1B
Required-(DELTA LAKE w/ Databricks strong candidate is requested)
Job Description:
We are seeking a skilled and proactive Data Engineer with strong expertise in Databricks and PySpark to join our data engineering team. The ideal candidate will have a proven track record in building scalable data pipelines, optimizing data processing workflows, and leveraging Databricks features to deliver high-performance data solutions.
Key Responsibilities:
Design, develop, and optimize scalable data pipelines using PySpark and Apache Spark on the Databricks platform.
Work extensively with Delta Lake, Databricks notebooks, and related tooling.
Implement and enhance data models that support business intelligence and analytics use cases.
Write and optimize complex SQL queries for data extraction, transformation, and analysis.
Perform data pipeline performance tuning and ensure robustness and scalability.
Collaborate with data analysts, data scientists, and business stakeholders to deliver clean, reliable, and well-modeled data.
Ensure best practices in coding, version control, and documentation.
Required Skills:
Strong hands-on experience with PySpark and Spark-based data processing.
Expert-level proficiency with Databricks, including working with Delta Lake, notebooks, and job orchestration.
Solid understanding of SQL, data modeling, and database optimization techniques.
Proven experience building and maintaining data pipelines in cloud or big data environments.
Deep understanding of data performance tuning and optimization strategies.
Familiarity with CI/CD practices for data engineering pipelines is a plus.
Experience with cloud platforms (e.g., Azure, AWS, or GCP) is an advantage.
Rajani Singh | Sr. IT Recruiter |