Senior Data Engineer responsible for building and maintaining scalable data ingestion systems for healthcare data. Optimizing data pipelines and collaborating with analytics engineers in a fast-growing environment.
Responsibilities
The Senior Data Engineer is responsible for building and maintaining scalable data ingestion infrastructure and operational systems.
Develop and optimize scalable data ingestion pipelines from platform sources (RDS, DynamoDB) into Snowflake.
Building event-driven pipelines using Kinesis, Airbyte, or other open-source frameworks to handle high-volume healthcare data.
Implementing and maintaining a staging-layer architecture that supports the broader medallion (staging → intermediate → marts) structure.
Creating configuration-driven, containerized toolsets (Docker/Kubernetes) to ensure data solutions are portable and maintainable.
Ensuring data reliability by building comprehensive monitoring, alerting, and automated testing for all ingestion processes.
Collaborating with analytics engineers to streamline the flow of data for dbt transformation.
Applying software engineering best practices, including modular design and test-driven development, to all data infrastructure.
Refactoring existing ingestion processes to improve performance, cost-efficiency, and scalability.
Mentoring mid-level and junior engineers through code reviews and sharing best practices in data operations.
Requirements
4-6+ years of professional experience in data engineering with a focus on data ingestion and infrastructure.
Proficiency in Python and SQL, with a track record of building production-grade data pipelines.
Strong experience with ingestion tools such as Kinesis, Airbyte, Kafka, or similar frameworks.
Hands-on experience with Snowflake and moving data from operational databases (RDS, DynamoDB) to cloud data warehouses.
Solid understanding of AWS services (S3, Lambda, Step Functions, RDS).
Experience with containerization (Docker) and deploying maintainable systems.
Knowledge of ELT patterns, specifically supporting analytics engineering workflows and dbt.
Experience with CDC (Change Data Capture) and incremental processing methodologies.
Detail-oriented mindset regarding data privacy and compliance (HIPAA experience is a plus).
Strong communication skills, with the ability to collaborate effectively across data science and engineering teams.
Data Engineer GCP Consultant developing scalable solutions in GCP. Collaborating with teams to implement data - driven business strategies in the financial services sector.
Lead Data Engineer at Capital One collaborating across Agile teams to develop cloud - based solutions. Utilizing programming languages and data technologies to empower financial services for millions.
Data Engineer developing modern data models and ETL processes for RUF's BI solutions based on Microsoft technologies. Focusing on data quality, performance, and automation in a cloud environment.
Senior Data Engineer at Cifas focusing on data pipeline design and management using Microsoft Fabric tools. Collaborating with teams to ensure data integrity and analytics performance.
Data Engineer responsible for designing and maintaining data pipelines and infrastructure, converting requirements into complex reports and dashboards. Consults with clients for data visualization dashboards and applications.
Senior Data Engineer developing scalable data pipelines for AI - driven decision - making in e - commerce. Join a team innovating online business management with autonomous agents.
Senior Data Architect designing and implementing data architectures at Join. Collaborating in agile squads focusing on quality and innovative processes.
Data Engineer at Join responsible for developing ETL/ELT pipelines and integrating data from various systems. Collaborate in an agile team environment to deliver quality data solutions.
Data Engineer responsible for developing, maintaining, and supporting data pipelines. Collaborating with business areas to ensure data quality and architecture improvements.
Data Engineering LP at Acuity Brands developing data models and pipelines, collaborating with global teams for innovative data insights and business decisions.