AWS Data Engineer designing and deploying infrastructure using Terraform for a modern Data Lakehouse. Driving ETL processes, data ingestion, and analytics using AWS technologies.
Responsibilities
Design, develop, and deploy AWS infrastructure using Terraform, including S3, Glue, IAM, Lake Formation, and Athena resources
Develop and maintain AWS Glue ETL jobs (PySpark or Python shell) for data ingestion, transformation, and curation across raw → clean → curated layers
Integrate Airflow (Amazon MWAA or self-managed) for orchestrating Glue jobs, data pipelines, and dependencies
Build and maintain Glue Catalog, manage metadata, and align with Lake Formation security policies
Write complex SQL queries for data validation, transformation, and reporting logic, ensuring efficient query performance
Manage Terraform state files, backend setup (S3 + DynamoDB), and environment-based deployments
Implement data ingestion frameworks for batch and near real-time pipelines
Collaborate with Snowflake and BI teams for seamless data consumption
Contribute to high availability (multi-AZ) and disaster recovery (multi-region) strategies for core data components
Requirements
6 years of experience as a Data Engineer or Cloud Engineer
Strong expertise in AWS Services: S3, Glue, Glue Catalog, Lake Formation, IAM, Athena, CloudWatch, Lambda (preferred)
Hands-on proficiency in Terraform (HCL) for infrastructure automation
Experience with Airflow DAGs for orchestration of Glue, S3, and external data flows
Solid understanding of PySpark / Python for ETL scripting
Strong ability to write and optimize complex SQL (joins, window functions, CTEs, and analytical queries)
Familiarity with data lake formats (Iceberg, Parquet, Delta, etc.)
Experience with CI/CD pipelines (GitHub Actions, CodePipeline, or Jenkins)
Benefits
Flexible work
Healthcare including dental, vision, mental health, and well-being programs
Financial well-being programs such as 401(k) and Employee Share Ownership Plan
Paid time off and paid holidays
Paid parental leave
Family building benefits like adoption assistance, surrogacy, and cryopreservation
Social well-being benefits like subsidized back-up child/elder care and tutoring
Data Architect defining data domains, models, and principles for Intelance's EA function. Collaborating with architects to ensure data lineage and integration.
Senior Data Architect responsible for building data infrastructure at Trexquant, integrating diverse datasets for research and simulation applications. Collaborating with teams to enhance data accessibility and quality.
Data Engineer responsible for developing data solutions and integrating systems for advanced analytics at Lilly. Focusing on data pipelines and solutions ensuring data quality and compliance.
Junior Data Engineer assisting with data - driven use - cases in the payment sector. Contributing to the establishment of a central data platform at S - Payment.
Senior Data Engineer leading tailored data - driven solutions delivery for public sector clients. Involves data transformation projects and building AI - powered tools for decision making.
Technical Lead in Data Engineering at Intentsify, building scalable applications for B2B marketing solutions. Leading a small team and making key technological decisions.
Data Engineer developing scalable data pipelines for RunBuggy's automotive logistics platform. Collaborate with cross - functional teams to unlock powerful insights and optimize data infrastructure.
Working Student in Data Engineering supporting the development of an energy management app's data backbone across Europe. Collaborate with diverse teams to ensure data quality and optimization.
Senior Data Engineer at Minsait responsible for designing and maintaining data infrastructure. Ensuring efficient and secure data collection, storage, and processing across various sectors.
Senior Data Engineer developing and maintaining scalable data pipelines at Quality Digital. Ensuring data quality, security, and compliance with best practices while collaborating with data teams.