Design, build, and maintain scalable data pipelines and ETL processes to support analytics, reporting, and data science initiatives
Develop and automate robust workflows for data ingestion, transformation, and delivery using orchestration tools and CI/CD pipelines
Leverage cloud platforms (Azure preferred) and technologies (Databricks, Azure Data Factory) to manage big data environments and enable advanced analytics
Participate in code reviews (PRs), enforce coding standards, and collaborate with cross-functional teams to ensure maintainable, high-quality solutions
Implement monitoring and observability for data pipelines, proactively identifying and resolving issues in production environments
Ensure data integrity, security, and compliance by applying best practices in data governance, quality management, and documentation
Work closely with business partners to understand requirements, deliver custom data solutions, and provide training and support for end users
Requirements
Bachelor’s degree in Computer Science, Engineering, Information Systems, or a related field (Master’s preferred)
2-3 years of relevant experience, or a Master’s degree in a related field
Strong proficiency with SQL, Python, and PySpark for data engineering tasks
Hands-on experience with cloud computing technologies for big data (Azure preferred), including Databricks and Azure Data Factory
Experience building and maintaining production-grade data pipelines, including monitoring and troubleshooting in live environments
Proficiency in using Git for version control and collaboration; experience participating in code reviews (PRs) and enforcing code quality standards
Experience working in an Agile environment, with knowledge of Agile methodologies and practices
Hands-on experience with CI/CD pipelines for automating data pipeline deployment and testing
Deep understanding and experience with Azure’s cloud architecture, services, and management tools
Knowledge of MLOps best practices, including deployment, monitoring, and maintenance of machine learning models
Experience participating in code reviews and collaborative development processes
Experience with building automated pipelines for data workflow deployment and monitoring in Azure Databricks
Experience supporting data-driven business transformation and data governance initiatives
Benefits
Numerous development opportunities offered to build your skills
Be part of a company with a higher purpose and contribute to making the world a better place
Health benefits for you and your family on your first day of employment
Four weeks of paid time off and two weeks of well-being pay per year, plus paid holidays
Excellent parental leave which includes a minimum of 16 weeks for mother and father
Future planning with our competitive retirement savings plan and tuition reimbursement program
Data Engineer building solutions on AWS for high - performance data processing. Leading initiatives in data architecture and analytics for operational support.
Senior Data Engineer overseeing Databricks platform integrity, optimizing data practices for efficient usage. Leading teams on compliance while mentoring a junior Data Engineer.
Associate Data Engineer contributing to software applications development and maintenance using Python. Collaborating with teams for clean coding and debugging practices in Pune, India.
Lead Data Engineer responsible for delivering scalable cloud - based data solutions and managing cross - functional teams. Collaborating with global stakeholders and ensuring high - quality project execution in a fast - paced environment.
Data Engineer focusing on development and optimization of data pipelines in an insurance context. Ensuring data integrity and supporting data - driven decision - making processes.
Data Engineer designing and implementing data pipelines and services for Ford Pro analytics. Working with diverse teams and technologies to drive data - driven solutions.
Full Stack Data Engineer on a Central Engineering Portfolio Team in Chennai delivering curated data products and collaborating with data engineers and product owners.
Data Engineer developing best - in - class data platforms for ClearBank with a focus on data insights and automation. Collaborating closely with stakeholders and supporting data science initiatives.
Data Engineer operating cloud - based data platform for Business Intelligence and Data Science. Collaborating on data architectures and ETL processes for Sparkassen - Finanzgruppe.
Data Engineer at Love, Bonito optimizing data pipelines and ensuring data quality for analytics. Collaborating on data architecture, operations, and compliance for effective data management.