Data Engineer at Vistra designing and maintaining data pipelines for analytics. Collaborating with teams and optimizing data integration using modern cloud technologies.
Responsibilities
Design and implement scalable ETL/ELT pipelines using AWS services including AWS Glue, Lambda, S3, and Step Functions
Build and optimize data integration processes connecting MySQL databases, APIs, and external data sources to analytical systems and data warehouses
Develop automated data quality monitoring, validation, and cleansing processes
Create and maintain data models, schemas, and documentation to support analytics teams
Implement real-time and batch data processing solutions using serverless architectures
Collaborate with development teams to integrate data collection points into Next.js applications and Node.js services
Build and maintain data analytics APIs and services
Monitor data pipeline performance, troubleshoot issues, and implement proactive alerting and logging mechanisms
Design and implement data backup, archival, and disaster recovery strategies
Work with data analysts and business stakeholders to understand reporting requirements
Requirements
Bachelor’s degree in Computer Science, Data Engineering, Mathematics, or a related technical field
4-6 years of hands-on data engineering experience with strong proficiency in Python for data processing, transformation, and pipeline development
Extensive experience with AWS data services including AWS Glue, Lambda, S3, Athena, Redshift, and Kinesis for building serverless data pipelines
Strong SQL skills and experience with MySQL database design, optimization, and administration including performance tuning and query optimization
Experience with data pipeline orchestration tools such as Apache Airflow, AWS Step Functions, or similar workflow management systems
Proficiency in data formats including JSON, CSV, Parquet, and Avro
Knowledge of data warehousing concepts, dimensional modeling, and analytics best practices for supporting business intelligence requirements
Experience with version control systems, CI/CD pipelines, and infrastructure as code practices for deploying and managing data infrastructure
AWS certifications such as AWS Certified Data Analytics Specialty or AWS Certified Solutions Architect
Experience with streaming data technologies including Apache Kafka, AWS Kinesis, or real-time data processing frameworks
Knowledge of machine learning workflows and experience building data pipelines that support ML model training and inference
Familiarity with business intelligence tools such as Tableau, Power BI, or AWS QuickSight for creating data visualizations and dashboards
Experience with containerization technologies like Docker and orchestration platforms for deploying data processing applications
Understanding of data governance, privacy regulations, and security best practices for handling sensitive data in cloud environments
Experience with NoSQL databases such as DynamoDB, MongoDB, or Elasticsearch for handling unstructured data and high-volume analytics workloads
Benefits
Flexible hybrid working arrangement
Birthday leave
Comprehensive medical insurance and dental coverage
Wellness allowance
Competitive annual leave entitlement
Internal mentorship program
Reimburse professional membership fees for certifications
Data Engineer at Equinix implementing data architecture solutions for scalability and analytics. Collaborating with teams to design data pipelines and maintain data models for business objectives.
Data Warehouse Architect developing and optimizing robust data warehouse environments on SAP BW/4HANA. Critical for enabling advanced analytics and reporting across the organization.
Data Engineering Manager leading a new Data Engineering team in Bengaluru. Shaping the design and scaling of core data engineering practices across the organization.
Sr. ETL/Data Warehouse Lead at Huntington designing, developing, and supporting ETL and Data Warehousing framework. Analyzing systems based on specifications and providing technical assistance.
Senior Google Data Architect designing and delivering scalable data solutions on Google Cloud Platform. Collaborating across teams to shape target - state data architectures and influence enterprise data strategy.
Data Engineer developing scalable data lake solutions and optimizing data pipelines at U.S. Bank. Collaborating with teams to manage data governance and cloud migration activities.
Lead AI, MLOps & Data Engineer at WedR, guiding complex data projects and AI innovation. Collaborate with diverse experts in a Product Studio for digital transformations.
Lead Azure Databricks Data Engineer implementing data solutions for data engineering projects at Ryan Specialty. Collaborating with stakeholders and mentoring junior staff on data pipelines and ETL processes.
Lead Azure Databricks Data Engineer at Ryan Specialty focused on implementing data solutions and collaborating with cross - functional teams to enhance data architecture.
Senior Data Engineer designing and implementing sustainable data solutions for diverse clients. Collaborating closely with stakeholders to enhance data services and platforms in a hybrid environment.