Databricks Senior DevOps Engineer designing and operating platforms on AWS and Databricks for Financial Crime. Focused on platform infrastructure, governance, security, and operations.
Responsibilities
Architect, build, and operate end-to-end data and ML platforms on AWS and Databricks.
Own and administer Databricks workspaces for the Financial Crime platform.
Lead DevSecOps and DataOps practices, including infrastructure-as-code (IaC) and CI/CD pipelines for data and ML workflows.
Configure and optimize Databricks compute clusters (job clusters and all-purpose clusters) for performance, scalability, and cost efficiency.
Manage and enforce governance through Unity Catalog, including access control, security policies, data lineage, and isolation.
Build and operate ML infrastructure, including model deployment and serving endpoints.
Integrate AWS services (e.g., S3, Redshift, Kinesis, Lambda, EKS/ECS) with Databricks runtime and Delta Lake.
Implement platform security best practices, including secrets management, audit logging, and compliance controls.
Optimize system performance and diagnose large-scale production issues.
Mentor engineering teams and define architectural best practices for high-scale data and ML systems.
Requirements
7+ years of experience in data platform architecture, cloud infrastructure, or ML platform engineering.
Strong enterprise-level experience with Databricks and AWS.
Deep expertise in Unity Catalog governance and security models.
Hands-on experience designing, deploying, and operating Databricks clusters in production.
Experience managing Model Serving / ML deployment infrastructure.
Strong implementation mindset — able to design, build, deploy, and operate platforms end-to-end.
Experience operating in regulated environments (banking/fintech preferred).
Application Security Manager at Evertec, handling security strategy and implementation in financial tech. Leading efforts in Application Security, DevSecOps, and compliance with financial regulations.
Site Reliability Engineer at Assecor, focusing on SLIs, SLOs, and incident management. Enhancing performance and reliability through observability and automation in a hybrid work environment.
DevOps Architect at Ascensus, responsible for technical direction and oversight for application engineering practices across scrum teams. Promotes DevOps culture and innovative solutions.
Cloud Site Reliability Engineer ensuring scalability, performance, and reliability of cloud infrastructure deployed in Woven City. Working with product owners and teams for innovative solutions.
Senior DevOps Engineer supporting enterprise - grade Kubernetes infrastructure and CI/CD automation for U.S. Army projects. Engaging in critical system designs and automation processes with a focus on cloud - based platforms.
Reliability Engineer focusing on mechanical systems in a long - standing Australian FMCG company. Ensure ongoing reliability improvements and support plant operations for iconic cereal production.
Software Engineer 2 developing full - stack solutions for U.S. Bank. Collaborating with teams to design and maintain best in class software experiences.
Principal Software Engineer at FIS driving reliability and performance in fintech environments. Collaborating across teams for high - scale, high - reliability solutions in the finance sector.
Senior Software Development Engineer involved in automation testing at CVS Health. Designing, developing, and implementing automated testing solutions in a collaborative environment.
Senior Site Reliability Engineer focusing on reliability and operational excellence of workflow orchestration platforms like Apache Airflow. Engaging in operations and engineering projects in a hybrid setup.