MLOps Engineer developing cloud infrastructure for AI applications at SS&C Technologies, specializing in financial services. Collaborating with teams to ensure robust deployment and monitoring.
Responsibilities
Design, implement, and maintain scalable cloud infrastructure for AI applications using AWS services, with a focus on ECS, S3, RDS, and Bedrock
Manage deployments with Kubernetes clusters, ensuring high availability and performance, and integrating with our existing infrastructure best practices
Develop and maintain CI/CD pipelines
Develop and expand our Infrastructure-as-Code (IaC) with Terraform to ensure consistent and reproducible deployments across multiple environments
Collaborate with Python developers to containerize AI applications and optimize them for production deployment
Design and implement cloud networking solutions that support secure and efficient communication between AI microservices
Maintain and monitor security best practices for AI applications, including secrets management, access controls, and compliance requirements
Troubleshoot and resolve infrastructure-related issues across the AI application stack, ensuring minimal disruption to business operations
Stay updated with emerging MLOps tools, cloud technologies, and industry best practices to continuously improve our deployment and infrastructure capabilities
Participate in the strategic planning of AI Innovation infrastructure, identifying opportunities for automation and process improvement
Requirements
Bachelor's degree in Computer Science, Software Engineering, DevOps, or a related technical field
3+ years of experience in DevOps, MLOps, or cloud infrastructure engineering
Proven experience with AWS cloud services, particularly ECS, S3, RDS, IAM, and networking components
Strong expertise in Infrastructure as Code tools, specifically Terraform for cloud resource management
Experience building and maintaining CI/CD pipelines for application deployment, preferably with ML/AI applications
Strong understanding of containerization technologies and best practices for production deployments
Experience with cloud networking, security, and compliance best practices
Basic Python and software development skills
Excellent communication and collaboration skills, with the ability to work effectively with development teams and stakeholders.
Benefits
Competitive salary and performance-based bonuses
Comprehensive benefits package, including health, dental, and retirement plans
Opportunities for professional growth and career advancement
A collaborative work environment focused on innovation, learning, and excellence
Machine Learning Engineer designing and implementing AI systems focused on Japanese language challenges at Woven by Toyota. Involves technical R&D, system design, and collaboration with cross - functional teams.
Principal Software Engineer leading MLOps within Analytics Platform at Sun Life. Focused on AWS and machine learning operations, collaborating across technical and business teams.
Machine Learning Engineer designing and optimizing deep learning models for safety - critical environments at Destinus. Shaping the future of high - speed, autonomous flight technologies.
Machine Learning Engineer optimizing personalization systems for Spotify's audio streaming service. Collaborating with cross - functional teams to enhance user experience and deliver recommendations.
Principal Machine Learning Engineer developing ML and GenAI solutions in a cloud - native environment at Flexera. Leading a high - impact team and driving operational excellence for ML infrastructure.
Senior ML Platform/Ops Engineer building AI - powered ML pipelines for a dynamic Ed - Tech company. Collaborating with ML scientists and engineers to ensure reliable deployment and observability.
Senior ML Platform/Ops Engineer building ML systems for AI - powered learning at Preply. Productionizing machine learning with high reliability, performance, and observability in a hybrid environment.
Machine Learning Engineer developing advanced Deep Learning models for autonomous driving technology at Mobileye. Collaborating in a high - end algorithmic engineering team on critical computer vision challenges.
Machine Learning Engineer focusing on vulnerabilities and security of AI systems at Carnegie Mellon University. Collaborating with a team to build robust prototypes and provide solutions for government sponsors.