MLOps Engineer developing cloud infrastructure for AI applications at SS&C Technologies, specializing in financial services. Collaborating with teams to ensure robust deployment and monitoring.
Responsibilities
Design, implement, and maintain scalable cloud infrastructure for AI applications using AWS services, with a focus on ECS, S3, RDS, and Bedrock
Manage deployments with Kubernetes clusters, ensuring high availability and performance, and integrating with our existing infrastructure best practices
Develop and maintain CI/CD pipelines
Develop and expand our Infrastructure-as-Code (IaC) with Terraform to ensure consistent and reproducible deployments across multiple environments
Collaborate with Python developers to containerize AI applications and optimize them for production deployment
Design and implement cloud networking solutions that support secure and efficient communication between AI microservices
Maintain and monitor security best practices for AI applications, including secrets management, access controls, and compliance requirements
Troubleshoot and resolve infrastructure-related issues across the AI application stack, ensuring minimal disruption to business operations
Stay updated with emerging MLOps tools, cloud technologies, and industry best practices to continuously improve our deployment and infrastructure capabilities
Participate in the strategic planning of AI Innovation infrastructure, identifying opportunities for automation and process improvement
Requirements
Bachelor's degree in Computer Science, Software Engineering, DevOps, or a related technical field
3+ years of experience in DevOps, MLOps, or cloud infrastructure engineering
Proven experience with AWS cloud services, particularly ECS, S3, RDS, IAM, and networking components
Strong expertise in Infrastructure as Code tools, specifically Terraform for cloud resource management
Experience building and maintaining CI/CD pipelines for application deployment, preferably with ML/AI applications
Strong understanding of containerization technologies and best practices for production deployments
Experience with cloud networking, security, and compliance best practices
Basic Python and software development skills
Excellent communication and collaboration skills, with the ability to work effectively with development teams and stakeholders.
Benefits
Competitive salary and performance-based bonuses
Comprehensive benefits package, including health, dental, and retirement plans
Opportunities for professional growth and career advancement
A collaborative work environment focused on innovation, learning, and excellence
Lead advanced machine learning model development and optimization at PayPal. Collaborate with teams to deploy scalable ML solutions in production environments.
Senior Machine Learning Engineer at Pivotal Health developing ML systems for healthcare reimbursement. Collaborating across teams to build and maintain reliable, production - grade machine learning systems.
Machine Learning Engineer working with Algorithm team on customer onboarding processes. Focus on execution and automation of models using computer vision and AI in sports industry.
Senior Machine Learning Engineer at Troveo designing and optimizing machine learning pipelines for AI video models. Collaborating with cross - functional teams to build scalable video data solutions.
Software Engineer focusing on ML infrastructure for drug discovery at Genesis AI. Leading engineering efforts to enhance scalable platforms for generative modeling and large - scale simulations.
AI/ML Engineer developing machine learning systems for TymeX's digital banking platform. Collaborating across teams to enhance customer interaction and personalization through AI technology.
Lead Machine Learning Engineering at Foundation Health, designing and deploying production ML systems. Collaborate with teams to implement AI/ML solutions for impactful healthcare.
Staff ML Engineer at GEICO leading the design and deployment of AI applications. Collaborating with teams to ensure scalability and reliability of generative AI solutions.