Configure and maintain CI/CD pipelines and support tools as a Sr. DevOps Engineer II. Collaborate on deployments, ensuring uptime and performance metrics meet expectations.
Responsibilities
Work with customers to design and implement customized installations of the C3 AI Platform that meet unique access and security requirements.
Maximize system uptime and availability, ensuring functional and performance SLAs.
Establish end-to-end monitoring and alerting on all critical aspects.
Solve complex problems for critical services and build automation to prevent problem recurrence.
Initiate and lead scripting and automation to streamline system updates and upgrades.
Set up critical infrastructure, tools, and framework to streamline the deployment cycle.
Work cross-functionally with Services and Engineering teams.
Requirements
Production experience with Generative AI
Bachelor’s degree in a Science, Technology, Engineering or Mathematics (STEM), or comparable area of study.
Demonstrated experience in deploying, managing, and operating scalable and fault-tolerant Kubernetes-based infrastructure in AWS, GCP, and other public clouds.
Experience with Infrastructure-as-Code configuration such as Terraform and Helm.
Experience in Bash or Python; to automate and monitor systems.
Excellent problem-solving, critical thinking, and communication skills.
Experience supporting as a DevOps or sys admin for commercial SaaS solutions. Customer facing experience is a plus.
Senior DevOps Engineer developing core infrastructure supporting Shelf products. Focused on building reliable, secure, and scalable systems in hybrid work environment.
Cloud/Kubernetes Engineer supporting hybrid infrastructure across AWS and on - premise Kubernetes environments. Automating tasks and managing production reliability, security, and scalability.
AWS Infrastructure DevOps Engineer at Growth Acceleration Partners supporting AWS environments and infrastructure automation. Focused on reliability, security, and operational efficiency across production environments.
Site Reliability Engineer driving innovation and automation for Banking Solutions and Payments. Collaborating with teams to ensure application performance and reliability in a dynamic environment.
Mainframe SRE working on critical payment systems for fintech, ensuring stability and security. Collaborating with teams to perform root cause analysis and automate processes.
DevOps Engineer responsible for cloud product delivery, platform reliability, and using AI tools in DevOps workflows. Building CI/CD pipelines and optimizing container workloads for security and performance.
Senior DevOps Engineer for Paysafe, designing and deploying AWS applications and infrastructure. Collaborating on cloud environments and improving processes for scalable solutions.
Senior Site Reliability Engineer at Broadridge managing infrastructure design and operational support. Collaborating with teams to improve automation, performance, and reliability of services in a hybrid environment.
DevSecOps Engineer building and maintaining Azure DevOps cloud applications with API backend. Roles include developing CI/CD pipeline and automating backend tasks.
Reliability Engineer II at Cargill applying technical expertise to enhance process and asset reliability. Collaborating with teams to execute engineering strategies for equipment optimization in a salt mine setting.