Configure and maintain CI/CD pipelines and support tools as a Sr. DevOps Engineer II. Collaborate on deployments, ensuring uptime and performance metrics meet expectations.
Responsibilities
Work with customers to design and implement customized installations of the C3 AI Platform that meet unique access and security requirements.
Maximize system uptime and availability, ensuring functional and performance SLAs.
Establish end-to-end monitoring and alerting on all critical aspects.
Solve complex problems for critical services and build automation to prevent problem recurrence.
Initiate and lead scripting and automation to streamline system updates and upgrades.
Set up critical infrastructure, tools, and framework to streamline the deployment cycle.
Work cross-functionally with Services and Engineering teams.
Requirements
Production experience with Generative AI
Bachelor’s degree in a Science, Technology, Engineering or Mathematics (STEM), or comparable area of study.
Demonstrated experience in deploying, managing, and operating scalable and fault-tolerant Kubernetes-based infrastructure in AWS, GCP, and other public clouds.
Experience with Infrastructure-as-Code configuration such as Terraform and Helm.
Experience in Bash or Python; to automate and monitor systems.
Excellent problem-solving, critical thinking, and communication skills.
Experience supporting as a DevOps or sys admin for commercial SaaS solutions. Customer facing experience is a plus.
DevSecOps Engineer responsible for enhancing Thales' secure hosting platforms in public and private clouds. Collaborating with teams to apply modern practices and build resilient infrastructures.
Develops high - automation services in Golang or Java within AWS, Kubernetes, and Azure. Supports teams in building secure applications while working in a hybrid environment.
DevOps Engineer specializing in AWS Cloud Infrastructure in a hybrid position. Collaborating within a supportive team to build modern infrastructure for VM - based applications.
Leading DevOps platform strategy for KIPMI Software's next - generation digital trust products. Collaborating with teams to implement scalable infrastructure and DevSecOps practices.
Join our DevOps team to build and manage GitHub pipelines and cloud - native Azure solutions. Collaborate with teams to drive DevOps best practices and optimize deployments.
Site Reliability Engineer enhancing system reliability and deployment practices at OpenLoop. Collaborating with cross - functional teams for incident management and performance tuning.
Senior DevOps Engineer enhancing Azure application reliability for a healthcare fintech platform. Collaborating closely with engineering teams to ensure deploy safety and observability.
DevOps Engineer contributing to tooling changes and leading a community of practice at Totara. Focused on collaboration, development, and support for internal teams.
Site Reliability Engineer responsible for infrastructure supporting AI platform. Safeguarding US customer data and ensuring compliance in the Aerospace and Defense sector.