Senior Operations Engineer driving efficiency and reliability in NVIDIA's global business operations. Collaborating with IT subsystems and automating operational workflows for organizational impact.
Responsibilities
Driving day-to-day interactions with NVIDIA-wide IT subsystems, ensuring smooth operational workflows across infrastructure and applications.
Crafting and maintaining GitLab CI/CD pipelines to automate build, test, and deployment workflows.
Monitoring system health, building/maintaining dashboards, creating alerts, and producing operational reports.
Performing user offboarding, access reviews, and compliance-related tasks across multiple systems.
Working with various IT subsystems to ensure API performance and integration stability meet defined SLAs and SLOs.
Coordinating changes and releases between engineering, operations, and security teams.
Enforcing security guidelines, managing vulnerability remediation, and collaborating with security teams on audits and assessments.
Maintaining documentation, SOPs, and process improvements to enhance operational maturity.
Requirements
8+ years of hands-on experience building/supporting complex services
BS/MS in Computer Science (or equivalent experience)
Knowledge of Python for automation, data handling, and tool development
Experience with monitoring tools (such as Prometheus, Grafana, Datadog, CloudWatch, Splunk)
Familiarity with ITSM practices, including incident, problem, and change management processes
Ability to perform secure and compliant offboarding and access-related tasks
Strong understanding of IT operations and system workflows
Senior DevSecOps Engineer/Developer responsible for building Humana's software security platform. Modernizing architecture and managing CI/CD pipelines as part of core engineering team.
Senior Information Security Analyst focusing on DevSecOps for Unidas, a major mobility company in Brazil. Responsible for optimizing security governance processes and delivering secure software.
DevOps Manager overseeing scaling for Seekr's AI platform using Kubernetes, Terraform, and Ansible. Leading a hands-on team and collaborating with engineering for efficiency.
Back-End & DevOps Software Developer contributing to building digital products to change the world. Specializing in back-end development and command of the DevOps ecosystem for robust infrastructure.
Lead DevOps Developer at Boeing, focusing on CI/CD and cloud infrastructure management. Collaborating with teams to automate processes and improve system performance across environments.
Vulnerability & Configuration Management Engineer responsible for vulnerability management and remediation processes at Relax Gaming. Collaborating with IT teams to improve security measures across various platforms.
DevOps Engineer designing and maintaining Azure-based hybrid cloud infrastructure for a company specializing in nature-based smart city solutions. Leading cloud architecture and mentoring engineers as part of a high-impact team.
SRE responsible for ensuring reliability and performance of IT systems at a digital transformation company specializing in public sector efficiency. Collaborating on system health, incident response, and automation tasks.
DevOps Senior role at Beyond Soluções managing CI/CD for .NET and Kubernetes applications. Collaborating on cloud solutions while fostering a culture of innovation and quality.
Senior Software Engineer at PayPal managing cloud infrastructure and DevOps solutions. Delivering complete SDLC solutions and guiding engineering teams for scalable and reliable services.