Senior DevOps Engineer optimizing CI/CD pipelines and enhancing machine learning frameworks for NVIDIA's Vision AI platform. Collaborating with cross-functional teams for efficient software delivery.
Responsibilities
Contribute to the development and maintenance of advanced machine learning software and frameworks, optimizing for performance and scalability.
Enhance CI/CD pipelines to streamline the development, testing, and deployment of large-scale machine learning models.
Implement and manage cloud infrastructure for continuous integration, delivery, and deployment, ensuring high availability and scalability.
Collaborate with cross-functional teams, including engineering, QA, and research, to improve development workflows and enhance software delivery speed and quality.
Troubleshoot and resolve complex issues related to software development, containerization, and cloud infrastructure in production environments.
Write and maintain robust documentation for development and deployment processes.
Communicate effectively with technical and non-technical stakeholders to set shared expectations and ensure visibility around the release and deployment process.
Lead code reviews, testing, and debugging to ensure high-quality code and efficient workflows.
Mentor and guide junior engineers, fostering professional growth and enhancing team capabilities.
Requirements
Bachelor’s or master’s degree or equivalent experience in Computer Science, Information Systems, Electrical Engineering, or other related fields.
5+ years of experience in software engineering, with hands-on experience in CI/CD, cloud infrastructure, and advanced machine learning frameworks.
Proficiency with automation and orchestration tools such as Docker, Kubernetes, Jenkins, and Terraform, or similar CI/CD tools.
Experience with cloud platforms like AWS, Azure, or GCP.
Strong programming skills in Python and/or other relevant languages.
Experience in developing and deploying scalable software solutions.
Strong analytical and problem-solving skills with a focus on practical and scalable solutions.
Familiarity with version control systems and configuration management.
Excellent written and verbal English communication skills, with demonstrated success collaborating across time zones and functions.