Kubernetes Platform Engineer developing scalable AI platform solutions for VMware Cloud Foundation. Engaging with cross-functional teams and ensuring quality of the Private AI feature set.
Responsibilities
Collaborate with cross-functional teams to design and deliver expanded capabilities of Kubernetes-based platform services for AI
Own the AI platform’s end-user, in-product CLI across all components, and guide other teams in delivering CLI-based experiences for the platform
Decompose vague problems into detailed requirements, and develop solutions that meet the needs of our customers
Develop and maintain automated tests to ensure the quality and reliability of the Private AI feature set
Participate in code reviews and ensure that the code is aligned with VMware's coding standards and best practices
Troubleshoot and resolve complex issues related to Private AI services and how those services interface with other components of the stack, such as storage and networking
Requirements
5+ years of experience building scalable distributed systems in Go or C++
5+ years of hands-on experience with container technologies (Docker and Kubernetes)
Hands-on experience deploying and maintaining Kubernetes Operators is a big plus
Proven knowledge of systems design
Strong analytical and diagnostic skills with ability to work independently
Excellent communication and collaboration skills, with the ability to work with cross-functional teams
Experience with agile development methodologies and version control systems, such as Git
BS in Computer Science or a related technical field with 8+ years of related experience in the software industry, or MS in Computer Science or a related technical field with 6+ years of related experience in the software industry