Staff Software Engineer leading architecture and delivery of cloud-native AI platform for Cloudera. Optimizing AI stack and ensuring seamless integration for enterprises.
Responsibilities
Design and implement elegant, scalable application services (Go/Node.js) that wrap AI capabilities for enterprise use.
Lead the deployment of inference servers (vLLM, Triton) using KServe, KubeRay, or Knative to ensure serverless-style scaling for AI workloads.
Build internal tooling, SDKs, and "AI Gateways" that enhance team agility and simplify the integration of Foundation Models (Llama, GPT) into product features.
Architect robust Retrieval-Augmented Generation (RAG) pipelines and prompt management services that integrate seamlessly with vector databases and enterprise data sources.
Partner with UI engineers, UX designers, and Product Management to ensure the AI platform is not just powerful, but highly usable for internal developers.
Ensure AI workloads are secure, multi-tenant, and optimized for GPU resource scheduling (MIG, fractional GPUs) within Kubernetes.
Requirements
Bachelor’s degree with 6+ years of software engineering experience (or equivalent Masters/PhD tenure), with at least 2+ years focused on AI/ML systems.
Expert proficiency in Python (for AI ecosystem) and strong competence in a systems language like Go or Rust/C++ (for high-performance serving layers).
Deep understanding of LLM deployment challenges and runtimes (e.g., vLLM, ONNX, TorchServe, Triton).
Familiarity with quantization techniques (AWQ, GPTQ) to optimize model size/speed.
Experience building complex workflows using tools like LangChain or LlamaIndex, and deploying them on containerized infrastructure (Docker/Kubernetes).
Ability to navigate the rapidly changing AI landscape, filtering hype from practical engineering solutions, and driving technical alignment across teams.
Advanced Software Engineer developing backend cloud services for IoT devices. Designing APIs and optimizing software architectures for enhanced performance in a hybrid environment.
Senior Full Stack Engineer designing and implementing React + TypeScript features for smart home applications. Leveraging Micro - Frontend architecture and maintaining legacy systems in a hybrid work environment.
Software Engineer designing, implementing, and testing features for Cloudera's Data Warehouse. Collaborating with various teams to enhance product adoption and support customers efficiently.
Software Engineer in Storage Engineering team at Cloudera building Apache Ozone for data transformation. Join a passionate team working on innovative data solutions and scaling challenges.
Software Engineer at Cloudera designing and developing software products. Responsible for maintaining performance, testing, and communication with teams.
Software Development Engineer III developing the leading map renderer at Mapbox for dynamic, performant maps. Collaborating on an ambitious roadmap with cutting - edge browser technology.
Full Stack Developer for La Centrale, enhancing features on a high - traffic vehicle marketplace. Collaborating in agile teams to improve SEO, performance, and user experiences.
Software Architect developing future digital services for Omegapoint. Collaborating with teams and managing architecture responsibilities in a hybrid work environment.
Senior Software Engineer helping build a Creative Intelligence Platform for enterprise brands. Focusing on innovative technology and product excellence in a fast - growing global SaaS company.
Staff Software Engineer at Ingrid, a European startup focused on delivery optimisation for retailers and customers. Leading engineering quality and driving teams alignment on architectural directions.