Senior ML Services Engineer at Adobe developing robust AI/ML infrastructure solutions for large-scale AI models. Collaborating with cross-functional teams to optimize services leveraging AWS and Kubernetes.
Responsibilities
Design, develop, and maintain robust AI/ML infrastructure solutions to support the inference and deployment of large-scale AI models using Kubernetes and Python on popular services such as AWS cloud
Optimization of services to address high performance, latency, and throughput (load) requirements
Understanding sophisticated service requirements and technical constraints of various platforms while implementing solutions to vastly simplify the software stack, accelerating the inference ML models
Experience in building platform features to generalize across multiple customer applications
Collaborate closely with client or customer application teams to build re-usable solutions
Build the infrastructure for developing efficient, reliable, testable services code in a variety of technical stacks
Work closely with partner engineering teams to guide the development process from requirements and design through development, integration, testing, and deployment
Partner closely with various Adobe teams advising on using our technology, investigating bugs, and collaborating on providing new features
Respond to urgent production issues requiring fast resolution and deployment of code fixes/updates
Participate in inventing technology that has an enormous impact across Adobe, writing patents, and participating in an active internal community of software development professionals
Requirements
Bachelor's degree, Master’s degree, or equivalent experience in Computer Science, Engineering, Mathematics, etc. or equivalent practical experience
Firm computer science fundamentals, including design patterns, algorithms, asymptotic complexity, parallelism, and database schema design
Previous experience building, optimizing and operating GPU intensive machine learning workloads in production environments with strong hands-on experience with large-scale GenAI model inference
Exceptional understanding of model serving, orchestration, scaling, GPU resource management
Highly proficient using programming languages such as Go/Python/Rust, Linux environments, k8s, and AWS
Well established distributed computing principles, proven experience building high scale high performance cloud platforms and services
Extensive experience with CI/CD and an in-depth knowledge of containerization and modern deployment strategies & monitoring tools
Works well in a small, collaborative, highly productive team environment across multiple geographies
Excellent verbal and written communication skills
Bonus Qualifications: Experience with GPU-based ML inference services
Benefits
Adobe aims to make Adobe.com accessible to any and all users
Mission Systems Avionics Engineer supporting engineering design, integration, and certification of mission avionics systems. Collaborating with teams on KC - 46 project tasks in Everett, WA and Tukwila, WA.
Engineer responsible for advanced engineering work activities in control systems at AEP. Influencing direction and managing tasks across teams and departments.
Engineer III providing technical expertise and guidance in resolving complex engineering problems at Duke Energy. The role involves project management and delivering engineering solutions with minimal supervision.
Mid Level Technology Resiliency Engineer at USAA ensuring resilience and compliance across technology infrastructure. Collaborating with teams to improve risk management and operational functions.
Associate Operational Engineer at Manulife supporting cloud and server - based infrastructure. Engaging in project planning and infrastructure deployment.
Electrical Engineer focusing on DER Gateway projects for real - time monitoring of distributed energy resources. Designing communication interfaces and collaborating with grid operations teams to enhance grid reliability.
Senior Rapid Development Engineer at Humana designing and building analytics solutions to transform data into actionable insights. Collaborating with engineering teams and business stakeholders for high impact analytics products.
Lead Traffic Engineer responsible for establishing traffic engineering practice in Fort Worth. Oversee projects, mentor team, and drive business development in the transportation sector.
Civil Engineering professional managing wastewater, drinking water, and stormwater projects for Lochmueller Group. Seeking a Professional Engineer with extensive experience and project management skills.
Roadway Project Engineer designing roadway infrastructure for a growing civil engineering firm. Collaborating on various transportation projects and driving quality and excellence in delivery.