Solution Architect developing comprehensive AI infrastructure solutions for deployment at d-Matrix. Collaborating with clients to enable successful integration of d-Matrix based solutions.
Responsibilities
Develop end-to-end AI infrastructure reference solutions optimized for d-Matrix servers including compute, networking, storage, and orchestration layers, in collaboration with various internal teams.
Create reference blueprints that integrate smoothly into cloud-native and on-prem environments.
Develop infrastructure-as-code templates and examples using Ansible, Terraform, and Helm for provisioning d-Matrix-based nodes and clusters.
Integrate with Kubernetes-based systems to enable model deployment, auto-scaling, and fault-tolerant execution.
Design and deploy telemetry and monitoring frameworks to support real-time visibility into d-Matrix cluster health, job status, and system performance.
Integrate with industry-standard observability stacks (e.g., Prometheus, Grafana, OpenTelemetry) for data collection, visualization, and alerting.
Develop dashboards, health check systems, and metric pipelines that track performance, availability, and operational KPIs
Collaborate with performance and software teams to validate infrastructure using real-world workloads and benchmarks.
Incorporate telemetry hooks for benchmark reporting and feedback-driven tuning.
Create and publish detailed infrastructure deployment guides, monitoring configuration templates, and operational best practices.
Collaborate with customers and OEM/ISV ecosystem, enable them to adopt and customize reference solutions to their specific datacenter environments and/or software stacks.
Requirements
Bachelor's or Master’s degree in Computer Science, or related technical field
10+ years of experience in infrastructure solution architecture, systems management, DevOps, or platform engineering roles.
Experience working with GPUs, custom AI accelerators or heterogeneous compute environments.
Proven expertise in building, managing, and monitoring full-stack AI infrastructure at scale.
Solution Engineering Manager leading a high - performing team for B2B SaaS solutions at Ironclad. Collaborating with Sales and Customer Outcomes to drive customer value realization and team performance.
Senior Solution Architect leading design and development of solutions for defense and federal clients. Collaborating with teams to ensure technically sound solutions aligned with mission outcomes.
AI Solutions Architect promoting and selling cutting - edge AI software solutions for HP. Collaborating with business development to drive adoption and contribute to customer base growth.
AI Solutions Architect at Wavestone designing scalable AI architectures and integrating complex solutions for enterprise environments in Switzerland. Collaborating in a team focused on innovation and security.
Senior Clinical Solutions Manager at Philips driving strategic medical plans and developing clinical information for cardiac monitoring products. Collaborating with cross - functional teams to enhance product compliance and safety initiatives.
Client Solutions Consultant at Fiserv supporting executives in developing clear, compelling materials. Involves converting concepts into presentations and ensuring professional documentation.
Lead Data Solutions Architect to oversee delivery of data platform with client implementation. Collaborating closely with engineers and product teams in a hybrid role based in London.
BI & Data Solutions Consultant managing BI landscape and projects. Involved in ETL optimization, dashboard design, and promoting self - service culture.
SW customer engineer ensuring successful technology integration for Autonomous Driving systems at Mobileye. Hands - on role troubleshooting complex issues with car manufacturers and R&D teams.
Senior Solution Architect designing scalable, cloud - native solutions for telecommunications sector. Collaborating with business stakeholders to optimize and modernize large - scale enterprise applications.