AI Infrastructure Engineer focusing on scalable backend systems for AI workflows in a fast-paced startup. Collaborating on reliability, data performance, and infrastructure for rapid growth.
Responsibilities
Design and implement scalable backend architectures for AI workloads (inference, orchestration, monitoring).
Own distributed job orchestration with Temporal and related systems.
Improve data pipeline performance by designing smarter caching strategies (e.g., file deduplication, hot/cold storage, Redis caching layers) to reduce redundant compute and API calls.
Build observability, monitoring, retries, and fault tolerance into all workflows.
Manage infrastructure reliability, incident response, and performance.
Develop tooling and platform infrastructure to support rapid growth.
Partner with ML engineers to bring models to production at scale.
Requirements
4+ years of backend engineering (Python is a must).
Strong background in distributed systems, job orchestration, and task queues.
Deep knowledge of concurrency, parallelism, and multithreading—including async/await, event loops, thread pools, synchronization primitives, deadlocks, and race conditions—is a must.
Hands-on experience with Temporal, Redis, Airflow, Celery, RabbitMQ (or similar).
Experience with LLM serving and routing fundamentals (rate limiting, streaming, load balancing, budgets).
Comfortable with containers & orchestration: Docker, Kubernetes.
Familiarity with cloud platforms (AWS/GCP) and IaC (Terraform).
Experience with multiple storage systems: S3, Postgres, MongoDB, Redis, and Elasticsearch.
Track record scaling systems in startups or fast-paced environments.
Understanding of deploying, monitoring, and optimizing AI/ML systems in production with strong CI/CD practices.
Kubernetes Infrastructure Engineer focused on developing software infrastructure for Quantum Key Distribution as a service. Join zerothird in Vienna, a leader in quantum cryptography technology.
IT Infrastructure Engineer managing on - prem and cloud infrastructure in aviation data solutions. Collaborating in a well - coordinated team for flexible project work and customer impact.
Infrastructure Architect designing and implementing scalable solutions at Regions. Collaborating with teams on enterprise - wide architecture and infrastructure improvements.
Cloud Infrastructure Engineer at EVENTIM designing AWS infrastructure and implementing DevOps practices. Collaborating with teams on scalability, security, and automation initiatives.
Infrastructure Engineer at BAE Systems Digital Intelligence designing and maintaining enterprise - grade infrastructure platforms. Role involves Linux, Windows, cloud environments, and security responsibilities.
Sr. AWS and Infrastructure Engineer defining and owning AWS infrastructure architecture for scalable production environments. Leading security architecture and compliance implementation with a focus on cost optimization and CI/CD.
Senior Infrastructure Architect at Cambio, leading IT solutions in healthcare transformation. Driving architecture and infrastructure initiatives for e - health solutions in Sweden.
Staff ML Infrastructure Engineer building and scaling robust Compute platforms for Simulation and data workflows at GM. Collaborating with engineers to drive efficiency and reliability in AI infrastructure.
IT Infrastructure Engineer managing network and digital infrastructure for Physicians Insurance, a boutique mutual insurance company. Collaborating on design, deployment, and maintenance operations.