Software Engineer developing backend services for AION's AI cloud platform. Collaborating with teams to implement high-performance, scalable distributed systems in a hybrid work environment.
Responsibilities
Build and maintain platform services across AION's Compute and Inference platforms, working closely with senior engineers and platform leads
Implement features for multi-cloud orchestration, resource scheduling, model deployment pipelines, and autoscaling systems
Write well-maintained, production-grade code with proper abstractions, design patterns, and comprehensive test coverage
Contribute to low-level design (LLD) including service APIs, database schema design, data models, and component interactions
Collaborate with senior engineers on high-level design discussions, providing implementation perspectives and feasibility inputs
Develop RESTful APIs and gRPC services for platform control planes, resource management, and inference serving
Design and implement database schemas for storing platform state, resource metadata, billing data, and observability metrics
Work with distributed storage systems, message queues (Kafka, RabbitMQ), and databases (PostgreSQL, Redis) to build reliable platform components
Build event-driven architectures for asynchronous processing, job scheduling, and platform automation
Implement monitoring, logging, and alerting for platform services to ensure production reliability
Write comprehensive unit tests, integration tests, and end-to-end tests to ensure code reliability
Participate in code reviews, providing constructive feedback and learning from senior engineers' perspectives
Refactor existing code to improve maintainability, performance, and scalability
Document design decisions, API specifications, and operational runbooks for platform services
Debug production issues and contribute to incident response and post-mortems
Requirements
2-4 years of experience in backend engineering, platform development, or distributed systems
Strong proficiency in Golang—you write idiomatic Go code with proper error handling, concurrency patterns, and testing
Solid understanding of backend systems fundamentals: RESTful APIs, microservices architecture, and API design principles
Hands-on experience with databases (PostgreSQL, MySQL) including schema design, query optimization, and transactions
Familiarity with storage systems (object storage like S3, block storage, distributed file systems) and their use cases
Experience working with message queues (Kafka, RabbitMQ, NATS) and event-driven architectures
Understanding of distributed systems concepts: consensus, eventual consistency, fault tolerance, and retry mechanisms
Experience with containerization (Docker) and basic Kubernetes concepts
Knowledge of testing frameworks and practices (unit tests, integration tests, mocking)
Familiarity with Git, CI/CD pipelines, and modern development workflows
Exposure to cloud platforms (AWS/GCP/Azure) and their core services is a plus
Experience with infrastructure-as-code (Terraform) or observability tools (Prometheus, Grafana) is beneficial
Benefits
**Preferred Attributes:**
Founder-level ownership and bias for action.
Strong strategic thinking and ability to connect technical decisions to business impact.
Excellent communication and mentoring skills.
Thrives in ambiguity, fast-paced environments, and early-stage startup culture.
**Why Join AION?**
Work directly with high-pedigree founders shaping technical and product strategy.
Build infrastructure powering the future of AI compute globally.
Significant ownership and impact with equity reflective of your contributions.
Competitive compensation, flexible work options, and wellness benefits
Senior Software Engineer for developing scalable frameworks and systems at an AI customer service solution. Joining the Foundations team to ensure reliability and performance in production environments.
CI/CD Software Engineer at Serko enhancing developer experience and engineering culture. Building and improving platform engineering capabilities for reliable product delivery.
Software Engineer contributing to design and development of features for Simpro Group SaaS products. Collaborating with teams to create scalable and reliable solutions.
Lead Software Engineer at Simpro Group responsible for designing and developing scalable applications. Providing technical direction and mentorship while ensuring high - quality product delivery in a collaborative environment.
Technical Lead Engineer overseeing HVDC engineering projects at Hitachi Energy. Collaborating on design, contract management, and construction support while ensuring compliance with regulations.
Senior Embedded Software Developer at NewTec working on safety - critical systems in diverse industries. Leading project teams and developing embedded software solutions with a societal impact.
Software Engineer at Contour Education, creating impactful software solutions for students. Collaborating on full - stack features and enhancing the Learning Portal experience.
GPU Performance Engineer in Micron's Smart Manufacturing and AI team. Focusing on large - scale modeling, optimization, and deployment of AI solutions in their memory solutions.
Senior Project Engineer leading Smart Manufacturing projects at Cargill to enhance process efficiency. Focus on engineering new technologies and improving existing manufacturing systems.
Backend infrastructure developer working on software that powers kiosks for global checkout experience. Join a high - impact team at Mashgin to create innovative AI solutions.