Software Engineer developing backend services for AION's AI cloud platform. Collaborating with teams to implement high-performance, scalable distributed systems in a hybrid work environment.
Responsibilities
Build and maintain platform services across AION's Compute and Inference platforms, working closely with senior engineers and platform leads
Implement features for multi-cloud orchestration, resource scheduling, model deployment pipelines, and autoscaling systems
Write well-maintained, production-grade code with proper abstractions, design patterns, and comprehensive test coverage
Contribute to low-level design (LLD) including service APIs, database schema design, data models, and component interactions
Collaborate with senior engineers on high-level design discussions, providing implementation perspectives and feasibility inputs
Develop RESTful APIs and gRPC services for platform control planes, resource management, and inference serving
Design and implement database schemas for storing platform state, resource metadata, billing data, and observability metrics
Work with distributed storage systems, message queues (Kafka, RabbitMQ), and databases (PostgreSQL, Redis) to build reliable platform components
Build event-driven architectures for asynchronous processing, job scheduling, and platform automation
Implement monitoring, logging, and alerting for platform services to ensure production reliability
Write comprehensive unit tests, integration tests, and end-to-end tests to ensure code reliability
Participate in code reviews, providing constructive feedback and learning from senior engineers' perspectives
Refactor existing code to improve maintainability, performance, and scalability
Document design decisions, API specifications, and operational runbooks for platform services
Debug production issues and contribute to incident response and post-mortems
Requirements
2-4 years of experience in backend engineering, platform development, or distributed systems
Strong proficiency in Golang—you write idiomatic Go code with proper error handling, concurrency patterns, and testing
Solid understanding of backend systems fundamentals: RESTful APIs, microservices architecture, and API design principles
Hands-on experience with databases (PostgreSQL, MySQL) including schema design, query optimization, and transactions
Familiarity with storage systems (object storage like S3, block storage, distributed file systems) and their use cases
Experience working with message queues (Kafka, RabbitMQ, NATS) and event-driven architectures
Understanding of distributed systems concepts: consensus, eventual consistency, fault tolerance, and retry mechanisms
Experience with containerization (Docker) and basic Kubernetes concepts
Knowledge of testing frameworks and practices (unit tests, integration tests, mocking)
Familiarity with Git, CI/CD pipelines, and modern development workflows
Exposure to cloud platforms (AWS/GCP/Azure) and their core services is a plus
Experience with infrastructure-as-code (Terraform) or observability tools (Prometheus, Grafana) is beneficial
Benefits
**Preferred Attributes:**
Founder-level ownership and bias for action.
Strong strategic thinking and ability to connect technical decisions to business impact.
Excellent communication and mentoring skills.
Thrives in ambiguity, fast-paced environments, and early-stage startup culture.
**Why Join AION?**
Work directly with high-pedigree founders shaping technical and product strategy.
Build infrastructure powering the future of AI compute globally.
Significant ownership and impact with equity reflective of your contributions.
Competitive compensation, flexible work options, and wellness benefits
Software Engineer II developing secure and scalable logistics solutions at Teleport. Collaborating in a hybrid environment to drive innovative tech solutions.
Senior Software Engineer leading development of AI - native systems and tools for financial decision making. Collaborating in a hybrid environment to deliver impactful software solutions.
Senior Full Stack Software Engineer at Fiserv enhancing the Clover platform and mentoring teams. Engage in building reliable applications using Java, Typescript, and modern frameworks.
Lead Engineer providing technical expertise for CSF - related issues and customer deployments. Role involves troubleshooting complex issues and collaborating with development teams.
Senior Software Developer creating software solutions for healthcare company Solventum. Leading design sessions and driving innovations in health, material, and data science.
Full Stack Developer developing critical systems supporting Post trade management at SEB. Collaborating within an agile team to innovate for a sustainable financial future.
Senior Fullstack developer at SEB developing AI product in cloud data portfolio. Collaborating with teams to deliver exceptional results in Investment Advice area.
Junior Software Engineer at Hedvig developing innovative insurance products and collaborating closely with cross - functional teams while working with modern technologies.
Engineering Lead leading a squad of engineers to develop the Digital Twin Platform at Woven by Toyota. Engaging with customers and ensuring quality deliverables in a hybrid environment.
Junior Fullstack Developer at Etinet responsible for frontend and backend development. Collaborating with teams to build scalable and high - performance digital solutions.