Graduate Site Reliability Engineer at SiXworks developing skills in automation and cloud technologies while working in a collaborative team environment. Focus on supporting scalable systems and services through best practices in DevOps.
Responsibilities
Supporting the delivery, upgrade, and maintenance of core services and project platforms.
Helping develop and maintain monitoring and management tools.
Assisting in ensuring high‑quality monitoring, alerting, and observability across systems.
Working with engineers, developers, operations, and QA to improve reliability and performance.
Contributing to the design and documentation of new deployments and services.
Helping build proof‑of‑concepts (PoCs) for new technologies and bringing them towards production readiness.
Learning how to conduct basic security checks, vulnerability assessments, and system hardening.
Supporting automation efforts using tools such as Ansible, Terraform, or CI/CD pipelines.
Troubleshooting issues across Linux, Windows, containers, and cloud environments.
Helping ensure documentation is clear, accurate, and up to date.
Learning and applying DevOps and SRE best practices.
Collaborating with and learning from senior team members.
Requirements
Basic understanding of Linux or Windows operating systems.
Some experience with scripting (Bash, Python, PowerShell, etc.) — even from personal projects or coursework.
Interest in automation and DevOps tools (Ansible, Terraform, CI/CD, Git).
Exposure to cloud platforms such as Azure, AWS, or similar (hands‑on or theoretical).
Strong problem‑solving mindset and willingness to experiment.
Eagerness to learn new technologies and work in a fast‑paced environment.
Good communication skills and ability to work well in a team.
Junior DevOps Engineer responsible for designing and deploying scalable infrastructure in cloud environments. Collaborating on operational enhancements and security monitoring within a high - velocity environment.
DevOps Engineer at EOS imaging enhancing cloud solutions and automating processes for healthcare applications. Collaborating on international projects to ensure data compliance and efficiency.
Primary post - sales technical owner ensuring reliability of ML workloads for strategic customers at AI company. Collaborating with teams to drive technical success and product improvements.
Site Reliability Engineer ensuring scalable infrastructure in AI product deployment for top AI companies. Involves building automated processes and collaborating across teams.
Engineer supporting enterprise - scale Microsoft 365 environment at NIH. Implementing automated testing frameworks and secure development practices in Federal Government program.
Senior Cloud Engineer developing cloud - native applications and optimizing CI/CD pipelines at GRAYOAK. Collaborating in interdisciplinary teams on innovative cloud projects with a focus on data and AI.
Senior Manager Site Reliability Engineering at WEX ensuring system scalability and resilience while leading engineering best practices. Collaborating with cross - functional teams to enhance reliability across platforms.
SRE DevOps Engineer developing scalable solutions for Consumer Products and Retail Services at Capgemini. Focusing on Kubernetes, Terraform, and CI/CD automation with a flexible work culture.
DevOps Analyst at SONDA managing integrations of technological solutions in Brasília. Focused on infrastructure management and continuous improvement of processes.