Site Reliability Engineer automating infrastructure and operations at DTEX Systems. Seeking candidates with strong software engineering background and experience in cloud environments.
Responsibilities
Design, write, and maintain software, primarily in Python, to automate the provisioning, deployment, and configuration management of our infrastructure
Contribute to the adoption and maturation of Terraform, establishing and maintaining best practices for state management, modularization, and version control
Utilize Ansible and/or Saltstack to ensure consistency, repeatability, and standardization across all environments
Develop robust CI/CD pipelines for both infrastructure and application deployments, replacing manual processes
Implement and mature monitoring, logging, and alerting systems to proactively improve system reliability
Participate in a “follow the sun” on-call rotation, focusing on sustainable incident response, blameless postmortems, and driving continuous improvement
Champion SRE principles, automation, and coding best practices within the team and across the organization
Requirements
3+ years of hands-on experience managing production environments in AWS and/or GCP.
Strong proficiency in Python.
Demonstrated ability to write clean, maintainable, and testable code to solve infrastructure problems.
Experience with Terraform, including best practices for state management and modular design in complex environments.
Strong knowledge of Linux internals and high competency in Bash scripting and command-line operations.
Proficiency with Ansible and/or Saltstack as configuration management tools.
Expert level understanding of Git and collaborative workflows, such as branching strategies and code review best practices.
MS/BS in Computer Science/Computer Engineering or related field of study (or equivalent experience).
IT Infrastructure Engineer managing on - prem and cloud infrastructure in aviation data solutions. Collaborating in a well - coordinated team for flexible project work and customer impact.
Infrastructure Architect designing and implementing scalable solutions at Regions. Collaborating with teams on enterprise - wide architecture and infrastructure improvements.
Cloud Infrastructure Engineer at EVENTIM designing AWS infrastructure and implementing DevOps practices. Collaborating with teams on scalability, security, and automation initiatives.
Infrastructure Engineer at BAE Systems Digital Intelligence designing and maintaining enterprise - grade infrastructure platforms. Role involves Linux, Windows, cloud environments, and security responsibilities.
Sr. AWS and Infrastructure Engineer defining and owning AWS infrastructure architecture for scalable production environments. Leading security architecture and compliance implementation with a focus on cost optimization and CI/CD.
Senior Infrastructure Architect at Cambio, leading IT solutions in healthcare transformation. Driving architecture and infrastructure initiatives for e - health solutions in Sweden.
Staff ML Infrastructure Engineer building and scaling robust Compute platforms for Simulation and data workflows at GM. Collaborating with engineers to drive efficiency and reliability in AI infrastructure.
IT Infrastructure Engineer managing network and digital infrastructure for Physicians Insurance, a boutique mutual insurance company. Collaborating on design, deployment, and maintenance operations.
Modern Workplace Exchange Infrastructure Architect at Avanade driving end - to - end cloud solutions with Microsoft 365. Collaborating with a large team on enterprise projects for digital transformation.
Infrastructure Specialist supporting enterprise voice platforms including Avaya and RingCentral. Balancing transformation with service stability while working in a hybrid environment.