Site Reliability Engineer maintaining systems and infrastructure to ensure reliability and performance. Collaborating with developers and automating operational tasks for a robust cloud environment.
Responsibilities
Design and maintain reliable systems and infrastructure
Monitor system reliability and performance
Collaborate with development teams to ensure system robustness
Automate operational tasks and processes
Troubleshoot and resolve issues in production environments
Implement best practices for system availability, security, and performance
Mentor junior SRE team members
Requirements
5+ years of experience in site reliability engineering
Strong background in Linux/Unix systems
Proficient in scripting languages (Python, Bash, etc.)
Experience with cloud providers (AWS, Azure, Google Cloud)
Knowledge of CI/CD tools and processes
Understanding of application architecture and microservices
Excellent troubleshooting skills
Good communication skills and ability to work in a team environment.
Background in networking stack and protocols
Should be available for on-call rotations as needed
Benefits
Medical provided through Cigna (PPO, HSA, EPO options)
Medical provided through Kaiser (HMO option only) for California employees only
Dental provided through Cigna (DPPO & DHMO options)
Nationwide Vision provided through VSP
Flexible Spending Account for Health & Dependent Care
Pre-Tax Account for Commuter Benefit/Parking & Transit (location-specific)
Continuing Education and Professional Development via various integrated platforms, e.g. Udemy and Coursera
Corporate Wellness Program
Employee Assistance Program
Wellness Days
401k Plan
Basic Life, Accidental Life, Supplemental Life Insurance
Short Term & Long Term Disability
Critical Illness, Critical Hospital, and Voluntary Accident Insurance
Tuition Reimbursement (available 6 months after start date, capped)
Paid Time Off (accrued and prorated, maximum of 120 hours annually)
Paid Holidays
Any other statutory leaves, paid time, or other fringe benefits required under state and federal law
DevOps Engineer developing monitoring solutions, enhancing AIOps and Observability Platform at global healthcare company, Organon. Collaborating across teams to ensure compliance with industry standards and optimizing data processing.
Senior DevOps Engineer responsible for full stack development of energy trading products at Deutsche Börse. Implementing automated solutions and collaborating across product teams in a hybrid setting.
Cloud DevOps Engineer at RELX supporting CI/CD platform and development teams. Focusing on resilient software delivery and troubleshooting across various environments.
Join Boeing AvionX as a Software DevOps Engineer driving automation and CI/CD pipelines for cloud - native systems. Lead initiatives improving deployment pipelines and mentor engineering team.
Senior SRE responsible for ensuring system reliability and performance at Aggrandize. Collaborating with cross - functional teams and implementing SRE best practices.
Lead Oracle ERP Enterprise Architect focusing on DevSecOps and cloud - native modernization for a defense - related company. Transitioning monolithic applications to microservices and maintaining CI/CD pipelines.
Lead Oracle ERP Enterprise Architect supporting DevSecOps implementation and modernization initiatives at Credence. Overseeing CI/CD pipelines in cloud environments for defense and health organizations.
Reliability Engineer responsible for RCM program and maintenance initiatives in mining industry. Enhancing equipment reliability and collaborating with various teams.
Lead SRE for Data & Analytics platforms at Deloitte. Championing reliability, improving stability, and driving automation in a hybrid environment based in London.
RDS Engineer supporting enterprise - grade RDS environments for Wells Fargo. Building and tuning Windows Server RDS environments and collaborating with security and networking teams.