Reliability Engineer focusing on supporting production applications and improving system reliability at U.S. Bank. Collaborating with teams to automate processes and reduce incidents.
Responsibilities
Supporting production applications and proactively looking for ways to automate discoveries
Eliminating incidents from recurring and/or reducing the time it takes to get customers back up and running
Improving availability, latency, performance, efficiency, and effective proactive monitoring
Interfacing with business users, development teams and system administrators
Developing, coordinating, and conducting technical reliability studies on engineering designs
Measuring and analyzing the reliability of design, materials, processes, cost, and final products
Recommending design or test methods and statistical process control procedures
Completing risk analysis studies of new designs and processes
Undertaking testing and analysis on failures, proposing changes in design or formulation to improve system and/or process reliability
Requirements
Bachelor's degree, or equivalent work experience
Five to seven years of relevant work experience in business and risk analysis, IT Service Management, production support, product/project management, or application development
Proven experience as a Site Reliability Engineer or similar role.
Strong knowledge of monitoring tools and incident management.
Proficiency in Python or Powershell
Excellent problem-solving and troubleshooting skills.
Strong experience with AWS or Azure services
Experience with Docker and container clustering technologies like AWS ECS or Kubernetes
Experience with monitoring and logging tools such as Data Dog, Splunk, Elasticsearch, Kibana and CloudWatch
Experience using GitLab/GitHub for version control and/or you’ve tracked work
Strong communication and collaboration abilities.
Financial Services industry experience a plus.
Benefits
Healthcare (medical, dental, vision)
Basic term and optional term life insurance
Short-term and long-term disability
Pregnancy disability and parental leave
401(k) and employer-funded retirement plan
Paid vacation (from two to five weeks depending on salary grade and tenure)
Up to 11 paid holiday opportunities
Adoption assistance
Sick and Safe Leave accruals of one hour for every 30 worked, up to 80 hours per calendar year unless otherwise provided by law
DevSecOps Engineer responsible for embedding security controls in CI/CD at Keyloop. Collaborate with engineering teams to integrate security in build and deployment workflows.
DevOps Engineer modernizing infrastructure for a fintech company focused on empowering e - commerce businesses. Engaging in hands - on work with GCP and Kubernetes to establish reliable, efficient deployment pipelines.
DevSecOps Engineer supporting AI - enabled financial compliance initiative for the Department of War. Responsible for designing secure infrastructure and collaborating with cross - disciplinary teams.
(Senior) DevOps Engineer with a focus on CI/CD and cloud infrastructure management for e - commerce solutions. Collaborating across teams to ensure automated, scalable deployments.
Senior DevOps Engineer managing monitoring systems for B2B e - commerce platforms in Azure Cloud. Collaborating with teams to improve platform products and processes.
DevSecOps Expert managing deployments, monitoring systems, and providing technical support in Brussels. Role involves close collaboration with Development and IT teams at a major client's site.
Junior and DevOps Engineers designing and running secure cloud - native platforms for UK public - sector organisations. Collaborating with teams to streamline deployment and automate infrastructure workflows.
DevOps Engineer automating cloud - native infrastructure for public - sector organizations. Join an agile team to enhance deployment processes and support critical systems.
DevOps Engineer designing and constructing secure cloud - native platforms for public - sector organizations across the UK. Leading technical decisions while collaborating closely with clients.
DevOps Engineer at Gemba designing secure, cloud - native platforms for public - sector organizations. Leading technical decisions and collaborating to solve complex challenges for critical systems.