About the role

Senior Site Reliability Engineer at ABBYY, working on critical production service designs and reliability improvements on Azure cloud applications

Responsibilities

Сo-own critical production service designs to ensure high reliability is achievable and measurable Drive reliability and observability improvements in the services within the engineering verticals
Using monitoring and telemetry data, help teams make informed decisions on where reliability challenges may exist and help design and build solutions to improve them
You will build SRE dashboards from SLIs to measure SLO adherence
You will be supporting Production applications which are hosted in Azure cloud
Build and improve internal tools and automation software to make maintaining production services easier and safer
Lead reliability-focused practices such as Failure Analysis, Load and Capacity Planning, Service Reviews, Architecture Designs, Incident Postmortems, and others
Developing Infrastructure as a Code.
Define (from design to implementation details) necessary auto-healing and fault-tolerant systems
Point of contact for production application issues, working closely with engineering leadership

Requirements

7-10 Years IT Experience
Proven experience at least one cloud technology - Azure or AWS.Preferibily Azure
Proficient in Kubernetes, AKS, Azure Function, Storage account, and others
Proven experience in Microsoft Technologies, Windows server, IIS(Preferred)
Distributed monitoring experience in Grafana: logging, metrics, tracing, etc.
Matching years of experience to level in an Infrastructure, SRE, DevOps, CloudOps role
Experience working in SRE team in a dynamic and fast paced environment
Experience programming in one or more of the following: C#, Java, Python, .Net, NodeJS, Go,
Experience with Terraform, Ansible, or any similar programming language
Experience with cloud-performant microservices and event-driven architectures
Experience with Kubernetes administration is an added advantage.
Experience with database performance monitoring tools (e.g Percona Toolkit, SQL Profiler).
Proven experience in diagnosing and resolving indexing issues in relational databases (e.g., PostgreSQL, MySQL, SQL Server, Oracle).
Strong understanding of query optimization, execution plans, and database internals.
Understanding of information security concepts and terminology
Strong knowledge of software development methodologies and passion for creating high-standard tool sets for infrastructure-as-code
Ability to analyze problems quickly and find suitable solutions based on available resources
A proactive and open-minded individual with a clean-cut client focus and structured approach
Experience in leading and managing a small team
Comfortable working US hours aligned to the PST time zone.

Benefits

We provide remote and hybrid working options to fit all lifestyles.
We use flexible hours across most of our teams to allow you to find your own definition of balance.
Encouraging a culture of giving, we provide two paid volunteering days off every year so you can take time to contribute to the causes you care about.
To ensure your family is cared for, we offer paid parental leave in all our locations.

Hybrid Senior Site Reliability Engineer – US Working Hour

at ABBYY

About the role

Responsibilities

Requirements

Benefits

Job title

Job type

Experience level

Salary

Degree requirement

Tech skills

Location requirements

Report this job

Similar roles

DevOps Specialist

Evlo

Software Quality and Release Engineer

Turion Space

Site Reliability Engineer, DevOps

Exacaster

Senior DevOps Engineer

Exacaster

Design and Release Engineer – Mirror Systems

Ford Motor Company

Site Reliability Engineer

VALCE Talent Solutions

Senior DevOps Engineer

Stillfront Group

Mainframe DevOps Engineer – SCM Migration SME

Kyndryl

Observability & DevOps Tools Engineering Manager

RELX

DevOps/MLOps Engineer

Niyam IT