Senior DevOps Engineer enhancing Azure application reliability for a healthcare fintech platform. Collaborating closely with engineering teams to ensure deploy safety and observability.
Responsibilities
Own and evolve the reliability, observability, and performance of our Azure-based application ecosystem.
Design and implement end-to-end observability for Azure App Services using Application Insights, Azure Monitor, and distributed tracing.
Build and maintain actionable dashboards that surface application health, performance bottlenecks, and reliability risks.
Analyze logs, metrics, and traces to proactively identify issues and reduce MTTD / MTTR.
Serve as a subject-matter expert on Azure App Service architecture, scaling, and performance tuning.
Debug production issues at the application level, including C#, .NET, Angular, and SQL-related problems.
Automate monitoring, alerting, and common remediation workflows using PowerShell and Azure tooling.
Participate in on-call rotations and lead incident response for application reliability issues.
Document best practices for monitoring, debugging, and deployment safety.
Requirements
Bachelor’s degree in Computer Science, IT, or equivalent experience.
Proven experience in application-focused DevOps or SRE roles, with deep hands-on Azure work.
Strong experience with Azure App Services, Application Insights, and Azure Monitor.
Ability to read, understand, and debug .NET/C# and Angular code.
Experience with incident response, on-call operations, and RCA writing.
Comfort operating in fast-moving environments with increasing deployment velocity.
Bonus Experience: Grafana, Prometheus, Datadog, or Dynatrace; Azure Front Door, CDN, Function Apps, WebJobs; Service Bus, Event Hub, or distributed messaging; Additional Azure certifications.
Benefits
Medical, dental, vision, generous PTO
Competitive compensation: Salary, bonus eligibility, and 401(k) matching
Lead DevOps Engineer at Incogni evolving infrastructure during monolith - to - microservices transitions. Building self - service platforms and ensuring observability in a fast - growing consumer privacy - tech product.
Senior Site Reliability Engineer maintaining reliability and user experience of AI services for Woven by Toyota. Collaborating with engineering teams to ensure service availability and performance.
DevOps Specialist supporting the engineering and operational enablement of next - gen data center platforms at KONE. Involves Infrastructure - as - Code deployments and daily DevOps workflows.
GitHub Enterprise Specialist managing KONE's GitHub ecosystem, ensuring secure and scalable workflows. Collaborating with teams to enhance developer productivity through AI - powered capabilities.
Senior Software Engineer responsible for designing microservices and enhancing LLM performance for Fortanix's Generative AI platform. Collaborating with data science and ML Infrastructure teams for security and optimization.
Reliability Engineering Technician conducting various verification tests and collaborating with reliability engineers. Preparing technical documentation in a well - equipped laboratory environment in Poland.
Reliability Engineer ensuring quality and reliability of products. Conducting various verification tests in a well - equipped laboratory in Mierzyn, Poland.
Senior SRE driving incident management and operational excellence in financial software solutions. Working with innovation and technology in Brazil's leading software company's team.
Salesforce DevOps Engineer focused on CI/CD pipeline management for Salesforce at S&P Global Mobility. Collaborating with cross - functional teams to ensure stable and secure releases.
Senior DevOps Engineer designing and building infrastructure for AI workloads across cloud and edge environments. Collaborating with engineering teams to implement scalable, automated solutions.