Develop and lead enterprise observability and reliability capabilities for Parts Town's systems using Dynatrace. Collaborate across teams to ensure comprehensive monitoring and improve performance and incident outcomes.
Responsibilities
Own enterprise observability using Dynatrace across cloud, on-prem, ERP, WMS, eCommerce, APIs, and integrations
Design service topology, dashboards, alerts, and health indicators that reflect business impact
Apply SRE principles (SLIs, SLOs, error budgets where appropriate) to reduce incidents and improve resilience
Accelerate incident detection and root-cause analysis; lead post-incident reviews focused on systemic fixes
Identify reliability, performance, and capacity risks before they impact the business
Define observability and SRE standards and enable teams to use them effectively
Requirements
7+ years in infrastructure, platform, operations, or reliability engineering
Hands-on experience implementing and operating Dynatrace
Strong understanding of distributed systems, cloud/hybrid environments, and integrations
Practical experience with SRE or reliability engineering concepts
Comfortable operating in high-impact incident and production environments
Benefits
Quarterly profit-sharing bonus
Hybrid Work schedule
Team member appreciation events and recognition programs
Volunteer opportunities
Monthly IT stipend
Casual dress code
On-demand pay options: Access your pay as you earn it, to cover unexpected or even everyday expenses
Cloud DevOps Specialist responsible for building scalable infrastructure solutions in AWS at SONDA. Focusing on automation, containerization, and data management in a collaborative environment.
DevOps Engineer maintaining and evolving deployment pipelines for Docebo’s AI - powered learning platform. Collaborating with cross - functional teams to ensure efficient software releases and infrastructure management.
DevOps Engineer optimizing CI/CD pipelines for Docebo, an AI - powered learning platform. Involves managing multi - tenant infrastructure using AWS, Docker, and Kubernetes.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes for cybersecurity solutions by NordLayer. Collaborating with teams to ensure performance and scalability of cloud services.
DevOps Engineer maintaining and improving infrastructure and CI/CD processes for cybersecurity solutions provider. Collaborating with cross - functional teams for reliable and scalable cloud solutions.
DevOps Engineer maintaining and automating infrastructure and CI/CD processes at NordLayer. Collaborating with Senior Engineers to implement best practices in a dynamic cybersecurity environment.
Secure DevOps Engineer responsible for integrating security into CI/CD pipelines and strengthening AWS infrastructure. Key expertise in AWS security and container management.
DevOps Engineer responsible for CI/CD pipeline development and automation for urban software solutions. Collaborating with teams to enhance efficiency in software deployment and infrastructure.
DevOps Engineer managing cloud and on - premise platforms for a public sector infrastructure project. Collaboration primarily remote, with occasional on - site meetings.
DevSecOps Engineer architecting CI/CD framework services for Truist, enhancing the flow of business value through DevSecOps practices. Building and maintaining automation for software delivery and operations.