Observability Engineer at Ryan Specialty driving strategy, implementation, and optimization of observability platforms. Ensuring system reliability and performance through advanced monitoring and analytics.
Responsibilities
Lead the design, deployment, and management of observability tools including Datadog, Dynatrace, Azure Monitor, and Splunk
Develop and maintain dashboards, alerts, and reports to provide actionable insights into system health and performance
Collaborate with infrastructure, application, and DevOps teams to define observability standards and best practices
Drive integration of observability platforms with CI/CD pipelines and cloud-native environments
Analyze telemetry data to identify trends, anomalies, and opportunities for optimization
Mentor junior engineers and promote a culture of observability across the organization
Evaluate emerging technologies and recommend enhancements to the observability stack
Ensure compliance with security and governance policies in monitoring implementations
Requirements
Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
5+ years of experience in infrastructure monitoring, observability, or site reliability engineering
Hands-on expertise with Datadog, Dynatrace, Azure Monitor, and Splunk
Strong understanding of cloud platforms (especially Microsoft Azure) and hybrid environments
Proficiency in scripting languages (e.g., Python, PowerShell) for automation and data manipulation
Experience with log aggregation, distributed tracing, and metrics collection
Excellent problem-solving skills and ability to work in a fast-paced, collaborative environment
Applicants must be authorized to work for any employer in the U.S. and we are unable to sponsor or take over sponsorship of an employment visa at this time
Benefits
Paid time off for company holidays, vacation, sick and personal days
R&D Engineer developing failure analysis systems for innovative battery technology. Collaborating with a skilled team in the revitalizing lithium supply chain.
Safety Engineer for Project GOSHAWK managing safety assurance and governance across the project. Leading safety management activities and developing trial plans within a hybrid working model.
Intermediate Validation Engineer at DATAmundi leading SSD product validation and optimization. Collaborating with engineers to ensure high - quality, next - generation memory products meet strict performance standards.
Forward Deployed Engineer developing AI - powered software at Lovable in Stockholm. Building new engineering functions and partnering with customers to create groundbreaking solutions.
Process Manufacturing Engineer developing and implementing PCB manufacturing systems at TTM Technologies. Focused on improving manufacturability and collaborating with customers and suppliers.
Process Engineering Manager at TTM Technologies leading technical direction and engineering oversight. Responsible for design robustness and manufacturability to meet project milestones and quality standards.
Senior APQP Engineer overseeing advanced product quality planning processes for mechatronic products at Interroll. Leading cross - functional teams to ensure high product and process quality from concept to production.
Senior Systems Mission Engineer supporting National Security Space systems at Aerospace. Provide engineering expertise across multiple acquisition programs for mission integration and interoperability.
Fire Pump Engineer focusing on diesel - driven pump systems across the Midlands, ensuring compliance and reliability. Join Johnson Controls, working within a supportive collaborative environment.
Designing and developing secure, policy - driven enterprise browser features for a global fintech leader. Collaborating with security architects and platform engineers to enhance system integrations and performance.