Reliability and Performance Engineer at Broadvoice ensuring infrastructure stability and efficiency. Driving initiatives for system reliability and supporting SLO/SLI framework in a collaborative environment.
Responsibilities
Design and implement comprehensive reliability strategies to ensure that infrastructure and applications remain resilient, scalable, and deliver consistently high performance
Own the optimization of system performance by proactively identifying bottlenecks and anticipating challenges to minimize customer impact and maintain seamless service
Work closely with Infrastructure and Product teams to define and align SLOs and SLIs that reflect both technical goals and exceptional user experience in a real-time communications environment
Lead root cause analyses and post-incident reviews to drive continuous improvements in system reliability, availability, and operational maturity
Develop automation and tooling to empower teams in monitoring, testing, and enhancing reliability at scale across platforms
Manage capacity planning and forecasting efforts to ensure the communication platforms scale efficiently alongside business growth and increasing customer demand
Participate actively in on-call rotations, contributing to a culture of operational excellence, rapid incident response, and minimizing downtime in a mission-critical environment
Requirements
3+ years of experience in SRE, performance engineering, or infrastructure roles
Strong understanding of distributed systems, cloud infrastructure (AWS preferred), and networking
Proficiency in observability tools (e.g., Prometheus, Grafana, Datadog)
Experience with performance profiling, load testing, and benchmarking
Solid scripting or programming skills (e.g., Python, Go, Bash)
Familiarity with SIP/VoIP systems and real-time communications
Experience working in a SaaS or multi-region infrastructure environment
Knowledge of incident management and reliability frameworks (e.g., Google SRE model)
Systems & Safety Assurance Engineer enhancing safety and performance in Queensland Rail’s Major Projects team. Providing expert analysis and assurance for the Logan and Gold Coast Fast rail project.
Junior Technical Engineer at Trade Nation providing technical support and troubleshooting for staff issues. Involving hands - on coordination and administration of IT infrastructure and services.
Senior Reservoir Engineer at Deep Sky specializing in CO2 storage solutions across Canada. Leading dynamic reservoir modeling and regulatory applications in a hybrid work setting.
Manufacturing Engineer producing engineering outputs for aerospace projects. Collaborating with teams to ensure quality and efficiency within the production processes.
Junior Engineer Approvals at GROHE managing product certifications for water systems. Engaging with internal and external partners to ensure compliance with standards and norms.
Forward Deployed Engineer embedded with enterprise clients at WRITER, optimizing AI deployment while serving as a technical liaison. Requires in - depth AI expertise and software development skills.
MES Engineer developing and delivering solutions based on Rockwell Automation MES system. Designing modules and troubleshooting according to customer requirements.
Release Train Engineer facilitating Agile Release Train processes for a global technology leader in automation. Leading execution, transparency, and continuous improvement across multiple Agile teams.
Staff PDK Engineer assisting in the design of high - performance mixed - signal ASICs at Cirrus Logic. Collaborating on PDK development and EDA methodology.
System - level V&V Engineer ensuring compliance and traceability for medical device engineering. Collaborate and execute plans and reports while maintaining documentation under ISO standards.