Senior Site Reliability Engineer at Unify addressing reliability challenges and scaling data infrastructure. Collaborating on backend services and ensuring stable platform performance for enterprise customers.
Responsibilities
Scale our data infrastructure: Optimize and extend our ClickHouse and PostgreSQL deployments—designing partitioning strategies, tuning queries, and improving replication and failover systems.
Improve system performance: Profile and optimize critical paths across backend services, identify bottlenecks in data pipelines and API layers, and ship changes that improve latency and throughput.
Build for reliability: Implement rate limiting, circuit breakers, graceful degradation, and other patterns that keep the platform stable under load and during partial failures.
Automate everything: Write tooling that eliminates toil—automating deployments, scaling operations, backup verification, and incident remediation.
Instrument and observe: Build out distributed tracing, metrics, and alerting that give engineers clear visibility into system behavior and accelerate debugging.
Respond and learn: Participate in on-call rotations, run incident response, and drive blameless postmortems that prevent recurrence.
Requirements
5+ years of software engineering experience with a strong backend foundation, including 2+ years focused on reliability, infrastructure, or platform work.
Hands-on experience operating databases at scale including query optimization, replication, and failover.
Strong programming skills (Typescript, Python, Go, or similar) with experience building automation and tooling.
Able to diagnose complex distributed systems issues under pressure and communicate clearly during incidents.
Collaborative, low-ego attitude and desire to work in a fast-paced environment.
Lead DevOps Engineer modernizing infrastructure and automation for Wells Fargo’s Consumer Technology platforms. Collaborating across teams to build scalable solutions and elevate engineering excellence.
Senior DevOps Engineer re - envisioning enterprise level applications at Ryan. Designing and maintaining cloud infrastructure for optimal service delivery.
Reliability Engineer focusing on risk minimization and maintenance strategies in an innovative PEM electrolyzer company. Collaborating cross - functionally to enhance equipment and systems performance.
Principal Site Reliability Engineer at Red Hat managing the RHIVOS product SRE initiative. Focusing on infrastructure reliability and continuous improvement with deep technical expertise in engineering.
DevOps Azure Developer specializing in end - to - end application development at global healthcare leader Abbott. Engaging in CI/CD processes and building secure cloud applications using Azure and Python.
DevSecOps Engineer at Livingston ensuring security in CI/CD pipelines and building resilient systems. Collaborating with teams to integrate best practices in software development.
Reliability Engineer at LANXESS improving the reliability of fixed and rotating equipment. Partnering with Engineering and Operations to ensure asset safety and performance.
Cloud Engineer at Agility Technologies leading the design of scalable eLearning infrastructure. Collaborating on technical design and implementation involving cloud - based platforms and secure integrations.
Senior Hardware Reliability Engineer overseeing reliability testing and analysis of outdoor electronic assemblies at Gridware. Collaborating with mechanical engineers and contributing to product lifetimes modeling.