Site Reliability Engineer ensuring reliable operation of payment platform at CellPoint Digital. Collaborates with teams to drive automation and reliability across global infrastructure.
Responsibilities
As an SRE at CellPoint Digital, you’ll be a key player in ensuring our payment platform runs reliably, securely, and at scale—processing thousands of payments per second
Working closely with our Product, Development, and Architecture teams, you’ll blend hands-on operational excellence with a software engineering mindset to drive automation, observability, and reliability across our global infrastructure
Requirements
Ensure the production environment runs smoothly, with a holistic view of system health
Build software and systems to manage infrastructure and applications
Drive improvements in reliability, quality, and delivery speed of our payment solutions
Measure and optimize system performance, always looking to innovate and get ahead of customer needs
Provide operational support and engineering expertise for large-scale, distributed systems
Collaborating with Product, Development, and Architecture to define and share SLAs, and improve system reliability
Partnering with our Release Manager to deploy and troubleshoot new versions of our platform and services
Participating in an on-call (Grafana IRM) rotation to respond to incidents impacting availability and supporting internal engineering teams
Preventing incidents through robust automation, monitoring, and proactive engineering
Running our modern stack: Google Cloud Platform, Kubernetes, Terraform, Github Actions, etc.
Designing, building, and maintaining core infrastructure that supports massive scale and high availability
Debugging production issues across services and infrastructure layers
Planning and executing infrastructure growth to meet future demand
Benefits
Competitive salary in a fast-growing start-up
Rewards & Recognition system
Opportunity for personal and professional growth in a dynamic industry
Work from anywhere in the world; we're a fully distributed company, and we provide the tools, culture, and support to make your work setup work for you
Occasional travel to Europe (UK, Copenhagen, Bulgaria)
Join Boeing AvionX as a Software DevOps Engineer driving automation and CI/CD pipelines for cloud - native systems. Lead initiatives improving deployment pipelines and mentor engineering team.
Senior SRE responsible for ensuring system reliability and performance at Aggrandize. Collaborating with cross - functional teams and implementing SRE best practices.
Lead Oracle ERP Enterprise Architect focusing on DevSecOps and cloud - native modernization for a defense - related company. Transitioning monolithic applications to microservices and maintaining CI/CD pipelines.
Lead Oracle ERP Enterprise Architect supporting DevSecOps implementation and modernization initiatives at Credence. Overseeing CI/CD pipelines in cloud environments for defense and health organizations.
Reliability Engineer responsible for RCM program and maintenance initiatives in mining industry. Enhancing equipment reliability and collaborating with various teams.
Lead SRE for Data & Analytics platforms at Deloitte. Championing reliability, improving stability, and driving automation in a hybrid environment based in London.
RDS Engineer supporting enterprise - grade RDS environments for Wells Fargo. Building and tuning Windows Server RDS environments and collaborating with security and networking teams.
Senior DevSecOps Engineer managing Azure to AWS migration for AccuSourceHR. Leading cloud architecture, CI/CD implementation, and ensuring security and reliability in production systems.
Site Reliability Engineer ensuring infrastructure reliability and performance for Hornetsecurity. Collaborating across product, business, and infrastructure teams in a critical environment.
Senior DevOps Engineer developing core infrastructure supporting Shelf products. Focused on building reliable, secure, and scalable systems in hybrid work environment.