Site Reliability Engineer driving innovation and growth for the Banking Solutions, Payments, and Capital Markets businesses. Responsible for application reliability and incident response in a hybrid work environment.
Responsibilities
Design and maintain monitoring solutions for infrastructure, application performance, and user experience
Implement automation tools to streamline tasks, scale infrastructure, and ensure seamless deployments
Ensure application reliability, availability, and performance, minimizing downtime and optimizing response times
Lead incident response, including identification, triage, resolution, and post-incident analysis
Conduct capacity planning, performance tuning, and resource optimization
Collaborate with security teams to implement best practices and ensure compliance
Manage deployment pipelines and configuration management for consistent and reliable app deployments
Develop and test disaster recovery plans and backup strategies
Collaborate with development, QA, DevOps, and product teams to align on reliability goals and incident response processes
Participate in on-call rotations and provide 24/7 support for critical incidents
Requirements
Proficiency in development technologies, architectures, and platforms (web, API)
Experience with cloud platforms (AWS, Azure, Google Cloud) and IaC tools
Knowledge of monitoring tools (Prometheus, Grafana, DataDog) and logging frameworks (Splunk, ELK Stack)
Experience in incident management and post-mortem reviews
Strong troubleshooting skills for complex technical issues
Proficiency in scripting languages (Python, Bash) and automation tools (Terraform, Ansible)
Experience with CI/CD pipelines (Jenkins, Harness, GitLab CI/CD, Azure DevOps)
Ownership approach to engineering and product outcomes
Excellent interpersonal communication, negotiation, and influencing skills
Bachelor’s degree in Computer Science, Computer Engineering, or a related field, or equivalent experience
Benefits
Opportunities to innovate in fintech
Tools for personal and professional growth
Inclusive and diverse work environment
Resources to invest in your community
Competitive salary and benefits
Job title
Principal Site Reliability Engineer – Software Engineering