Site Reliability Engineer ensuring reliability and scalability of fintech applications. Developing automation solutions to optimize performance and support critical systems in a hybrid role.
Responsibilities
Ensure the reliability and scalability of software applications.
Develop and manage automation scripts and tools to improve system performance and efficiency.
Monitor application performance and troubleshoot issues to ensure high availability and reliability.
Collaborate with development teams to ensure best practices for deployment and operations.
Conduct root cause analysis of incidents and implement corrective actions.
Participate in on-call rotations to provide 24/7 support for critical systems.
Identify and automate repetitive tasks to reduce manual toil and errors.
Contribute to and maintain design and process documentation.
Build and configure observability and Application Performance Management (APM) tools.
Understand, champion, and enforce security and compliance policies and procedures adhering to frameworks like PCI, NIST, CIS, etc.
Continually seek opportunities to improve SLA/Uptime and minimize customer impacts.
Requirements
Proven experience as an Application Support Engineer or similar role, with leadership experience.
Strong knowledge of automation tools and scripting languages.
Ability to code automation using a structured programming language like Python.
Proficiency in Linux.
Broad knowledge of the architecture of enterprise-level information technology building blocks (e.g., Networking, Databases, Messaging, RBAC, etc.).
Understanding of internet technologies and microservice-based architecture (e.g., Web servers, encryption, XML, HTTP, Web Services, APIs).
Excellent problem-solving skills and attention to detail.
Strong communication and collaboration skills.
Focus on scalability, high availability, performance, resiliency, and reliability of software applications.
Bachelor's degree in Computer Science, Engineering, or a related field or relevant experience.
DevSecOps Engineer responsible for enhancing Thales' secure hosting platforms in public and private clouds. Collaborating with teams to apply modern practices and build resilient infrastructures.
Develops high - automation services in Golang or Java within AWS, Kubernetes, and Azure. Supports teams in building secure applications while working in a hybrid environment.
DevOps Engineer specializing in AWS Cloud Infrastructure in a hybrid position. Collaborating within a supportive team to build modern infrastructure for VM - based applications.
Leading DevOps platform strategy for KIPMI Software's next - generation digital trust products. Collaborating with teams to implement scalable infrastructure and DevSecOps practices.
Join our DevOps team to build and manage GitHub pipelines and cloud - native Azure solutions. Collaborate with teams to drive DevOps best practices and optimize deployments.
Site Reliability Engineer enhancing system reliability and deployment practices at OpenLoop. Collaborating with cross - functional teams for incident management and performance tuning.
Senior DevOps Engineer enhancing Azure application reliability for a healthcare fintech platform. Collaborating closely with engineering teams to ensure deploy safety and observability.
DevOps Engineer contributing to tooling changes and leading a community of practice at Totara. Focused on collaboration, development, and support for internal teams.
Site Reliability Engineer responsible for infrastructure supporting AI platform. Safeguarding US customer data and ensuring compliance in the Aerospace and Defense sector.