Site Reliability Engineer at BAE Systems improving service availability and team collaboration. Engaging with the community to develop local tech and cyber skills while supporting core applications.
Responsibilities
Supporting and maintaining essential service that support core mission applications
Proactively enhancing their availability, performance and stability
Finding innovative solutions to problems rather than undertaking repetitive work, automating everything you can
Advising product teams of good practice in how to design and build systems
Instrumenting applications where they don’t have sufficient monitoring in place
Participating in the wider DevOps/SRE community within the organisation
Requirements
Experience in software development in Java and web technologies, e.g. JavaScript and HTML
Familiarisation with database technologies such as Elastic, Mongo
Knowledge of Linux and Windows command lines, e.g. Bash and PowerShell
Hands-on experience with cloud infrastructure such as AWS, Azure or OpenStack
Use of deployment tools such as chef and puppet
Expertise in monitoring large systems using technologies such as ELK
Experience of working in an Agile scrum team, and the tooling that supports it, e.g. Jira
Diagnosing and troubleshooting application issues resulting in service outages
Troubleshooting skills across different levels of the stack
Understanding of ITIL terminology
Experience with container management and micro-services architectures such as Docker
Familiarisation with automation test frameworks such as Selenium
Awareness and insight into technology trends to adopt new cutting edge tools
DevOps Engineer automating and optimizing software development lifecycle processes at COSMOTE Global Solutions. Designing and managing containerized infrastructure on Azure and implementing CI/CD.
Senior DevOps Engineer at Elliptic shaping DevOps culture and driving automation across engineering teams, providing expertise and leadership across the stack.
Senior Data Reliability Engineer ensuring software reliability and quality across enterprise applications. Collaborating with teams to implement robust on - call processes and maintain data fidelity.
Infrastructure & Cloud Operations Engineer managing AWS and hybrid environments for CV - Library. Hands - on role focused on reliability, automation, and operational excellence.
Site Reliability Engineer building reliable and scalable infrastructure for fintech startup Pave Bank. Collaborating with internal teams to enhance banking platform performance and reliability.
Lead DevOps Engineer managing DevOps projects for high - quality strategy games at Twin Harbour Interactive. Collaborating with teams to optimize production systems and improve development workflows.
Software Engineer contributing to the observability team's development of visibility systems. Implementing a high - performance telemetry platform and supporting AI tools for engineering teams.
Senior DevOps Platform Engineer at Humana designing secure cloud infrastructure for healthcare technology. Responsible for CI/CD pipelines and compliance in regulated environments.
Site Reliability Engineer working on the post - RPA Agentic Automation Platform for enterprises. Responsible for developing scalable systems and improving operational reliability.