Senior Observability Engineer managing the observability and performance ecosystem at Trading 212. The role involves automating, monitoring, and optimizing large-scale distributed systems.
Responsibilities
Own and evolve Trading 212’s observability and performance ecosystem across cloud and on-prem Kubernetes environments.
Design, automate, and optimize observability infrastructure (Prometheus, CloudWatch, Elasticsearch, Kafka, etc.) using IaC and GitOps.
Build Grafana dashboards and implement a smart alerting strategy to surface actionable insights.
Monitor and analyze system performance, identify bottlenecks, and drive improvements in reliability and cost-efficiency.
Collaborate with product, QA, and engineering teams to embed observability best practices.
Maintain clear documentation and mentor engineers, fostering a culture of data-driven performance.
Plan and test Multi-AZ/Region DR and resilience scenarios.
Requirements
5+ years of experience in DevOps, SRE, or Systems Engineering, focusing on observability for large-scale distributed systems.
Proven experience deploying and maintaining observability tools.
Metrics & Monitoring: Strong proficiency with Prometheus and Grafana; experience with AWS CloudWatch.
Log Management: Deep knowledge of the ELK stack (Elasticsearch, Logstash, Kibana) and Fluent Bit.
Cloud & Containers: Hands-on experience with AWS, Docker, and Kubernetes.
Automation & IaC: Skilled in Python, Go, or Bash for scripting, and proficient with Terraform (Ansible/Puppet a plus).
Systems Knowledge: Strong grasp of distributed systems, networking, and Linux/Unix internals.
Problem-Solving: Analytical, detail-oriented, and methodical in root cause analysis and troubleshooting.
Benefits
Challenges that will help you grow and realize your potential quickly
Opportunity to make a big impact: you will build innovative services that millions of investors use to build wealth
Work with smart, spirited, helpful, high-performing colleagues with a common goal
An environment where nothing is set in stone
Appreciation for your talent and ideas
Generous remuneration package including annual bonuses
Excellent social benefits package, including private health insurance and sports card
Senior Extrusion Equipment Engineer at Teleflex optimizing extrusion manufacturing processes and machinery. Focused on enhancing product quality, efficiency, and safety with hands-on problem solving and leadership responsibilities.
Senior Process Engineer at BASF managing lab and plant-scale projects in McIntyre, GA. Leading product development, trial execution, and process troubleshooting while ensuring health and safety compliance.
Civil Engineer (m/f/d) for the Building Management division at the Landratsamt Aichach-Friedberg (district office). Responsible for planning and supervising maintenance and renovation work on municipal properties.
Carbon Analysis Engineer measuring greenhouse gas emissions across industrial processes for climate decarbonisation at Neutreeno. Working in a hybrid model with Cambridge-based office requirements.
Autopilot Engineer developing and customizing firmware for autonomous systems at STARK. Collaborating on testing/debugging and integrating with AI systems.
Controls Engineer III designing and supporting industrial control systems for manufacturing at SP Industries. Leading complex projects from concept through commissioning while ensuring compliance with industry standards.
Process Engineer specializing in biogas projects, leading technical decisions and analyses. Involved from preliminary studies to basic project development in São Paulo, Brasil.
Engineer enhancing and supporting critical business recovery infrastructure at Lloyds Banking Group. Collaborating with a dedicated team to ensure operational resilience and recovery for business services during crises.
Leading the Planning Department with responsibilities in network and facility planning. Driving asset management development and project planning in the energy sector.
VoIP Engineer managing Aircall’s telephony infrastructure and ensuring secure, high-quality voice connectivity. Responsible for deploying cloud telephony technologies and collaborating with cross-functional teams.