Senior Observability Engineer managing observability and performance ecosystem for Trading 212. Involved in automation, monitoring, and optimizing large-scale distributed systems.
Responsibilities
Own and evolve Trading 212’s observability and performance ecosystem across cloud and on-prem Kubernetes environments.
Design, automate, and optimize observability infrastructure (Prometheus, CloudWatch, Elasticsearch, Kafka, etc.) using IaC and GitOps.
Build Grafana dashboards and implement a smart alerting strategy to surface actionable insights.
Monitor and analyze system performance, identify bottlenecks, and drive improvements in reliability and cost-efficiency.
Collaborate with product, QA, and engineering teams to embed observability best practices.
Maintain clear documentation and mentor engineers, fostering a culture of data-driven performance.
Plan and test Multi-AZ/Region DR and resilience scenarios.
Requirements
5+ years of experience in DevOps, SRE, or Systems Engineering, focusing on observability for large-scale distributed systems.
Proven experience deploying and maintaining observability tools.
Metrics & Monitoring: Strong proficiency with Prometheus and Grafana; experience with AWS CloudWatch.
Log Management: Deep knowledge of the ELK stack (Elasticsearch, Logstash, Kibana, Fluentbit).
Cloud & Containers: Hands-on experience with AWS, Docker, and Kubernetes.
Automation & IaC: Skilled in Python, Go, or Bash for scripting, and proficient with Terraform (Ansible/Puppet a plus).
Systems Knowledge: Strong grasp of distributed systems, networking, and Linux/Unix internals.
Problem-Solving: Analytical, detail-oriented, and methodical in root cause analysis and troubleshooting.
Benefits
Challenges that will help you grow and realize your potential really fast
Opportunity to make a big Impact - you will build innovative services used by millions of investors to build wealth
Work with smart, spirited, helpful, high-performing colleagues with a common goal
An environment where nothing is set in stone
Appreciation for your talent and ideas
Generous remuneration package including annual bonuses
Excellent social benefits package, including private health insurance and sports card
Associate Transmission Line Engineer providing design and project support for T&D projects at Leidos. Working in a team atmosphere with flexible work arrangements and career development opportunities.
VAPT Engineer with 2 - 4 years of experience in penetration testing for web and mobile applications. Analyze systems vulnerabilities and implement security best practices at RIB.
Engineer 2, Labelling Assurance coordinating development and implementation of product labels and manuals at Cook Medical. Collaborating with various teams for timely execution of projects.
Nutanix AHV Virtualization Engineer contributing to virtualization strategy and implementation. Collaborating closely with engineering teams on Nutanix AHV infrastructure projects.
Process Engineer designing engineering solutions for manufacturing processes at HP. Collaborates with teams to achieve production standards and develop innovative solutions.
Senior Distribution Engineer managing planning, design, and execution of electrical distribution systems for Milhouse Engineering. Collaborating with teams for safe and efficient power delivery.
Distribution Engineer IV at Milhouse Engineering responsible for planning, design, and execution of electrical distribution systems. Ensuring safe, reliable, and efficient power delivery while providing technical leadership and project management.
Data Engineer integrating into SiDi team in Campinas, focused on developing machine learning models for embedded devices. Collaborating with global teams in a hybrid work environment.
Field Service/Support Representative supporting EW systems for HII's Mission Technologies division. Training users and troubleshooting RF issues to keep fleets mission - ready.