System Monitoring & Observability Engineer at SRT Marine Systems, responsible for implementing user-friendly observability solutions using Prometheus and Grafana across global systems.
Responsibilities
Design, configure, and maintain Prometheus-based monitoring solutions
Develop and manage metric exporters for application and system-level data
Optimise Prometheus scraping configurations and retention policies
Define and maintain alert rules based on SLIs/SLOs and performance baselines
Ensure alerts are actionable, with minimal false positives
Participate in on-call rotations and incident postmortems
Design and maintain Grafana dashboards for real-time operational insights
Collaborate with engineering and product teams to create tailored visualisations
Provide self-service dashboard capabilities for end users
Monitor infrastructure for uptime, latency, and throughput
Identify bottlenecks and recommend improvements
Requirements
Proven experience with Prometheus (including PromQL) and Grafana in production environments
Strong knowledge of Linux-based systems
Experience writing and optimising PromQL queries for alerts and dashboards
Familiarity with exporters (node_exporter, blackbox_exporter, custom exporters)
Understanding of alertmanager configuration and routing
Proficiency with Grafana dashboard creation and templating
Strong troubleshooting skills for infrastructure and application issues
Familiarity with containers (Docker)
Scripting skills (Bash, Python, or Go) for automation
Benefits
Highly Competitive Salary
Matched company pension contributions up to 5%
25 days annual leave rising to 28 days with service
Career development opportunities
Company “Get to know you” days
Job title
System Monitoring & Observability Engineer, Prometheus, Grafana
Engineer specializing in battery energy storage systems (BESS) projects in France. Focused on client technology selection, energy market optimization, and project lifecycle involvement.
System Monitoring & Observability Engineer responsible for Prometheus/Grafana visualization at SRT Marine Systems. Working in a team to enhance user - friendly monitoring solutions.
Broadcast Maintenance Engineer providing technical support and equipment maintenance at BMG's US operations. Overseeing system upgrades and assisting in production environments while ensuring equipment functionality.
Senior Product Development Engineer leading the technical ownership and engineering development of furniture systems at 7th Avenue. Focused on CAD control, manufacturing documentation, and mechanical systems integration.
R&D Engineer developing failure analysis systems for innovative battery technology. Collaborating with a skilled team in the revitalizing lithium supply chain.
Safety Engineer for Project GOSHAWK managing safety assurance and governance across the project. Leading safety management activities and developing trial plans within a hybrid working model.
Intermediate Validation Engineer at DATAmundi leading SSD product validation and optimization. Collaborating with engineers to ensure high - quality, next - generation memory products meet strict performance standards.
Forward Deployed Engineer developing AI - powered software at Lovable in Stockholm. Building new engineering functions and partnering with customers to create groundbreaking solutions.
Process Manufacturing Engineer developing and implementing PCB manufacturing systems at TTM Technologies. Focused on improving manufacturability and collaborating with customers and suppliers.
Process Engineering Manager at TTM Technologies leading technical direction and engineering oversight. Responsible for design robustness and manufacturability to meet project milestones and quality standards.