Hybrid Senior Observability Platform Engineer

Posted last month

Apply now

About the role

  • Configure, operate, and enhance observability platforms and frameworks (Clickhouse, Thanos, Loki, Tempo, OpenTelemetry Collector + custom processors)
  • Manage and evolve core observability infrastructure supporting engineering teams and a customer-facing portal
  • Handle telemetry at scale (more than 20 TB per day from 10,000+ nodes including Linux hosts, k8s clusters, VMs)
  • Drive organization-wide adoption of observability best-practices for monitoring, logging, and tracing
  • Develop and maintain automated solutions for monitoring, alerting, and incident response
  • Collaborate with engineering teams to provide scalable observability solutions and understand their needs
  • Optimize system performance, ensure high availability, and perform capacity planning and cost optimization
  • Experiment with and integrate new observability tools and OpenTelemetry Collector to enhance telemetry collection and analysis

Requirements

  • Proven track record managing observability stacks (Thanos, Mimir, Cortex, Tempo, Loki, Clickhouse)
  • Deep understanding of Kubernetes architecture and hands-on cluster management
  • Experience writing and maintaining Helm charts
  • Experience with GitOps, CI/CD and continuous delivery practices
  • Expertise in Docker containerization and orchestration
  • Proficiency in Linux system administration, scripting and automation
  • 5+ years of experience in platform engineering, site reliability engineering, or a related role
  • Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent experience)
  • Demonstrated experience managing large-scale infrastructures and observability platforms
  • Coding experience in Golang or similar language (desirable)
  • Open source contributions in Golang or similar language (desirable)
  • Knowledge or contribution to OpenTelemetry Collector (desirable)
  • Strong communication skills and ability to convey technical concepts to non-technical stakeholders
  • Quick learner and collaborative mindset
  • Customer-focused approach

Benefits

  • Recognized as an outstanding place to work
  • Collaborative team environment
  • Opportunities to develop skills and advance your career
  • Culture emphasizing customer safety, unconventional thinking, simplicity, and collaboration

Job title

Senior Observability Platform Engineer

Job type

Experience level

Senior

Salary

Not specified

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job