Observability Platform Engineer at Amex GBT designing observability platforms using tools like ELK Stack and New Relic. Collaborating with teams to enhance system reliability and performance metrics.
Responsibilities
Design and deploy observability platforms using industry-leading tools such as ELK Stack, New Relic, Datadog, and Alert site
Develop and maintain monitoring strategies, dashboards, and alerting rules to ensure system reliability and performance
Collaborate with engineering teams to instrument applications and infrastructure for comprehensive observability
Troubleshoot complex system issues using observability data and provide actionable insights
Establish best practices for logging, metrics collection, and distributed tracing
Optimize observability infrastructure for cost-efficiency and performance
Conduct training and knowledge-sharing sessions with development and operations teams
Participate in on-call rotations and incident response activities
Continuously evaluate and recommend new observability tools and technologies
Requirements
5+ years of experience in platform engineering, DevOps, or systems engineering roles
Hands-on expertise with at least two of the following platforms: ELK Stack, New Relic, Datadog, or Alertsite
Strong understanding of monitoring, logging, metrics, and alerting concepts
Proven experience creating and maintaining monitoring dashboards and visualizations
Hands-on experience implementing synthetic monitoring and end-to-end transaction monitoring, Application Performance Monitoring (APM) concepts and implementation, Real User Monitoring (RUM) and digital/browser/mobile app observability
Knowledge of SLI/SLO definition and measurement methodologies
Familiarity with MTTA, MTTR, MTTD, and other incident metrics
Proficiency in scripting languages (Python, Bash, or similar)
Experience with cloud platforms (AWS, Azure, or GCP)
Knowledge of containerization and orchestration technologies (Docker, Kubernetes)
Senior Platform Engineer at Sweden's largest digital healthcare provider. Focused on AWS, Kubernetes, and AI workloads in production with real influence.
Senior Platform Engineer designing and building the Signicat Digital Trust Platform for digital identity needs. Join a pan - European digital identity company with diverse projects across multiple locations.
Senior Software Engineer in Strategy, Digital & Innovation Group at Wells Fargo. Leading infrastructure development and maintaining systems in a hybrid work environment.
Power Platform Developer responsible for designing and implementing solutions using Microsoft Power Platform. Collaborating with stakeholders and optimizing organizational processes through application development and automation.
Platform Engineer II managing payment platforms at NCR Atleos. Supporting system maintenance, troubleshooting, and compliance in a hybrid work environment.
Staff Cloud & Edge Platform Engineer managing complex engineering challenges and driving architectural decisions for PayPal's Global Edge Infrastructure. Collaborating with teams to enhance secure cross - border money movement.
Platform Engineer to build scalable solutions for engineers and customers at Definely. Responsible for reliability, security, and automation in product infrastructure.
Platform Engineer managing technical infrastructure for subtitles and language services in film and streaming. Collaborating with development and product teams to ensure system performance and scalability.
Senior SharePoint Power Platform Developer at Geosyntec focused on developing solutions and automation. Collaborating with teams to address challenges in environmental, natural resources, and civil infrastructure.
Sr. Data Platform Engineer I at MetroStar designing and optimizing PostgreSQL databases for federal government. Collaborating with a team to maintain database stability and support operations.