Platform DevOps Engineer for Fidelity managing the technology and delivery of the private market platform. Collaborating closely with engineering teams to support operations and expand growth opportunities.
Responsibilities
Lead the technology, architecture, implementation, testing, and delivery of the private market platform
Work closely with the engineering team and product managers to support our platform and customers
Research and develop new growth opportunities to expand on the strengths of the core technology
Design and build our application infrastructure on AWS using a variety of technologies
Own the build and deploy process supporting CI/CD across multiple environments
Deliver internal infrastructure services such as monitoring, logging and alerting to internal users
Automate as much of the operational infrastructure as possible
Requirements
Deep knowledge of Linux operating systems and core AWS services such as IAM, VPC, ECS, Lambda, RDS, SQS, SNS, S3, and CloudFormation
Demonstrated experience with configuration management and infrastructure management systems such as CloudFormation, SaltStack, Terraform/OpenTofu, and others
Solid understanding of monitoring tools such as Sensu, CloudWatch, Prometheus, Grafana, ELK Stack, and OpenTelemetry
Good understanding of web services, databases and related infrastructure
Practical and proven skill with an administrative language such as Python or Bash
Background in building, maintaining and automating build and test pipelines with tools such as Jenkins
Experience with or exposure to container-based runtimes such as Docker and orchestration tools such as EKS/ECS
Proven understanding of networking and distributed computing concepts
Enjoys collaborating with other developers, pair programming, reviewing code and whiteboarding problems
Excellent problem-solving skills, for when the playbook just doesn't cover it
DevOps Engineer focusing on deploying high-security on-prem infrastructure and MLOps platforms for mission-critical systems. Collaborating on Kubernetes-based orchestration and machine learning workloads.
Cloud Site Reliability Engineer managing Solace Cloud services across leading cloud providers. Ensuring reliability, handling incidents, and collaborating with customers for operational excellence.
Senior Cloud Site Reliability Engineer ensuring reliability and health of Solace Cloud Services with hands-on cloud operations expertise. Lead incident management and customer support for high-impact environments.
DevOps Engineer designing and operating AWS infrastructure within industrial IoT environments. Working on systems that ensure security, resilience, and end-to-end observability.
Sr. Site Reliability Engineer (SRE) III providing technical solutions for the federal government. Collaborating in a high-performing team focused on reliability and application scalability.
Senior Linux System Engineer developing and maintaining Linux server infrastructure for Th. Geyer GmbH. Collaborating on ERP systems and CI/CD processes while ensuring system performance and security.
Platform Engineer leading the development of cloud application platforms for Allstate. Responsible for cloud infrastructure for ML experimentation and production deployments.
Cloud Platform Engineer (ML DevOps) developing and managing CI/CD pipelines for ML workflows in a leading insurance company. Collaborating with data scientists and ensuring infrastructure security and compliance.