DevOps Manager leading a distributed team managing L3 support for vision AI solutions. Overseeing operations for Edge/on-prem and cloud platforms at Everseen.
Responsibilities
Lead and grow a distributed DevOps Operations team that owns the L3 support function for Edge/on-prem and cloud platforms.
Establish processes and capabilities for reliable operations at scale.
Foster collaboration between Operations and DevOps engineering teams for incident and change management.
Deliver fast, regionally aligned L3 support for DevOps-related incidents and changes.
Equip Ops with the tooling and automation needed to resolve L1/L2 issues.
Partner with DevOps engineering to automate and streamline existing workflows.
Requirements
Proven 3+ years experience as a DevOps Manager and previous experience as a DevOps/Automation Engineer, Linux Administrator or a combination of them.
Experience in building new teams and establishing processes and procedures.
Basic understanding of local and distributed filesystems (ext4, xfs, NFS), storage migration, SSH tunneling, SCP, SFTP, web servers (Nginx, Apache), reverse proxying, and troubleshooting (logs, performance, connectivity).
Basic knowledge of Docker and Kubernetes/OpenShift, and virtualization solutions.
Experience with Ansible/Ansible Tower (or similar tools like Salt, Puppet, Chef), Jenkins/GitLab CI/CD (or equivalents like CircleCI, TeamCity), and Infrastructure as Code (IaC) using Terraform (or alternatives like Azure Automation, CloudFormation).
Familiarity with Azure Cloud (or AWS, GCP), GIT for system versioning, and basic understanding of backup solutions.
Experience in Bash scripting and Python for automation and development tasks.
Experience with monitoring tools like Node exporter, Prometheus, Grafana, and log collection solutions (e.g.,Prometheus, fluentd, logstash).
Basic understanding of relational (MySQL, MariaDB) and NoSQL (Elasticsearch, MongoDB) databases, with hands-on experience writing queries.
Hands-on experience with JSON and making Restful API calls.
Benefits
Full Time Permanent
Engineering - DevOps team
Continuous learning through workshops and training
Senior DevOps Engineer at Twin Harbour Interactive developing and maintaining high availability systems with a focus on optimization and tooling. Collaborating with game development and product teams in a hybrid work environment.
DevOps Engineer ensuring reliable operations for SaaS solutions at INFORM. Focus on CI/CD, cloud infrastructure, and service automation in the Risk & Fraud business unit.
Site Reliability Engineer ensuring performance, scalability, and security of production environments at FIS. Collaborating on resilient, self - service platforms for fintech solutions.
HPC Storage Dev Ops Engineer identifies and optimizes storage solutions at Intel. Overseeing installation and performance to ensure data integrity and compliance with regulations.
DevOps Engineer working on Linux - based infrastructure focusing on automation with tools like Ansible and Terraform. Engaging in international projects and ensuring optimal system operations.
Senior Site Reliability Engineer ensuring reliability of applications across AWS infrastructure at Onit. Collaborating with teams to troubleshoot and optimize system performance.
Chassis Engineer leading Brake system design for Ford Racing. Focused on delivering performance vehicle solutions through innovative design and collaboration with teams.
Site Reliability Engineer at Coinbase optimizing cloud deployments and enhancing system reliability. Working with engineering teams to improve software reliability and performance across the organization.
Senior Site Reliability Engineer designing and implementing high - reliability platforms for Broadridge. Collaborating with teams across hybrid environments and driving automation and efficiency in service delivery.
Staff Engineer for GM's Hybrid Services & Reliability team. Driving reliability architecture and maintenance for hybrid cloud services with a focus on SRE principles.