Participate in a hybrid work schedule that includes working from our Raleigh office 3 days a week.
Manage highly scalable and available distributed systems on cloud-based platforms.
Automate everything: write automation and configuration management code to build scalable, reliable, and secure systems.
Identify key system metrics and ensure adequate monitoring coverage for new and existing services.
Build monitoring, supervision and resiliency mechanisms to observe, react to, and maintain platform health and performance.
Manage deployments, upgrades, and configurations in controlled pre-production and production environments with tight operating parameters.
Be part of a DevOps team with rotating after-hours on-call support (24/7/365).
Verify the functionality of components and services and ensure deployments meet expectations; design, create, and document test plans, and possibly code a custom test automation framework.
Work on complex, major or highly visible tasks in support of multiple projects that require multiple areas of expertise.
Plan and execute ongoing routine application maintenance tasks, such as production support and troubleshooting of existing information systems; identify errors and deficiencies and develop short- and long-term solutions.
Keep up to date with security patches and proactively address security vulnerabilities and compliance issues.
Analyze a variety of approaches to larger DevOps issues and present the pros and cons of each to the team to arrive at an agreed-upon direction.
Gather and understand business requirements with a focus on business outcomes.
Prepare and present formal training to IT staff, as needed.
Requirements
Strong Windows skills in operations and troubleshooting.
Experience operating, tuning, and troubleshooting SQL Server and Postgres in Production as well as database design, backup, and restoration.
Operational experience with Kubernetes, Docker and HashiCorp products in Production.
Strong demonstrated experience in Python, Java, or shell scripting.
Expert experience with operations and DevOps in AWS.
Expert experience with automation and configuration management using Ansible, Terraform, Chef, or Puppet, and with CI/CD tools such as Jenkins, Artifactory, Git, etc.
Ability to manage projects or act as project lead, and to provide subject matter expertise and technical direction in specific areas.
Strong experience with APM tools such as New Relic or AppDynamics, as well as log analysis and monitoring tools such as Splunk, CloudWatch, Dynatrace, Nagios, Sysdig, Site24x7, etc.
BS or MS in Computer Science, related field, or equivalent professional experience.
Experience in network security (DNS, VPN/VPC, IDS/IPS, Subnets/Security Groups/Network ACLs) and technologies supporting compliance (HA/DR, Identity Management, Key Management, WAF and others).
NICE TO HAVE: Experience with Cloud Network Design (AWS, Azure or GCP).
NICE TO HAVE: Experience with FortiGate, Citrix NetScaler, or other firewall and load-balancing products.
NICE TO HAVE: Experience with Kubernetes cluster optimization and security management.
NICE TO HAVE: Experience managing VPN solutions incorporating 2FA.
NICE TO HAVE: Understanding of security best practices for banking procedures (ideally PCI or SOC knowledge).
NICE TO HAVE: Experience with Jira.
Benefits
Participate in a hybrid work schedule that includes working from our Raleigh office 3 days a week.