Support and manage applications and services in a hybrid-cloud environment such as AWS; ensure optimal performance, cost-efficiency, and security
Support Kubernetes clusters for containerized application deployment, scaling, and management
Configure and integrate DevOps tools and processes with the ServiceNow platform to automate incident management and support other IT service workflows
Serve as a subject matter expert for Level 2 technical issues, troubleshooting complex problems related to software engineering, cloud infrastructure, and application performance
Work closely with software development, deployment, and SRE teams to ensure seamless integration and a shared understanding of operational requirements
Implement comprehensive monitoring and logging solutions to ensure system health and proactively identify and resolve issues
Participate as a member of the Incident Management Support Team – OEM; no supervisory responsibilities
Requirements
Experience providing Level 2 technical support in a fast-paced, enterprise environment
Excellent problem-solving, analytical, and communication skills
Demonstrable expertise in Cloud Engineering on at least one major platform (AWS, Azure, or GCP)
Proven experience with containerization and orchestration technologies, specifically Kubernetes
Hands-on experience with ServiceNow, including ITSM, ITOM, and other relevant modules
Minimum of 2-5 years of experience in DevOps, Software Engineering, or similar role
Bachelor’s degree in computer science, Information Technology, or a related field
Certificates or Licenses: None
Benefits
A hybrid work environment, up to 2 days per week of remote work
Tuition Reimbursement to support your continued education
Student Loan Repayment Assistance
Technology Stipend allowing you to use the device of your choice to connect to our network while working remotely
Generous PTO and Parental leave
401k Employer Match
Competitive health benefits including medical, dental and vision
Eligibility for a discretionary bonus; Incentive Range 8% to 15%
DevOps Product Manager working on complex platform and infrastructure projects. Consulting on DevOps best practices and ensuring scalable, efficient digital ecosystems for clients.
Site Reliability Engineer optimizing large - scale Linux environments at Bumble Inc. Troubleshooting incidents and driving performance improvements on platforms such as Kafka and Kubernetes.
Senior DevOps Engineer at mylo, managing multi - cloud infrastructure and CI/CD pipelines. Promoting DevOps culture while ensuring compliance and automating system maintenance.
Lead Site Reliability Engineer at S&P Global's Cloud Engineering team. Responsible for designing and maintaining cloud infrastructure and ensuring the performance of cloud - based systems.
Site Reliability Engineer responsible for monitoring and improving the reliability of satellite operations infrastructure. Collaborating with teams to automate processes in a dynamic environment.
DevOps Analyst providing high quality and reliable solutions within multifuncional teams at technology - focused financial organization. Automating build and deployment solutions in a hybrid work environment.
Network & Datacenter Deployment Engineer at Cloudflare focused on building and expanding their global network infrastructure with collaboration across multiple engineering teams and vendors.
Senior DevOps Engineer leading cloud - native solutions at Sparksoft Corporation. Driving automation and system reliability within a fast - paced Agile team.