Design and implement robust infrastructure tailored to the unique requirements of your assigned product line.
Manage cloud-based resources, utilizing AWS (and potentially Azure), and implement automation tools to optimize resource management.
Maintain CI/CD pipelines, automating the building, testing, and deployment processes.
Create monitoring for production systems to preemptively identify potential issues and swiftly address them.
Mentor and train junior team members (including cross-training developers).
Drive the implementation and management of automation tools and infrastructure-as-code practices.
Implement robust security and compliance measures, collaborating with security and compliance teams to uphold the integrity of the company's infrastructure and data.
Requirements
Bachelor's degree in Computer Science or related field.
At least 3 years of experience in cloud infrastructure administration or related field.
Expertise in AWS cloud-based environments (bonus for Azure expertise).
Proficiency in scripting languages like PowerShell, Python, Ruby, or Bash.
Experience with infrastructure automation tools Terraform and Ansible.
Proficiency with containerization technologies such as Docker and Kubernetes.
Experience with logging, monitoring, and alerting tools like DataDog, Prometheus, OpenTelemetry, Mezmo, TrackJS, etc.
Solid understanding of networking and security fundamentals.
Strong leadership, communication, and collaboration skills.
Willingness to learn and adapt to new technologies and methodologies.
Certifications in AWS, Azure, or other relevant technologies are a plus.
Benefits
Competitive wages
Wellness days
Community Engagement Committee
Flexible workday
Benefits Ask us for a copy of our health and dental benefits!
Network & Datacenter Deployment Engineer at Cloudflare focused on building and expanding their global network infrastructure with collaboration across multiple engineering teams and vendors.
Senior DevOps Engineer leading cloud - native solutions at Sparksoft Corporation. Driving automation and system reliability within a fast - paced Agile team.
Platform Engineer focusing on supporting CI/CD pipelines and Kubernetes at PCCW. Responsible for ensuring platform services' reliability and performance, with night - time support as needed.
Site Reliability Engineer at Bumble optimizing large - scale Linux environments and ensuring system stability. Focusing on troubleshooting, incident recovery, and performance tuning in complex infrastructures.
Senior DevOps Manager overseeing CI/CD processes for NVIDIA Networking products. Leading a team and collaborating with global teams to enhance R&D efficiency and infrastructure.
DevOps Manager overseeing engineering team developing scalable CI/CD processes for NVIDIA Networking products. Enhancing global R&D efficiency in a technology - focused company.
Join Operations Team as Senior Site Reliability Engineer driving operational excellence for cybersecurity solutions. Collaborate across teams to manage production platforms and optimize infrastructure.
Software Developer - DevOps System Administrator working within the SCMT team to enhance software application efficiency. Collaborating on tools and scripts for application lifecycle management.
DevOps Engineer managing CI/CD pipelines and Kubernetes deployments at Stefanini. Collaborating with teams to optimize application health and deployment processes.