DevOps Engineer responsible for deploying, monitoring, and operating software across key production systems. Impacting user experience and company growth in an agile environment with modern tools.
Responsibilities
Provide technical thought leadership in collaboration with Architecture and Engineering
Design, implement, and maintain infrastructure as code for both development and production environments
Design, implement and maintain observability and monitoring solutions and monitor the system health, taking action to maintain up-time and performance SLAs, including on-call rotation
Design, implement, and maintain continuous delivery systems and advise development teams on code and implementation to increase application resiliency, operability, and best practices
Design, implement, and maintain Kubernetes clusters with advanced configuration and features
Design, implement, and maintain Kafka clusters with advanced configuration and features
Identify points of failure and recommend solutions in our infrastructure
Years of Experience – 3+ relevant experience in Dev Ops in a SaaS environment
Expertise in distributed systems including load balancing, distributed messaging, and distributed databases
Expertise in software configuration management (SCM) and tools like Chef, Puppet, Ansible or similar
Expertise in cloud platforms, primarily AWS
Expertise in automated deployment methodologies that support CI/CD/CM using Git, Jenkins or GitHub Actions preferred
Experience configuring and administering Kubernetes clusters using advanced features including non-standard networking, storage classes/claims, namespacing, and multiple node groups
Experience configuring and administering Kafka clusters including disaster recovery mechanisms, data loss prevention techniques, and determining topic configuration to right size for demanding and variable loads
Experience with Windows and Linux systems and tools
Prior professional application development experience with a particular focus on SaaS applications and web development
Knowledge of IP networking, including TCP, UDP, firewalls, SSL
Knowledge of software engineering and design principles such as design patterns, architectural patterns, CAP Theorem, event driven architecture, and similar
Must be security-minded at all levels
Excellent communication and team collaboration skills ability to approach problems in a structured way and find reliable solutions.
Benefits
Alcumus has a **hybrid workplace policy, **where you will work from the office 3 days per week. We want you to be able to do your best work here. We emphasize providing many ways to support our team to do their best work and below are some of the perks and benefits we offer:**** **** **Personal Health & Wellbeing / Benefits******>🍼 Enhanced Parental Leave ****>🌴Generous annual leave ****>🏥 Healthcare Plan****>💟 Annual Giving Day – an extra day to give back to yourself or your community****>🚲 Cycle-to-work Scheme ****** Future Planning******>**💰**Pension scheme with employer contributions ****>🧬 Life Assurance – 3X base salary****>💸 Rewards Program – access to discounts and cashback ****>🏫 LinkedIn Learning License for upskilling & development ******Interested but don’t feel you meet all the requirements? ******Our recruitment team assesses and reviews all applications against the role and business needs. We believe in people having transferable and soft skills and want you to know that we do consider where an individual might not meet all the criteria, but have the aptitude and capability, nonetheless. Our priority is to ensure we set people up for success. We will make a final call based on our determining whether we can offer the necessary support to upskill or provide the developmental support needed for you to get the best out of this opportunity with us!******Bring Your Whole Self to Work.******Alcumus is proudly an equal-opportunity employer. We are committed to ensuring that no candidate is discriminated against because of gender identity and expression, race, disability, ethnicity, sexual orientation, age, colour, region, creed, national origin, or sex. We are dedicated to growing a diverse team while continuing to create an inclusive environment where everyone feels safe and empowered to be themselves. ******What you can expect if you apply: **- A response to your application within 15 working days- An interview process consisting of:- An initial discovery call with the recruiter- A first stage interview via Microsoft Teams - Additional interview (likely face to face) with the stakeholders you’ll be working with closely in the role We’re keen to ensure our hiring process allows you to be at your best, so if you need us to make any adjustments, please just let us know.
Senior Reliability Engineer applying a variety of reliability techniques and managing projects at Baker Hughes. Collaborating with teams to meet customer expectations and enhance their success.
Staff Site Reliability Engineer managing large - scale systems and ensuring infrastructure reliability for NordVPN's services. Collaborate on automating platforms and solving complex technical challenges.
Site Reliability Engineer responsible for infrastructure performance and reliability at ASAPP, collaborating with product engineering teams and automating processes.
DevOps Technical Lead specializing in automation and CI/CD pipeline management at Stanley Black & Decker. Leading a team to enhance cloud infrastructure within an innovative technology environment.
DevOps Engineer for Vodafone Innovus enhancing DevOps solutions in IoT applications. Collaborating with software, QA, and systems engineers to optimize deployment and continuous integration.
DevOps Engineer accountable for the Salesforce DevOps program at S&P Global. Collaborating with Agile teams, managing releases, and enhancing DevOps processes.
DevSecOps Engineer designing secure cloud infrastructure at CredLens, ensuring best practices in security throughout the development lifecycle. Collaborating with engineering and data teams on dependability and compliance.
Senior Site Reliability Engineer ensuring reliability, scalability, and performance of services at Granicus. Leading automation processes and implementing best practices in site reliability engineering.
Senior Site Reliability Engineer at Coinbase, focusing on identity and access management tooling. Responsibilities include automation, cloud - native development, and maintaining secure system architectures.
Join CORTO as a DevOps Engineer working on AWS infrastructure for enhancing legal tech solutions. Collaborate with a high - achieving team to optimize and support development environments.