DevOps Engineer developing and managing scalable AWS infrastructures for a PropTech startup. Collaborating within a growing tech team to achieve ambitious goals in the legal conveyancing space.
Responsibilities
Design, implement, and maintain AWS resources following “well architected” best practices for high availability, security, and robustness.
Build and maintain automated CI/CD pipelines for efficient, reliable code deployments using tools like Jenkins, GitHub Actions and AWS CodePipeline.
Manage monitoring tools (CloudWatch, Datadog, Grafana, Prometheus) to ensure system stability and detect issues before they impact users.
Develop and maintain disaster recovery plans and backup solutions, meeting RTO/RPO targets.
Maintain and troubleshoot Linux systems to ensure security, performance, and reliability.
Automate routine tasks with scripting languages such as Python, Ruby, or Bash to improve deployment processes.
Work closely with leadership, development, sales, and operations to align technical solutions with business needs.
Ensure the infrastructure continues to meet and achieve security and compliance standards (Cyber Essentials Plus, ISO27001, GDPR, SOC 2).
Analyse performance metrics and optimise platform stability, performance, and cost-efficiency.
Effectively communicate technical strategies to both technical and non-technical stakeholders.
Manage your own workload while maintaining high standards of quality and productivity.
Requirements
5+ years of experience as a DevOps Engineer (or in a similar role) with expertise in AWS environments.
Deep knowledge of AWS services (EC2, S3, RDS, EKS, Lambda, CloudFormation, etc.).
Hands-on experience with CI/CD tools such as Jenkins, GitLab CI/CD, GitHub Actions, and AWS CodePipeline.
Proficiency in Infrastructure as Code using tools like Terraform or AWS CloudFormation.
Experience with monitoring and alerting tools like CloudWatch, Datadog, Prometheus, or Grafana.
Strong scripting skills in Python, Ruby, or Bash for automation.
Solid experience with Linux system administration and shell scripting.
Familiarity with Jira and Confluence for collaboration and documentation.
Problem-solving mindset with the ability to troubleshoot and resolve complex issues rapidly
Excellent communication and collaboration skills to work with both technical and non-technical teams.
AWS certifications (e.g. AWS Solutions Architect, AWS DevOps Engineer) are a bonus.
Benefits
Up to 40 days annual leave inclusive of Bank Holidays
Option to purchase up to 5 days leave
Hybrid working with flexible working times
Enhanced maternity and paternity leave
Company Sick Pay
Discounted Gym Membership
Subsidised Conveyancing
Employee Assistance Scheme which includes counselling sessions
Well being programmes
Ongoing training, development, and recognition programs.
A supportive and fun team environment, with regular collaboration and charity events.
Professional Development Support - We believe in fostering growth and will fully support your training and development in AWS and related technologies. This includes access to certification programs of your choosing, online courses, and workshops to advance your expertise and career.
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast - paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross - functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high - growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.
Engineering Manager leading teams for observability platforms at LexisNexis. Owns operational excellence across software delivery lifecycle in Raleigh, NC.
Reliability Engineer optimizing site facility infrastructure and utility systems at Roche. Conducting root cause analyses and developing maintenance plans to enhance reliability and efficiency.
DevOps SME designing, implementing, and operating multi - cloud platforms for The Missing Link. Collaborating with engineering, security, and operations teams while embedding DevOps best practices.
Site Reliability Engineer improving reliability of cloud infrastructure for an AI - specialized company. Taking ownership of monitoring and incident response processes in hybrid - working style.
DevOps Engineer leading automation for sophisticated release/deployment pipelines at Securonix. Focused on Python, Ansible, and cloud services to enhance security operations.
Senior Analyst on Data Platform DevOps at AIMCo, responsible for building data operations and collaborating with teams on innovative solutions. Focused on ensuring data quality and integrity across technologies.