Cloud Infrastructure Engineer ensuring AWS service reliability and performance at Perlego. Collaborating with teams and managing infrastructure in a hybrid working environment.
Responsibilities
Manage and support AWS infrastructure, focusing on scalability, security, and reliability
Handle deployments, managing CI/CD pipelines for both containerised (Docker/ECS) and serverless (AWS Lambda) applications
Ensure effective backup, recovery, and disaster recovery strategies to minimise downtime
Manage operational and analytical data stores (Aurora MySQL, DynamoDB, Databricks)
Monitor and manage platform activity using tools like Prometheus, Grafana, or AWS CloudWatch
Respond quickly to alerts and incidents, independently resolving issues and ensuring service uptime
Conduct post-incident reviews and help improve system resiliency through automation and monitoring enhancements
Review network activity with AWS Security Hub and Cloudflare
Collaborate with cross-functional teams to implement platform improvements
Work independently and make swift decisions when managing service incidents outside core business hours
Assist in platform security, ensuring adherence to best practices for cloud security and compliance
Automate manual processes to reduce human error and improve efficiency
Continuously enhance monitoring systems, ensuring robust early detection and resolution capabilities
Identify potential performance bottlenecks and contribute to overall platform optimisation
Requirements
Experience in Cloud Infrastructure Engineering, DevOps, or a similar field
Strong experience with AWS services and containerised applications
Strong experience operating operational data stores (Aurora MySQL, DynamoDB)
Expertise in using monitoring tools (e.g. Prometheus, Grafana, CloudWatch)
Strong understanding of network security and Cloudflare
Hands-on experience with CI/CD pipeline management for deploying containerised and serverless applications, preferably with GitHub Actions
Proficiency in Linux-based operating systems and shell scripting
Familiarity with Infrastructure as Code tools (Terraform, CloudFormation)
Experience with incident management, troubleshooting, and platform recovery in high-pressure environments
Strong communication skills with a proven ability to work both independently and collaboratively
It’s a plus if you have experience working in a global, distributed team providing off-hours support
Experience with analytical data stores (Databricks)
Previous experience with SecOps and cloud security best practices
Familiarity with scaling highly available systems in a fast-paced, growth-oriented environment
Benefits
Flexible hybrid working environment, in the office twice a week
Personal L&D budget for online courses, subscriptions, or books not on Perlego
Dedicated Learning Time for new skills, projects, or interests
22 days annual leave + 1 additional day per year of service
Days between Boxing Day and New Year off
Flexibility to swap local bank holidays for religious or cultural days
Flexible short-period remote working overseas, as long as you remain a UK tax resident
1-month unpaid sabbatical after 3 years; 1-month paid sabbatical after 5 years
1 additional day per year for life events
Private medical, optical and dental insurance via Vitality
Cycle to Work Scheme
Regular social events and activities for everyone
Competitive matched parental leave and phased return to work
Principal Engineer leading design and implementation of secure architectures for Walmart’s AI Security Team. Responsibilities include risk management, capacity planning, and cross - team collaboration.
Communications Desk Infrastructure Engineer responsible for maintaining and troubleshooting APS communication systems. Supporting critical operational and public safety communication needs across Arizona.
Student Assistant in IT Infrastructure Engineering at Liebherr - Hamburg. Supporting network solutions, system configurations and project management tasks.
Infrastructure Architect required for designing a next - gen hosting platform in Kubernetes at Enova Consulting. Collaborating closely with engineers and partners for a hybrid infrastructure solution.
Senior Infrastructure Engineer designing and building hybrid networks for ICEYE’s satellite operations. Ensuring high - throughput and reliability between ground stations and cloud environments.
AI Infrastructure Engineer designing and implementing AI solutions for Xsolla's infrastructure tasks across GCP and multi - cloud environments. Collaborating with senior engineers to execute AI strategy.
Data Transport Infrastructure Engineer at Leidos supporting U.S. Air Force Cloud One Architecture. Involves developing scalable cloud - native solutions and mentorship roles in a hybrid remote setting.
Principal Software Engineer on Walmart's AI Security team analyzing threats and implementing robust security architectures. Collaborate across domains and mentor on AI safety and secure engineering practices.
Data Center Infrastructure Architect designing scalable and resilient optical cabling for hyper - scale data centers. Implementing physical solutions and automating fiber mapping for efficiency.
Systems and Infrastructure Engineer managing technology infrastructure and providing DevOps support for system reliability. Collaborating with development teams to implement solutions and enhance system performance.