Site Reliability Engineer responsible for designing and maintaining AWS infrastructure. Focus on system performance, reliability, and collaboration with development teams for secure applications.
Responsibilities
Design, build and maintain scalable, and reliable cloud infrastructure in AWS
Monitor and manage the performance, reliability, and security of our systems
Implement, and improve monitoring tools to ensure system health, and availability
Work with development teams to build, and maintain scalable, resilient and secure applications
Participate in our on-call rotation, and resolve production issues
Continuously improve automation, monitoring and deployment processes
Requirements
Experience with AWS services such as ECS, S3, RDS, Lambda, CloudFront, etc.
Experience with monitoring tools like DataDog, CloudWatch, and Grafana
Experience with Docker, ECS, Kubernetes or similar containerisation technologies
Knowledge of languages such as Bash, Python, NodeJS
Experience with IaC tools such as Terraform, Pulumi, and so on
Advantageous but not essential:
Experience with serverless architectures
Exposure to agile development methodologies
Excellent problem-solving skills
Strong written and verbal communication skills
Experience working across different teams & domains, and influencing others to solve complex problems
Benefits
💰 Competitive compensation, including equity in the company;
🌴 Generous vacation days so you can rest and recharge;
💊 Health perks such as private healthcare;
💪Fitness perks such as an onsite gym & fitness app subsidy;
🧩 "Flexible compensation plan" to help you diversify and increase the net salary;
🥳 Unforgettable Perk events, including travel to one of our hubs;
💙 Spring Health - Get access to 12x therapy & 12x coaching sessions per year!;
📈 Exponential growth opportunities;
🫶 VolunteerPerk - We offer 16 paid hours per year that you can use to give back to society by volunteering for a charity of your choice;
🌎 "Work from anywhere" in the world allowance of 20 working days per year;
📚 IRL English or Spanish Lessons are held in the Barcelona office;
Graduate Reliability Engineer at GKN Aerospace enhancing operational excellence through data analysis and project participation within large structural assemblies.
Site Reliability Engineer at WRITER, ensuring 24/7 availability and performance of AI - powered workflows. Collaborating on scalable infrastructure solutions while impacting enterprise customer trust.
Engineer at Trading Technologies improving platform stability through coding and automation. Focus on building advanced monitoring tools for global trading operations.
Senior ML Ops/DevOps developing MLOps platform components at Capco Poland for financial digital transformation. Responsibilities include CI/CD, model deployment, monitoring, and team collaboration.
Senior DevOps Engineer at Verisk, focusing on AWS infrastructure and CI/CD pipeline automation. Ensuring high availability and security through collaboration with development and QA teams.
Senior DevOps & Infrastructure Engineer at IMAGO focusing on automation and infrastructure improvements. Building reliable infrastructure and leading CI/CD optimization in a dynamic environment.
DevOps Specialist creating and overseeing Azure hybrid cloud infrastructures for EVLO's battery energy storage solutions. Collaborating with teams to implement cutting - edge technologies in a dynamic environment.
Software Quality and Release Engineer developing and maintaining C++/Python software solutions for aerospace and defense industry. Collaborating on CI/CD automation and feedback documentation.
Senior DevOps Engineer building and managing big data platforms for clients in telecommunications and finance industries. Ensuring stability, scalability, and performance across cloud and on - premise environments.