Site Reliability Engineer joining Mindvalley's Cloud Engineering team, focusing on building and improving cloud infrastructure in production environments.
Responsibilities
Support and maintain cloud infrastructure and services.
Help operate and manage containerized platforms and workloads.
Contribute to infrastructure changes using Infrastructure as Code.
Apply best practices for availability, security, and cost efficiency.
Assist with monitoring, alerting, and operational dashboards.
Participate in incident response and on-call rotations with guidance.
Support troubleshooting, root cause analysis, and operational improvements.
Help automate repetitive operational tasks to reduce manual effort.
Support CI/CD pipelines and deployment workflows.
Assist with application releases, rollbacks, and reliability improvements.
Collaborate with engineering teams to improve delivery and operational stability.
Requirements
1–2 years of experience (or equivalent practical exposure) in Cloud, DevOps, SRE, or Platform Engineering
Familiarity with cloud platforms and container technologies
Exposure to Infrastructure as Code, CI/CD, or automation tools
Basic scripting skills (Python or Bash)
Understanding of operational best practices, including secure configurations
Strong problem-solving skills and willingness to learn
Good communication and collaboration skills
Nice To Have
Experience with Kubernetes or container orchestration
Familiarity with deployment workflows and release processes
Exposure to monitoring, logging, or incident response practices
Mindvalley is an equal opportunity employer and does not discriminate on the basis of race, colour, religion, gender identity or expression, national origin, age, disability, marital status, sexual orientation, or any other legally protected status.
We are committed to creating a diverse and inclusive workplace and encourage applications from all qualified individuals.