Hybrid Tech Lead – Cloud Reliability Engineering

Posted last month

Apply now

About the role

  • Writing software/scripts to automate operations on our platform, reducing support requests and engineering time.
  • Solving operational issues for our customers by investigating technically and liaising between Mendix Support (1st line) and other development teams in R&D.
  • Providing out of hours support for critical customer issues on an on-call basis.
  • Creating and maintaining monitoring & alerting systems to provide real-time visibility into the performance and availability of the platform (SRE).
  • Developing and maintaining dashboards & reports to track key performance indicators and identify trends and issues.
  • Delivering and supporting a high quality, highly available public cloud platform where customers can run Mendix apps.
  • Developing and running Mendix Cloud infrastructure and services that offer deployment, operations and monitoring.

Requirements

  • You have experience with Site Reliability Engineering (SRE).
  • You have coding skills, ideally in Python; it’s a plus if you also have experience with Golang.
  • You have good knowledge of infrastructure (AWS).
  • You have experience with Infrastructure as Code (IaC), preferably Terraform or OpenTofu.
  • You have strong experience with containerization technologies, primarily Kubernetes.
  • You're comfortable writing a Python script to automate complex tasks to reduce manual effort.
  • You have excellent communication and people skills, both written and verbal.
  • You have the ability to spearhead, manage and explain complex technical issues and reduce them to a form that less technical customers & colleagues can understand.
  • A deep understanding of Cloud architecture/deployment and infrastructure services like web servers, load balancing, SSL/TLS/X509, etc.
  • You have experience with monitoring and logging tools such as CloudWatch, ELK, Grafana, Datadog or Prometheus.
  • Proven experience administering, developing against, or architecting on a cloud platform (AWS is preferred; GCP or Azure acceptable).
  • You have strong experience with containers and Linux/Unix systems.
  • You are familiar with SQL/databases (primarily PostgreSQL).
  • A passion for investigating complex issues and finding out the solution in a platform with many distributed applications.

Job title

Tech Lead – Cloud Reliability Engineering

Job type

Experience level

Senior

Salary

Not specified

Degree requirement

No Education Requirement

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job