Linux System Administrator managing IT infrastructures for educational institutions and research. Collaborating on DevOps and HPC projects while ensuring system security and performance.
Responsibilities
Linux system administration
Set up Linux servers (Debian, Ubuntu, Rocky, ...) and keep them in operational and secure condition (MCO/MCS)
Manage cloud platforms (OVH, Azure) in collaboration with internal teams
Manage databases (MySQL, MariaDB, ...)
Manage users, access rights, and installed applications
Document, automate, and improve reliability
Contribute to the deployment and maintenance of CI/CD pipelines (GitLab)
Industrialize application packaging and delivery (Docker, Kubernetes, ...)
Develop and maintain infrastructure as code (Terraform, Ansible, ...)
Participate in observability (Grafana, Prometheus, ELK, ...)
Participate in the setup and management of our HPC platform to support teaching and research uses in AI and scientific computing (Slurm, Open OnDemand, JupyterHub, Apptainer, ...)
Integrate and maintain CPU / GPU / AI workload environments
Support users (faculty researchers, advanced students)
Apply hardening best practices, log monitoring, and updates
Contribute to security compliance and vulnerability remediation
Provide level 2/3 technical support to internal teams
Follow up with external service providers and integrators
Requirements
Bachelor’s or Master’s degree in Computer Science (engineering degree, master’s, or equivalent)
4+ years’ experience in Linux administration
Strong scripting skills (Shell, Python, ...)
Knowledge of a CI/CD tool (GitLab preferred)
Good understanding of virtualized environments and networking
DevOps Engineer building and maintaining authentication platforms in multi-cloud environments. Using technologies like Terraform, Ansible, and Python for automation and optimization.
Cloud Engineer developing Infrastructure-as-Code with Terraform and Azure DevOps. Managing Azure infrastructure and leading incident response within cross-functional teams.
DevSecOps Engineer at Skillfield working on secure CI/CD pipelines for mobile-first delivery. Collaborating with teams to embed security and automation in the delivery lifecycle.
Lead DevOps Engineer focused on AWS and Azure data platform solutions. Collaborating with teams to deliver scalable, secure, and highly available solutions.
DevOps Engineer working at GRÜN Software Group to automate and maintain stable infrastructures. Collaborating with teams to improve deployments and processes for better performance.
Azure SRE Engineer responsible for designing and maintaining secure, scalable Azure cloud infrastructure. Driving automation and operational excellence for leading organizations in technology transformation.
Senior Manager of Site Reliability Engineering overseeing Workday's Kubernetes-based platform. Leading teams while ensuring high availability and collaborating with federal agencies.
Site Reliability Engineer focusing on AWS cloud environments, SRE practices, and system reliability within GFT's team. Collaborating on cloud migrations and observability initiatives.
Consultant at Minsait supporting technical decisions in infrastructure automation and developing solutions. Collaborating with teams to maintain and evolve automation platforms.