Senior Systems Engineer – HPC at Rackspace Technology | Hybrid Hired

About the role

Install, configure, and maintain HPC clusters (hardware, software, operating systems).
Perform regular updates/patching and manage user accounts and permissions.
Troubleshoot/resolve hardware or software issues.
Monitor and analyze system and application performance, identify bottlenecks, and implement tuning solutions.
Manage job scheduling and resource allocation with schedulers such as Slurm and LSF, and manage clusters with tools such as Bright Cluster Manager, OpenHPC, and Warewulf.
Configure Linux networking (TCP/IP, DNS, routing) and HPC interconnects (InfiniBand, Ethernet).
Implement and maintain large-scale storage and parallel file systems (Lustre, Ceph, GPFS), ensuring data integrity and managing backups.
Implement security controls and manage authentication services like LDAP and Active Directory.
Automate deployments and system configurations using tools like Ansible, Terraform, Jenkins, and Git.
Provide technical support, documentation, and training to researchers, and collaborate with scientists and HPC architects.

Requirements

Bachelor’s degree in Computer Science, Engineering, or a related field (equivalent experience may substitute for degree).
Minimum of 10 years of systems experience, including at least 5 years working specifically with HPC.
Strong knowledge of Linux operating systems (e.g., Rocky Linux, Ubuntu) with a fundamental understanding of Linux internals, system administration, and performance tuning.
Experience building and managing RPM and DEB packages.
Experience with cluster management tools such as Bright Cluster Manager, the OpenHPC stack, or Warewulf.
Proficiency with job schedulers and resource managers such as Slurm and LSF.
Strong understanding of Linux networking (e.g., TCP/IP, DNS, routing) and HPC interconnects (e.g., InfiniBand, Ethernet) including performance tuning.
Knowledge of parallel file systems such as Lustre, Ceph, or GPFS.
Working knowledge of Linux authentication and directory services such as LDAP and Active Directory.
Proficiency in scripting languages (e.g., Python, Bash, R); familiarity with MPI libraries for parallel and distributed computing is nice to have.
Strong experience with DevOps and configuration management tools, including Ansible, Terraform, Jenkins, and Git.
Knowledge of HPC in cloud environments (e.g., AWS, Azure, GCP HPC offerings) is a plus.
Strong knowledge of Linux security, compliance standards, and data protection best practices.
Excellent communication, interpersonal, and problem-solving skills.
