Intermediate to Advanced level HPC Workload System Administrator at Leidos, supporting the DoD HPC Modernization Program. Engage in operations, testing, deployment, and administrative support for high performance computing environments.
Responsibilities
Support the day-to-day operations, testing, deployment, administration/management, reporting, and analysis tools for examining workload management/job scheduler activity on high-performance computers
Provide Tier III HPC support to the HPC site
Accurately forecast and communicate resource limitations, and recommend ways to increase resource efficiency through proper scheduling and load-balancing techniques
Participate in the installation, integration, acceptance testing, and ongoing maintenance of HPC systems and their software environment
Maintain and/or develop software code that is used to report Job Accounting on HPC systems to the HPCMP
Develop, install, and maintain requested software including file/data profiling, text transposing/linters, and interactive processing scripts
Requirements
Bachelor’s degree in computer science or related field
At least 8 years of experience in a large and complex IT environment
Must have an active Secret Clearance and be able to obtain and maintain a TS/SCI security clearance
IAT Level II Certification Required
Experience with Red Hat Enterprise Linux (RHEL), CentOS, or other Linux variants
Hands-on support and administration of workload management batch job schedulers such as Altair PBS Pro or Slurm
Industry- and government-recognized functional expertise in workload management, including validation, scheduling policies, and post-run processing
Must have experience installing, testing, and supporting COTS, GOTS, and open-source software