Expert Linux systems engineer managing server infrastructure for Ford's High Performance Computing environment. Developing solutions for resource-intensive computational tasks in a multi-datacenter setup.
Responsibilities
Actively design, implement and support HPC compute resources, and related infrastructure
Assist application teams with optimizing workflows for the environment
Develop and support automation and scripts (Python, Perl, bash, Ansible) and the servers related to those automations (Satellite, Ansible Automation Platform)
Monitor and troubleshoot system failures (occasional on-call)
Requirements
Bachelor's degree in related field or equivalent work experience
5+ years of Linux administration experience
2+ years of server automation experience
Strong server and infrastructure automation knowledge
Strong architectural and engineering experience with enterprise Linux environments
Strong scripting experience with Python and Bash, and willingness to learn other languages.
Performs at an extremely high level of technical competence and maturity
Excellent problem-solving and troubleshooting skills
Excellent communication skills and ability to utilize desktop tools to accelerate communications (Grafana, mkdocs, Wikis, IM, MS Teams, etc.)
Seeks out improvements to processes and business offerings
Ability to create detailed technical documentation
Ability to handle multiple complex projects at one time
Nice to have
Experience using build and configuration automation (Red Hat Satellite, Ansible Automation Platform, etc.)
Exposure to HPC interconnect technologies like InfiniBand or MPI
Familiarity with batch workload managers
Experience using SRE practices and Jira/Agile
Experience managing Red Hat Enterprise Linux
Benefits
Immediate medical, dental, and prescription drug coverage
Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
Vehicle discount program for employees and family members, and management leases
Tuition assistance
Established and active employee resource groups
Paid time off for individual and team community service
A generous schedule of paid holidays, including the week between Christmas and New Year’s Day
Paid time off and the option to purchase additional vacation time.
MES Senior Systems Engineer enhancing Manufacturing Execution Systems within a global pharmaceutical environment. Ensures stable operation, compliance, and continuous improvement of MES applications and environments.
Principal Systems Engineer III for E - INFOSOL designing and maintaining data center networking infrastructure. Requires active Top - Secret clearance and extensive networking experience.
Senior Systems Analyst supporting the onboarding of applications in a high - availability enterprise platform at E - INFOSOL. Focused on customer engagement and requirements documentation.
Senior Developer leading the development of a custom CopyTrader system using cTrader for fintech clients. Responsible for system integration and high - performance architecture in a hybrid setting.
Senior MCU System Engineer designing and developing MCU - based high - performance and zonal controllers for SDV at 42dot. Involves hardware abstraction and system optimization tasks.
Linux System Engineer at the Allen Institute managing IT infrastructure for scientific computing with over 400 servers. Deploying cloud services and engaging in lifecycle management.
Senior Developer enhancing One Identity Manager solutions for the Department of Agriculture, Fisheries and Forestry. Collaborating with stakeholders to implement custom solutions and integrations.
Senior Developer enhancing One Identity Manager solutions for the Department of Agriculture, Fisheries and Forestry. Involves application development, integrations, and leadership within IGA.
Senior OneID Application and Systems Developer improving identity management for the Department of Agriculture. Focusing on application development and integration within the One Identity Manager platform.