Senior Manager driving cloud infrastructure migration and high-performance computing solutions at Pfizer. Collaborating with HPC engineers to modernize the scientific computing platform.
Responsibilities
Design, implement, operate, and own robust infrastructure for HPC and ML/AI workloads in a cloud environment (AWS/GCP)
Lead containerization, deployment, and operation of user- and admin-facing HPC platforms (Slurm, Open OnDemand, Prometheus/Grafana, batch and distributed computing platforms)
Partner with HPC specialists to capture institutional knowledge and manual processes in IaC workflows
Develop and maintain infrastructure automation using IaC tools like Terraform and CloudFormation
Create reusable Terraform modules and enforce standards
Operationalize containerized solutions using Docker and Kubernetes
Own the full lifecycle of infrastructure management, from provisioning to operations
Perform troubleshooting, system analysis, and benchmarking
Develop and maintain monitoring, logging, and alerting for the infrastructure
Design new dashboards, workflows, and utilities to improve observability and workload efficiency
Document architecture, deployment processes, and operational procedures
Partner closely with team members to deliver scientific computing services including user support and resource optimization
Requirements
B.S. in computer science, life science, data science, or a similar field
6+ years of experience in cloud infrastructure engineering with a proven track record of developing and supporting robust IaC deployments
Experience managing scientific computing workloads in an enterprise environment
Advanced experience with at least one of AWS or GCP, including knowledge of core compute and storage services relevant to HPC
Solid understanding of cloud networking, identity, and security controls
Prior experience with HPC deployment utilities including AWS ParallelCluster, AWS Parallel Computing Service, and Google Cloud Cluster Toolkit
Proficiency with distributed computing environments, especially EKS/GKE/Kubernetes
Familiarity with HPC environments, job schedulers (Slurm), HPC application containers (Docker, Singularity, Apptainer), and NVIDIA GPU computing
Demonstrated leadership experience, including influencing and collaborating with peers, developing and coaching others, and overseeing and guiding colleagues' work to achieve meaningful outcomes and business impact
Benefits
401(k) plan with Pfizer Matching Contributions and additional Pfizer Retirement Savings Contribution
Paid vacation, holiday and personal days
Paid caregiver/parental and medical leave
Health benefits including medical, prescription drug, dental and vision coverage
Staff Software Engineer optimizing computational cloud infrastructure for R&D teams at Pfizer. Leading strategy and stakeholder engagement for scientific workloads migration and resource management.
Senior Software Engineer responsible for designing and developing software solutions at Parkhill. Leading technology initiatives to enhance digital capabilities for architectural and engineering workflows.
Senior Software Engineer developing high-quality software solutions for various clients at 8th Light. Collaborating with teams to implement innovative technologies and drive project success.
Principal Software Engineer leading high-stakes consulting engagements at 8th Light, architecting scalable solutions and fostering client trust in technology.
Intern Embedded Software Developer joining DAS EMEIA KDC, focusing on firmware development for embedded devices. No prior job experience required, just curiosity and a mindset for problem solving.
Senior Software Engineer developing AI-powered solutions at NetDocuments. Building scalable backend systems and collaborating within a modern engineering team.
Principal Software Engineer leading AI architecture for CBS Sports' digital platforms. Collaborating across teams to deliver cutting-edge sports media experiences.
Senior Software Engineer building critical software for Enso pain relief device. Leading a full-stack environment with React Native, Node.js, and TypeScript collaborations.
Software Engineer responsible for design and development of software solutions for DOD and Intel communities. Working with Microsoft technologies and ensuring compliance with security standards.
Program Mission Assurance Engineer for Northrop Grumman ensuring technical requirements integration and collaborating on quality standards. Overseeing program risks, conducting quality reviews, and analyzing testing processes.