Senior Manager driving cloud infrastructure migration and high-performance computing solutions at Pfizer. Collaborating with HPC engineers to modernize the scientific computing platform.
Responsibilities
Design, implement, operate, and own robust infrastructure for HPC and ML/AI workloads in a cloud environment (AWS/GCP)
Lead containerization, deployment, and operation of user- and admin-facing HPC platforms (Slurm, Open On Demand, Prometheus/Grafana, batch and distributed computing platforms)
Partner with HPC specialists to capture institutional knowledge and manual processes in IaC workflows
Develop and maintain infrastructure automation using IaC tools like Terraform and CloudFormation
Create reusable Terraform modules and enforce standards
Operationalize containerized solutions using Docker and Kubernetes
Own full lifecycle of infrastructure management, from provisioning to operations
Perform troubleshooting, system analysis, and benchmarking
Develop and maintain monitoring, logging, and alerting for the infrastructure
Design new dashboards, workflows, and utilities to improve observability and workload efficiency
Document architecture, deployment processes, and operational procedures
Partner closely with team members to deliver scientific computing services including user support and resource optimization
Requirements
B.S. in computer science, life science, data science or similar fields
6+ years of experience in cloud infrastructure engineering with a proven track record of developing and supporting robust IaC deployments
Experience managing scientific computing workloads in an enterprise environment
Advanced experience with at least one of AWS and GCP, including knowledge of core compute and storage services relevant to HPC
Solid understanding of cloud networking, identity, and security controls
Prior experience with HPC deployment utilities including AWS ParallelCluster, AWS Parallel Computing Services, and Google Cloud Cluster Toolkit
Proficiency with distributed computing environments, especially EKS/GKE/Kubernetes
Familiarity with HPC environments, job schedulers (Slurm), HPC application containers (Docker, Singularity, Apptainer) and NVIDIA GPU computing
Candidate demonstrates diverse leadership experiences and capabilities including influencing and collaborating with peers, developing and coaching others, overseeing and guiding colleagues' work to achieve meaningful outcomes and create business impact.
Benefits
401(k) plan with Pfizer Matching Contributions and additional Pfizer Retirement Savings Contribution
Paid vacation, holiday and personal days
Paid caregiver/parental and medical leave
Health benefits including medical, prescription drug, dental and vision coverage
Senior Software Developer developing and optimizing software solutions for a technology - focused company. Engaging in project management, customer communication, and mentoring juniors in modern technologies.
Full - Stack Engineer developing core workflow automation platform for HR teams at peopleIX. Building capabilities to automate HR processes with AI and integrations.
Software Development Engineer II developing cloud features as part of an Agile Scrum team in Arlington, TX. Responsible for feature development, cloud migration, and enhancing product quality through best practices.
Software Development Engineer II developing cloud - ready products for GM Financial. Contributing to Agile teams and delivering high - quality software with minimal supervision.
Software Engineering Intern designing, building, and shipping internal tools for leasing, property management, and finance at Great Expectations. Working directly with leadership on real - world impactful projects.
Software Development Engineer focusing on building automation frameworks for QA in Mandaluyong City. Collaborating with QA and DevOps teams to enhance automation infrastructure and tools.
Full - stack Developer supporting digital customer experience transformation at USG. Involved in upgrading outdated technology stacks to modern solutions for improved customer experiences.
Senior Full Stack Engineer leading frontend development in React and collaborating on Golang APIs for an AI - native financial services platform. Driving technical architecture and mentoring team members for innovative solutions.
Senior Software Engineer expanding the capabilities of Sentry's analytics platform. Lead initiatives to improve data visibility and performance across billions of events.
Full Stack Software Engineer developing core Red Oak platform with a focus on innovative product features. Involvement in all phases of software development life cycle.