Intermediate to Advanced level HPC Workload System Administrator at Leidos, supporting the DoD HPC Modernization Program. Engage in operations, testing, deployment, and administrative support for high performance computing environments.
Responsibilities
Support the day to day operations, testing, deployment, administration/management, reporting, and analysis tools for examination of workload management/job scheduler activity on high performance computers
Provide Tier III HPC support to HPC site
Correctly forecast and express resource limitations and provide recommendations for increasing the efficiency of resources through proper scheduling and load balancing techniques
Participate in the installation, integration, acceptance testing, and on-going maintenance of HPC systems and software environment
Maintain and/or develop software code that is used to report Job Accounting on HPC systems to the HPCMP
Develop, install, and maintain requested software including file/data profiling, text transposing/linters, and interactive processing scripts
Requirements
Bachelor’s degree in computer science or related field
At least 8+ years of experience in a large and complex IT environment
Must have an active Secret Clearance and be able to obtain and maintain a TS/SCI security clearance
IAT Level II Certification Required
Experience with Red Hat Enterprise Linux (RHEL), CentOS, or Linux variants operating systems
Hands-on support and administration of Workload Management Batch Job Schedulers such as Altair PBS Pro, Slurm
Provide industry and government recognized functional expertise with workload management, including validation, scheduling policies, and post-run processing
Must have experience with installing, testing and supporting COTS, GOTS, and open-source software
Systems Administrator responsible for maintaining core IT infrastructure and serving as escalation point. Providing troubleshooting for IT systems including server and network environments in Traverse City, MI.
IAM Okta Administrator responsible for maintaining IT infrastructure and security at Blue Cross and Blue Shield. Collaborating with IT teams and implementing best practices for security and compliance.
Systems Administrator managing Linux systems and optimizing infrastructure at Alongside. Focus on automation, performance, and collaboration with tech teams in a hybrid environment.
Systems Administrator providing Tier 3 support for the OSINT Integration Center under the DOMEX Technology Platform contract. Working with Linux and Windows systems administration in a collaborative environment.
Systems Administrator supporting the Reagan Test Site operations with focus on Windows administration. Working in a secure computing environment at Kwajalein Atoll, Marshall Islands.
Systems Administrator supporting missile testing and space operations on Kwajalein Atoll in the Marshall Islands. Focus on Windows administration within a secure IT environment.
System Administrator managing IT infrastructure in the Eng Digital division. Involves Active Directory management, network services handling, and VMware configurations.
Salesforce Administrator at Sinch developing solutions using declarative tools and complex business process automations. Collaborating with teams to enhance the Salesforce ecosystem and support stakeholders.
System Administrator managing VMware and Microsoft 365 environments for a growing technology startup. Involves virtual infrastructure management and support for internal teams.