Linux System Administrator managing IT infrastructures for educational institutions and research. Collaborating on DevOps and HPC projects while ensuring system security and performance.
Responsibilities
Linux system administration
Set up Linux servers (Debian, Ubuntu, Rocky, ...) and keep them in operational and secure condition (MCO/MCS)
Manage cloud platforms (OVH, Azure) in collaboration with internal teams
Manage databases (MySQL, MariaDB, ...)
Manage users, access rights, and installed applications
Document, automate, and improve reliability
Contribute to the deployment and maintenance of CI/CD pipelines (GitLab)
Industrialize application packaging and delivery (Docker, Kubernetes, ...)
Develop and maintain infrastructure as code (Terraform, Ansible, ...)
Contribute to observability tooling (Grafana, Prometheus, ELK, ...)
Participate in the setup and management of our HPC platform to support teaching and research uses in AI and scientific computing (Slurm, Open OnDemand, JupyterHub, Apptainer, ...)
Integrate and maintain CPU / GPU / AI workload environments
Support users (faculty researchers, advanced students)
Apply hardening best practices, log monitoring, and updates
Contribute to security compliance and vulnerability remediation
Provide level 2/3 technical support to internal teams
Follow up with external service providers and integrators
Requirements
Bachelor’s or Master’s degree in Computer Science (engineering degree, master’s, or equivalent)
4+ years’ experience in Linux administration
Strong scripting skills (Shell, Python, ...)
Knowledge of a CI/CD tool (GitLab preferred)
Good understanding of virtualized environments and networking