Principal DevOps/SysOps Engineer responsible for deploying and maintaining Linux and Windows servers. The role centers on large-scale cloud infrastructure built with Docker, Kubernetes, and DevOps practices.
Responsibilities
Deploy, configure, and maintain Linux and Windows servers across physical and virtual environments.
Strong knowledge of HPC cluster management and job scheduling systems such as SLURM or Run:AI.
Hands-on experience implementing, administering, and optimizing Docker containers and Kubernetes in complex, large-scale environments.
Proven experience as a DevOps Engineer with a focus on CI/CD, containerization, GitOps and cloud technologies.
Proficiency in scripting languages such as Bash and Python for automation and performance tuning.
Experience with high-speed interconnects (InfiniBand, Ethernet), distributed storage systems, and cluster networking.
Configure and maintain firewalls, load balancers, and VPNs.
Proven ability to troubleshoot complex system and networking issues in large-scale clusters.
Install and utilize monitoring tools to ensure the health, performance, and security of systems.
Work with at least one major cloud provider (AWS, Azure, or Google Cloud) to deploy and manage cloud-based solutions.
Apply knowledge of cybersecurity best practices to enhance the security posture of systems.
Utilize infrastructure-as-code tools like Terraform and Ansible for efficient and reproducible infrastructure deployment.
Possess a good understanding of the software development lifecycle to collaborate effectively with development teams.
Requirements
Bachelor's degree in Computer Science, Information Technology, or a related field with 3+ years of experience.
Proven experience as a DevOps Engineer with a focus on CI/CD, containerization, and cloud technologies.
Strong knowledge and hands-on experience with Docker containers and Kubernetes.
Experience with at least one major cloud provider (AWS, Azure, or Google Cloud).
Understanding and application of cybersecurity principles.
Proficiency in infrastructure-as-code tools like Terraform and Ansible.
Excellent scripting skills in Bash and Python.
Familiarity with the software development lifecycle.
Strong knowledge of Linux operating systems.
Preferred Experience
CKA/CKAD/CKS certification.
Certifications in relevant DevOps, cloud, and cybersecurity domains.
Additional scripting languages and automation tool proficiency.
Active participation in community forums or open-source projects.