Platform Engineer in MLOps team at EIT, building cloud infrastructure and enabling scientific breakthroughs. Seeking experienced individual to automate operational processes and enhance experimentation.
Responsibilities
**Day-to-day, you might:**
Architect, build, and operate our cloud platform, moving infrastructure beyond the initial setup to deliver resilient compute, network, and storage, including full-sized GPU clusters
Drive the implementation of highly structured, auditable delivery pipelines (CI/CD/GitOps) using to enforce automated, repeatable infrastructure changes
Design and deploy automated governance and security controls using Policy-as-Code (specifically Kyverno and YAML) to ensure strong isolation, protect data, and meet internal audit standards
Establish the foundational monitoring, alerting, and telemetry framework required for robust operations, defining clear SLOs, and setting the course for future SRE work
Partner with Research and Data teams to build self-service capabilities that efficiently support diverse workloads, from Python notebooks to distributed clusters
Requirements
**What makes you a great fit:**
Proven experience platform engineering, with a demonstrable track record of architecting and automating operational processes
A highly proactive attitude and a passion for introducing and automating operational structure
Expertise with at least one major cloud provider (OCI, AWS, GCP, or Azure)
Proficiency with Terraform for declarative, large-scale infrastructure provisioning
Comfortable with operating and managing large-scale, resilient Kubernetes clusters
Proficiency in at least one major language for system-level tools (e.g. Python, Go, or Java) with some scripting experience
**It would also be great if you had:**
Familiarity with modern Policy-as-Code tooling
A passion for introducing and automating operational rigour and structure
Experience supporting ML and Data Engineering workloads
Benefits
**We offer the following salary and benefits:**
Enhanced holiday pay
Pension
Life Assurance
Income Protection
Private Medical Insurance
Hospital Cash Plan
Therapy Services
Perk Box
Electric Car Scheme
-
**Why work for EIT:**
At the Ellison Institute, we believe a collaborative, inclusive team is key to our success. We are building a supportive environment where creative risks are encouraged, and everyone feels heard. Valuing emotional intelligence, empathy, respect, and resilience, we encourage people to be curious and to have a shared commitment to excellence. Join us and make an impact!
Lead Platform Engineer at TD Securities, developing a high performing Trading Risk Warehouse platform. Responsible for ensuring stability and scalability, while managing underlying infrastructure and supporting development teams.
Lead Platform Engineer at Capital One driving transformation in technology and solutions with Agile practices and DevOps tools. Collaborating on complex technical problems in a fast - paced environment.
Data Platform Engineer managing daily operations of data platforms for a global cybersecurity company. Collaborating with teams to ensure platform reliability and performance.
Senior Platform Engineer focused on building internal platform capabilities for developer tooling and experience at MONY Group. Collaborating with teams to enhance platform engineering and software delivery.
Databricks Platform Engineer working on AWS ecosystem design, build, and optimization. Responsible for implementing scalable pipeline solutions across data platforms.
Senior Data & Platform Support Engineer supporting Oracle databases at the Federal Reserve Bank. Collaborating with teams to ensure operability of payment systems and enhance business outcomes.
IT Project Manager involved in managing diverse projects at Fidelity focusing on architecture and data solutions. Lead delivery teams in technology initiatives enhancing existing systems.
Data Platform Engineer transforming operational data into clean, analysis - ready datasets for Versana's platform. Collaborating within a team to implement data engineering practices and ensuring data quality standards.