Principal IaaS Engineer leading the architecture and standardization of AI infrastructure. Collaborating with data and security teams to enhance global infrastructure platforms.
Responsibilities
Architect and evolve the company’s IaaS platform across hybrid environments (on-premises, distributed), enabling secure and scalable compute foundations.
Design, build, and maintain infrastructure automation frameworks using Terraform, Pulumi, and Ansible, including development of custom providers and modules.
Define and enforce engineering standards for infrastructure provisioning, networking, and observability to ensure reliability, security, and consistency.
Lead evaluation and integration of core technologies including OpenShift, Kubernetes, MAAS, and Ceph to optimize performance, cost, and maintainability.
Drive multi-tenant PaaS initiatives and private cloud modernization leveraging OpenShift, Juju, and S3-compatible storage (Ceph, MinIO, TrueNAS).
Collaborate with Data, ML, and Platform Engineering teams to align IaaS architecture with emerging workloads such as data pipelines, MLflow, and Airflow orchestration.
Establish GitOps and CI/CD frameworks (ArgoCD, Helm, GitHub Actions, Azure DevOps) for consistent infrastructure delivery and configuration management.
Lead capacity planning, HA/DR strategy, and monitoring/alerting design using Prometheus, Grafana, and Loki stacks.
Partner with InfoSec to embed zero-trust, OIDC/SAML-based IAM, and secret management best practices into the infrastructure lifecycle.
Mentor engineers and contribute to organization-wide technical enablement through documentation, workshops, and community participation.
Requirements
10+ years of experience designing and operating large-scale infrastructure systems across on-prem and cloud environments.
Proven expertise in Infrastructure as Code (Terraform, Pulumi, Ansible) with experience authoring reusable modules and providers.
Deep understanding of hybrid and private cloud platforms (OpenShift, Juju, MAAS, OpenStack, VMware, Proxmox).
Strong background in storage (Ceph, TrueNAS, S3, NFS) and networking (VLAN, VXLAN, SDN) for high-availability architectures.
Demonstrated experience building GitOps-based deployment pipelines and maintaining production-grade Kubernetes environments.
Familiarity with data and ML infrastructure integration (MLflow, Airflow, Databricks, or Spark preferred).
Strong proficiency in Python, Go, and Bash for automation and platform tooling.
Excellent cross-functional leadership, communication, and mentorship skills.
Lead Systems Engineering Value Stream and manage a team of systems engineers at Northrop Grumman. Drive value stream integration for major defense systems development milestones.
Mid-Level Systems Analyst focusing on virtualized PABX support for Enghouse in Brazil. Analyzing incidents and ensuring system stability while collaborating with manufacturers.
Systems Engineer developing cutting-edge ISR and aviation solutions at SNC. Researching, modeling, and testing advanced aerospace systems with a cross-functional engineering approach.
Senior Systems Analyst leading technical projects to modernize banking collection platforms. Ensuring that financial systems in Brazil are integrated and compliant with regulatory standards.
Senior System Engineer for HV Powertrain & Charging Systems at Expleo. Responsible for defining architecture, requirements, and validation, along with integration and technical coordination.
System Engineer for Body Controls in the automotive industry. Responsible for requirements management, system architecture, supplier coordination, and validation planning.
System Engineer developing Digital Cockpit features within Connected Architecture. Collaborating on navigation, voice assistant, and multimedia interfaces in premium automotive.
Senior Business Systems Analyst developing IT solutions aligned with strategic reliability objectives at MPC. Partnering closely with stakeholders to enhance systems and drive improvements.
Lead Cloud Systems Engineer managing Microsoft 365 and AWS collaboration tools at U.S. FinTech. Responsible for architecting and optimizing collaboration platforms for remote work.
Lead Systems Engineer designing and implementing complex systems for national security missions at Boeing. Join a high-performing team and work on exciting advanced technology projects.