Cloud & AI Systems Engineer designing and optimizing Azure cloud infrastructure for health care solutions provider. Collaborating with teams on AI/ML workloads and implementing security and compliance measures.
Responsibilities
Lead the design, implementation, and continuous improvement of Microsoft Azure cloud technologies (IaaS, PaaS, SaaS)
Develop Azure policies, ARM templates, Blueprints, and governance strategies
Build and maintain secure, scalable, and cost-optimized infrastructure solutions
Drive infrastructure automation and self-service provisioning through Infrastructure-as-Code (IaC), scripting, and DevOps pipelines
Monitor system performance, troubleshoot incidents, and ensure high availability and disaster recovery readiness
Collaborate in cross-functional teams (e.g., with Finance, Data Science, Software Engineering and Security) to design, deploy, and maintain secure and scalable AI/ML pipelines
Support retrieval-augmented generation (RAG) pipelines by implementing and maintaining secure endpoints, network isolation, and proper access controls
Manage API security, authentication, and token usage for AI and LLM-based services, ensuring visibility into utilization and cost tracking
Deploy and monitor machine learning models and APIs using Azure ML, Azure OpenAI, and containerized inference solutions
Support AI DevOps/MLOps automation using Bicep, Terraform, or MLflow to streamline model deployment, configuration, and lifecycle management
Implement monitoring and observability using Azure Monitor, Application Insights, and related tools to track AI API performance, latency, and errors
Assist in operationalizing AI workloads in Azure ML, OpenAI, or custom inference endpoints while ensuring security and compliance alignment
Contribute to AI governance by helping define secure model lifecycle processes, role-based access controls, and compliance checkpoints across environments
Implement and manage AI-specific security controls, such as prompt injection mitigation, data masking in training datasets, and LLM access policies
Collaborate with security team to evaluate and secure AI model endpoints, ensuring compliance with HIPAA and SOC 2 for AI workloads
Support the development and enforcement of cloud security policies and incident response plans
Collaborate with the security team to investigate cyber threats and deploy AI-driven threat detection and response tools
Ensure compliance with industry standards (e.g., HIPAA, SOC 2, HITRUST) in all cloud and AI implementations
Requirements
Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience)
5–7 years of experience with Microsoft Azure cloud architecture and automation, including design, deployment, and optimization of cloud environments, with exposure to AI/ML production systems
Proven ability to build and maintain AI-native pipelines, including data ingestion, embeddings, inference endpoints, and vector databases
Hands-on experience with Python for automation, scripting, and AI integration
Familiarity with LLM application frameworks (e.g., LangChain, Semantic Kernel) and MLOps platforms (e.g., Azure ML Pipelines, MLflow, Kubeflow)
Strong understanding of data engineering fundamentals, model deployment workflows, and AI observability tools (e.g., Weights & Biases, Arize, PromptLayer)
Proficiency with Infrastructure-as-Code (IaC) tools such as Terraform, ARM templates, or Bicep
Expertise in containerization and orchestration technologies (e.g., Docker, Kubernetes) for AI and cloud workloads
Solid knowledge of Active Directory, Windows Server OS, VMware, networking, and hybrid cloud environments
Systems Engineer designing system architecture for high - speed flight vehicles at Hypersonica. Collaborating in an elite team to advance European defence technology.
Cloud Systems Engineer managing cloud infrastructure projects for federal clients. Supporting stability and reliability of cloud - based systems and networks with a focus on innovative solutions.
Senior Systems Engineer supporting DIAs National Digital Exploitation and OSINT Center for a tech company serving public sector clients. Manage systems engineering tasks and lead a scrum software development team.
Principal Engineer defining and architecting distributed AI systems across heterogeneous compute platforms at Intel. Focusing on dynamic execution and optimization of large - scale AI computation graphs.
Lead Supply Management for Micron's semiconductor materials and delivery in Penang, Malaysia. Collaborate cross - functionally for global supply fulfillment and manage supplier relationships effectively.
System Engineer for network technology in telecommunications company offering tailored internet services. Responsible for firewall administration, network components, and strategic project management.
System Engineer focusing on Citrix and Azure Virtual Desktop at DATAGROUP in Kaunas. Develop technical solutions, implement them independently, and ensure smooth operations.
Principal Platform Systems Engineer leading test automation for maritime AI solutions. Focused on building infrastructure for cloud, edge compute, and embedded systems.
Principal Software Systems Engineer supporting Northrop Grumman's Sentinel program based in Colorado Springs or Huntsville. Involves collaboration in software development and best practices.
Systems Engineer specializing in automated warehouse systems for key warehouse automation projects at Dematic. Overseeing technical integration and collaborating with engineering teams to deliver innovative solutions.