AI Infrastructure Engineer designing and implementing AI/ML solutions for infrastructure use cases at Xsolla. Collaborating with teams to enhance the security posture of infrastructure systems.
Responsibilities
Design and implement AI/ML-powered solutions for infrastructure use cases, including predictive autoscaling, anomaly detection, intelligent cost optimization, and automated remediation across GCP and multi-cloud environments
Build and maintain AI-driven monitoring and observability systems that correlate logs, metrics, and traces to surface root causes, predict bottlenecks, and reduce mean time to resolution (MTTR)
Develop and operate automated incident response workflows using AI-powered playbooks that diagnose, contain, and resolve infrastructure issues with minimal manual intervention
Integrate AI tooling into CI/CD pipelines to improve deployment reliability, automate test prediction, score release health, and support rollback automation
Contribute to the development of internal AI agents and virtual assistants integrated into developer workflows (Slack, IDEs, Confluence) — enabling self-service for provisioning, troubleshooting, and infrastructure guidance
Implement AI/ML-based anomaly detection and automated vulnerability management workflows to enhance the security posture of Xsolla's infrastructure
Prototype and productionize Generative AI solutions for infrastructure automation, including auto-generation of Terraform/Puppet modules, IaC configurations, runbooks, and change documentation
Collaborate with senior engineers and leadership to evolve and execute the infrastructure AI strategy across its implementation phases
Maintain clear documentation of AI tools, integrations, and automated workflows; share knowledge and best practices across the team
Requirements
5–7 years of experience in infrastructure engineering, DevOps, SRE, or a related field
Hands-on experience with GCP (priority) and/or AWS; solid understanding of cloud resource management, scaling, and cost structures
Practical experience building or integrating AI/ML-powered tools in an operational context (anomaly detection, predictive models, LLM-based automation, or similar)
Experience with infrastructure-as-code tools — Terraform, Puppet, Ansible, or equivalent
Proficiency in Python for scripting, automation, and AI/ML integration; Bash or Go a plus
Working knowledge of Kubernetes and container orchestration in production environments
Familiarity with observability and monitoring stacks (Prometheus, Grafana, ELK, Datadog, or similar)
Familiarity with LLM APIs (OpenAI, Anthropic, or similar) and prompt engineering for operational use cases
Strong problem-solving mindset with a bias toward automation and eliminating toil
Infrastructure Engineer collaborating with teams to build infrastructure solutions at HCSC. Focusing on efficiency and improving deployment times in healthcare technology.
Infrastructure Engineer engineering infrastructure technology for cloud environments with security and operational compliance. Collaborating with stakeholders to inform product roadmaps and providing operational support.
Junior Infrastructure Engineer at ZILO, supporting AWS and cloud infrastructure deployment and maintenance. Collaborating with DevOps and Engineering teams on innovative technology solutions.
L2 Infrastructure Engineer at The Missing Link delivering high-quality tech support and managing modern endpoint environments in Pune. Join a collaborative team for innovative IT solutions.
Infrastructure Engineer designing and building workflows, internal tools, and services at MUBI. Collaborating in a hybrid London setting, connecting systems with AI-powered automation.
Infrastructure Engineer at Push Gaming, focused on building scalable backend systems for online casino games. Collaborating with teams on CI/CD pipelines and automation processes in a hybrid work environment.
Cloud Infrastructure Specialist handling Azure operations and vendor coordination. Driving resilient infrastructure projects with a collaborative, impact-driven team in Warsaw.
Windows Server Infrastructure Engineer maintaining critical enterprise Windows Server environments. Supporting DoD security compliance and infrastructure management for Federal clients in multiple locations.
Infrastructure Engineer contributing to AWS architecture and automation at Oddin.gg. Collaborating with teams to optimize performance and support developer experience.
Cloud Platform Infrastructure Engineer optimizing and managing cloud-native systems in Austin, TX. Collaborating with global teams and participating in agile development processes.