DevOps Engineer working with AI training data for global organizations. Collaborating with engineering, operations, and product teams to maintain servers and databases in a hybrid work environment.
Responsibilities
Design, implement, and maintain robust monitoring and alerting systems to ensure service reliability, security, and performance.
Manage and optimize deployment pipelines to enable automated deployments and robust release management.
Plan and maintain infrastructure, including capacity planning, system upgrades, patch management, and network management to ensure seamless connectivity and performance.
Develop automation tools and scripts to streamline CI/CD workflows, system builds, backups, environment provisioning, and network configuration.
Implement comprehensive security monitoring solutions to detect, prevent, and respond to potential threats, ensuring the integrity and confidentiality of systems and data.
Participate in on-call rotations to troubleshoot incidents, maintain system uptime, and ensure operational health.
Collaborate on designing and building scalable, fault-tolerant infrastructure for distributed systems.
Support performance optimization through load testing, monitoring, proactive system tuning, and continuous security assessments.
Requirements
1–3 years of experience in infrastructure engineering, systems administration, or software development.
Proficiency with at least one major cloud platform: AWS, GCP, or Azure.
Working knowledge of containerization (Docker) and virtualization (Proxmox VE).
Familiarity with Kubernetes or other orchestration tools.
Understanding of modern deployment practices and configuration management.
Experience scaling APIs, web applications, and distributed services.
Proficient in Git and Linux system administration and automation.
Exposure to Elasticsearch, including scaling and cluster deployment.
Hands-on experience with CI/CD systems such as Jenkins, GitLab CI, or CircleCI.
Strong verbal and written communication skills in English.
Full - Stack Engineer enhancing engineering productivity at Fidelity. Building internal tools for SRE teams to improve operational efficiency and reliability.
DevOps Engineer at Cloudogu working with development and operations for reliable software delivery. Focusing on CI/CD, infrastructure automation, and platform services in an agile environment.
Jr. DevOps Engineer supporting and improving CI/CD pipelines and Linux systems at Swift. Collaborating with senior engineers in a hands - on learning environment.
Senior DevOps Engineer I managing automation tooling and multi - cloud infrastructure at Spring Health. Collaborating with AI and Infrastructure teams in a hybrid Seattle office.
Site Reliability Engineer for cloudified backup platform using Commvault technology at Expleo. Joining a dynamic team to ensure backup infrastructure scalability and reliability.
Site Reliability Engineer responsible for designing and maintaining scalable services with high availability. Collaborating with development teams to enhance reliability and operational excellence.
Technical Staff leading the architecture, reliability, and modernization of enterprise ALM and DevOps tools. Driving strategy and influencing product development in collaboration with various teams.
Site Reliability Engineer responsible for reliability and availability, collaborating with development teams on scalable systems. Applying software engineering practices to improve production operations.
DevOps Engineer in the Security Data and AI Lab at Lloyds Banking Group driving data and cloud infrastructure's influence on product operations and customer service improvements.