Lead SRE ensuring system reliability and scalability at Veepee, a leading e-commerce company. Collaborating with teams to automate processes and support technical challenges.
Responsibilities
Implementing tools and processes for deployment and industrialization (CI/CD, blue/green, canary, rollback, etc.);
Automating provisioning of a resilient infrastructure that meets the needs of products;
Working with development teams to facilitate regular releases;
Maintaining services in operational conditions, analyze and resolve performance and scalability anomalies (load tests) of current and historical deployments;
Supervising the application portfolio in collaboration with the Network Operations Center (NOC), manage access and security;
Participating in the evolution of the IS (VMware migration to KVM and service offer) and the reduction of the technical debt;
Being the evangelist of DevOps’ good practices and participate in the construction of a true transversal SRE community within Veepee.
To share company information & spread out team activity;
To define and run a clear and relevant organization within the team;
To develop the team without doing micromanagement;
Requirements
At least 3 years of experience in a similar function;
Knowledge of industrialization processes, agile methods, gitflow flow and DevOps practices in general and understanding of a system side;
Experience in maintaining high levels of availability;
On-call organization and incident response;
Familiar with Linux (good knowledge), knowledge of Windows would be a plus;
Proficiency with IaC: Packer, Terraform, Ansible, Puppet;
SUP: Icinga, ELK, Prometheus;
Hands on with Docker, Kubernetes, Nomad, Consul.
Proficiency with different types of DB such as, PostgreSQL, MongoDB, ElasticSearch;
You have strong verbal and written English language skills.
Empathetic and open-minded
Benefits
Dynamic and creative environment within international teams
The variety of self-education courses on our e-learning platform
The participation in meetups and conferences locally and internationally
DevOps Engineer at Cloudogu working with development and operations for reliable software delivery. Focusing on CI/CD, infrastructure automation, and platform services in an agile environment.
Jr. DevOps Engineer supporting and improving CI/CD pipelines and Linux systems at Swift. Collaborating with senior engineers in a hands - on learning environment.
Senior DevOps Engineer I managing automation tooling and multi - cloud infrastructure at Spring Health. Collaborating with AI and Infrastructure teams in a hybrid Seattle office.
Site Reliability Engineer for cloudified backup platform using Commvault technology at Expleo. Joining a dynamic team to ensure backup infrastructure scalability and reliability.
Site Reliability Engineer responsible for designing and maintaining scalable services with high availability. Collaborating with development teams to enhance reliability and operational excellence.
Technical Staff leading the architecture, reliability, and modernization of enterprise ALM and DevOps tools. Driving strategy and influencing product development in collaboration with various teams.
Site Reliability Engineer responsible for reliability and availability, collaborating with development teams on scalable systems. Applying software engineering practices to improve production operations.
DevOps Engineer in the Security Data and AI Lab at Lloyds Banking Group driving data and cloud infrastructure's influence on product operations and customer service improvements.
Senior Platform DevOps Engineer at Code Metal designing and implementing cloud and hybrid infrastructure to support customer deployments and internal platforms. Collaborating with software and security teams for reliable delivery.