Site Reliability Engineer at BAE Systems improving service availability and team collaboration. Engaging with the community to develop local tech and cyber skills while supporting core applications.
Responsibilities
Supporting and maintaining essential services that support core mission applications
Proactively enhancing their availability, performance and stability
Finding innovative solutions to problems rather than undertaking repetitive work, automating everything you can
Advising product teams of good practice in how to design and build systems
Instrumenting applications where they don’t have sufficient monitoring in place
Participating in the wider DevOps/SRE community within the organisation
Requirements
Experience in software development in Java and web technologies, e.g. JavaScript and HTML
Familiarity with database technologies such as Elasticsearch and MongoDB
Knowledge of Linux and Windows command lines, e.g. Bash and PowerShell
Hands-on experience with cloud infrastructure such as AWS, Azure or OpenStack
Use of deployment tools such as Chef and Puppet
Expertise in monitoring large systems using technologies such as ELK
Experience of working in an Agile Scrum team and with the tooling that supports it, e.g. Jira
Experience diagnosing and troubleshooting application issues that result in service outages
Troubleshooting skills across different levels of the stack
Understanding of ITIL terminology
Experience with microservices architectures and container management tools such as Docker
Familiarity with test automation frameworks such as Selenium
Awareness of and insight into technology trends, with a view to adopting new cutting-edge tools
DevOps and Build Engineer for NVIDIA developing and maintaining CI/CD pipelines. Collaborating with teams to enhance compiler technologies and optimize build performance in a diverse environment.
Senior AWS DevOps Developer responsible for managing AWS infrastructure for enterprise public budgeting software at Euna Solutions. Collaborating on cloud projects and enhancing system reliability and performance.
Principal AI Site Reliability Engineer driving operational excellence for critical contact center applications at Fidelity. Leading automation and observability initiatives to improve reliability and efficiency.
Data Transport Infrastructure DevOps Engineer at Leidos modernizing global-scale multi-cloud environments for USAF missions. Involves developing cloud-native solutions and ensuring security best practices.
DevOps Engineer responsible for building and optimizing AWS-based infrastructure and backend systems at Allguth GmbH. Part of a team focused on innovative mobility solutions in the Munich region.
(Senior) DevOps Engineer specializing in ML solutions implementation and management in Germany. Focused on CI/CD pipelines, automation, and cloud services.
Specialist DevSecOps joining Periferia IT Group, a leader in digital transformation. Work in a dynamic environment with continuous learning and professional development opportunities.
Join Zinkworks as a Senior Platform Engineer designing scalable IaC-driven cloud platforms for a large-scale enterprise contact centre. Focused on automation, reliability, and platform ownership in a hybrid work environment.
Asset Reliability Engineer providing maintenance advice and service innovations. Join Sensorfact, the leading smart monitoring platform, to modernize the industrial sector.