Senior Site Reliability Engineer at Instabase. Lead the SRE team to design, operate, and improve SaaS cloud infrastructure, CI/CD, Kubernetes, and production reliability.
Responsibilities
Define and steer the technical direction for your team, collaborating with cross-functional partners
Develop and execute comprehensive short- and long-term roadmaps balancing business needs and user experience
Oversee cloud infrastructure and deployment automation to ensure efficient, reliable operations
Guarantee uptime and reliability for production systems through proactive monitoring and production support
Manage vulnerability assessments and facilitate prompt remediation to maintain security
Maintain and enhance CI/CD and build infrastructure to support seamless development workflows
Implement and optimize tools that enhance developer productivity and streamline processes
Drive improvements in release management processes and tooling to ensure smooth, reliable software delivery
Requirements
5+ years of experience in Site Reliability Engineering, Software Engineering, or Production Engineering
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience
Demonstrated experience in managing and sustaining SaaS production environments
Hands-on experience with major cloud providers such as AWS and Azure
Proficient in containerization technologies like Docker
Expertise in container orchestration platforms, especially Kubernetes
Skilled in overseeing and managing software release processes
Systematic approach to solving platform and production issues and a passion for automation
Track record of setting technical and cultural standards for engineering teams
Lead DevOps Developer at Boeing, focusing on CI/CD and cloud infrastructure management. Collaborating with teams to automate processes and improve system performance across environments.
Vulnerability & Configuration Management Engineer responsible for vulnerability management and remediation processes at Relax Gaming. Collaborate with IT teams to improve security measures across various platforms.
DevOps Engineer designing and maintaining Azure-based hybrid cloud infrastructure for a company specializing in nature-based smart city solutions. Leading cloud architecture and mentoring engineers as part of a high-impact team.
SRE responsible for ensuring reliability and performance of IT systems at a digital transformation company specializing in public sector efficiency. Collaborating on system health, incident response, and automation tasks.
DevOps Senior role at Beyond Soluções managing CI/CD for .NET and Kubernetes applications. Collaborating on cloud solutions while fostering a culture of innovation and quality.
Senior Software Engineer at PayPal managing cloud infrastructure and DevOps solutions. Delivering complete SDLC solutions and guiding engineering teams for scalable and reliable services.
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast-paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross-functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high-growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.