Senior Site Reliability Engineer developing IT infrastructure and automation solutions for Coinbase. Collaborating with infrastructure, security, and compliance teams to enhance operational efficiency.
Responsibilities
Partner with the Coinbase Infrastructure team to maintain and extend existing CI/CD frameworks for IT services, including enterprise network platforms
Partner with security and compliance to build surveillance tooling into deployment pipelines
Design and implement automation to streamline overall operational IT support workflows
Carry out Kubernetes deployment, implementation, and support
Build a technological roadmap based on product requirements
Participate in on-call to support the AWS service deployment pipeline
Promote DevSecOps mentality and establish best practices to ensure top-tier cloud security
Set and maintain a standard of excellence for technical documentation across IT engineering
Participate in an operational environment with strict SLAs and managed incident response and disaster recovery strategies
Facilitate incident response, conduct root cause analysis and blameless retrospectives
Define metrics, then design and implement automation based on monitoring and observability data
Develop and maintain integrations with other systems, such as source control and build systems
Troubleshoot and resolve technical issues with internal tooling
Requirements
At least 8 years of experience supporting network infrastructure
At least 8 years of experience automating cloud infrastructure
Proficient in at least one scripting language (Bash, Python, Ruby, Go, etc.)
Proficiency with version control (Git) in CI/CD workflows
Strong experience supporting AWS services and CI/CD workflows using Terraform or an equivalent framework
Strong experience with configuration management and infrastructure-as-code tools like Terraform, Ansible, Chef, Puppet, or Salt
Strong experience with containers and container orchestration, such as Docker and Kubernetes
Benefits
Medical
Dental
Vision
401(k)
Job title
Senior Site Reliability Engineer, IT Infrastructure