About the role

We are seeking a Senior Reliability Engineer to analyze, design, program, and modify software for database systems at Disney. The role involves building, deploying, and ensuring high availability of database infrastructure.

Responsibilities

Build, deploy, and ensure that all DEEP&T database infrastructure is available 24/7/365
Leverage software development and automation to design, modernize, and deliver database infrastructure
Participate in setting the architectural direction for database platforms and projects
Manage multiple competing priorities in a fast-paced, deadline-oriented environment
Analyze, design, and deploy fault-tolerant, distributed, and highly available database infrastructure
Proactively plan and implement infrastructure changes through capacity forecasting, software release cycles, and right-sizing
Provide database expertise through performance tuning, troubleshooting, and administration
Develop, enhance, and adhere to engineering and administration standards
Develop automation and tooling to increase operational efficiency while ensuring system reliability and security
Build infrastructure and systems for scalability, resiliency, availability, and recovery through infrastructure as code and configuration management
Provide relevant insights into data store infrastructure through metrics, monitoring, and alerting
Maintain thorough and well-written documentation
Participate in live event support and on-call rotation
May provide oversight and direction to junior team members
Build relationships with engineering teams and leads

Bachelor's degree, preferably in computer science, engineering, or a related field (or equivalent experience)
5+ years of related work experience with Microsoft SQL Server, Amazon RDS for SQL Server, Azure SQL, and Azure SQL Managed Instance
Fundamental understanding of Microsoft SQL Server database internals
Experience working in Agile software development
Experience with source control management tools (Git, GitLab, GitHub)
Intermediate to advanced level of expertise in one or more programming languages such as Python, Java, or Go
General understanding of, and experience with, the Windows operating system, networking, and containers
Excellent verbal and written communication skills
Experience designing and deploying fault-tolerant, distributed, and highly available database infrastructure
Experience in database availability monitoring and status reporting using native monitoring tools
Well-versed in SQL Server backup, restore, and recovery strategies
Experience keeping a large environment compliant by deploying SQL Server patches and upgrades
Experience with disaster recovery planning and implementation
Comfortable collaborating with cross-functional teams and providing guidance on SQL Server best practices

A bonus, long-term incentive units, or both may be provided as part of the compensation package
The full range of medical benefits is offered