Senior Reliability Engineer to analyze, design, program, and modify software for database systems at Disney. Building, deploying, and ensuring high availability of database infrastructure.
Responsibilities
Responsible for building, deploying, and ensuring all DEEP&T database infrastructure is available 24/7/365
Leverage software development and automation to design, modernize, and deliver database infrastructure
Participate in setting the architectural direction for database platforms and projects
Manage multiple competing priorities in a fast-paced, deadline-oriented environment
Analyze, design, and deploy fault-tolerant, distributed, and highly available database infrastructure
Proactively plan and implement infrastructure changes through capacity forecasting, software release cycles, and right sizing
Provide database expertise through performance tuning, troubleshooting, and administration
Develop, enhance, and adhere to engineering and administration standards
Develop automation and tooling to increase operational efficiency while ensuring system reliability and security
Build infrastructure and systems for scalability, resiliency, availability, and recovery through infrastructure as code and configuration management
Provide relevant insights into data store infrastructure through metrics, monitoring, and alerting
Maintain thorough and well-written documentation
Participate in live event support and on-call rotation
May provide oversight and direction to junior team members
Build relationships with engineering teams and leads
Requirements
Bachelor's degree, preferably in Computer Science, Engineering, or a related field (or equivalent experience)
5+ years of related work experience with Microsoft SQL Server, Amazon RDS for SQL Server, Azure SQL, and Azure SQL MI
Fundamental understanding of Microsoft SQL Server database internals
Experience working in Agile software development
Experience with source control management tools (Git, GitLab, GitHub)
Intermediate to advanced level of expertise in one or more programming languages such as Python, Java, or Go
General understanding of and experience with the Windows operating system, networking, and containers
Excellent verbal and written communication skills
Experience designing and deploying fault-tolerant, distributed, and highly available database infrastructure
Experience in database availability monitoring and status reporting using native monitoring tools
Well-versed in SQL Server backup, restore, and recovery strategies
Experience keeping a large environment compliant by deploying SQL Server patches and upgrades
Experience with disaster recovery planning and implementation
Comfortable collaborating with cross-functional teams providing guidance in SQL Server best practices
Benefits
A bonus and/or long-term incentive units may be provided as part of the compensation package
The full range of medical benefits is offered
Job title
Senior Site Reliability Engineer – Database Engineering