IT Resiliency Engineer overseeing resilient engineering across cloud and on-premises environments. Leading chaos engineering efforts and evolving monitoring standards for system alerting.
Responsibilities
Oversee design and implementation of resilient engineering
Design and review resilient solutions for cloud and on-premises environments
Lead chaos engineering efforts to identify and mitigate system weaknesses
Collaborate to evolve standards for system monitoring and alerting
Represent IT Resiliency Office in Architectural Review Board
Collaborate enterprise-wide in prioritizing resiliency efforts
Expertise with IaC and tools such as Ansible
Integrate with post mortem process of major incidents
Evangelize standards and practices to enrich resiliency posture
Develop standardized reporting on resilience activities
Requirements
Bachelor's degree or equivalent experience
5-10 years experience with platform engineering focusing on IaC, DevOps practices, and orchestration tools
Preferred experience as a Team lead or a hands on Technical Manager
Track record of architecting and deploying enterprise-level solutions prioritizing system uptime and data integrity
Ability to design and implement high availability systems supporting massive transaction volumes and disaster recovery processes
Experience in infrastructure and service architecture & engineering
Strong dedication to customer needs with excellent communication skills
Insight into complexities of multi-AZ and multi-Region cloud platforms
Proven experience in managing mission-critical systems requiring constant uptime
Knowledgeable in evaluating trade-offs between consistency, availability, and partition tolerance
Well-versed in SaaS, PaaS, and IaaS cloud service models
Proficient in Chaos Engineering principles
Skilled in implementing observability solutions in Agile environments.
Water Engineer Intern focusing on water/wastewater/stormwater treatment and sustainability projects at Arcadis. Evaluating and planning designs while contributing to diverse water projects in White Plains, NY.
Engineer III managing engineering and construction projects at solid waste facilities. Collaborating with professionals and maintaining relationships with regulators in the waste management industry.
Performance Engineer developing tests, tools, and frameworks for Salesforce's Automation Platform, ensuring high performance, scalability, and reliability across cloud features.
Principal Metering Engineer providing end to end technical leadership in renewable energy development at NextEra Energy. Focused on metering programming, configuration, testing, and troubleshooting.
Systems & Safety Assurance Engineer enhancing safety and performance in Queensland Rail’s Major Projects team. Providing expert analysis and assurance for the Logan and Gold Coast Fast rail project.
Junior Technical Engineer at Trade Nation providing technical support and troubleshooting for staff issues. Involving hands - on coordination and administration of IT infrastructure and services.
Senior Reservoir Engineer at Deep Sky specializing in CO2 storage solutions across Canada. Leading dynamic reservoir modeling and regulatory applications in a hybrid work setting.
Manufacturing Engineer producing engineering outputs for aerospace projects. Collaborating with teams to ensure quality and efficiency within the production processes.
Junior Engineer Approvals at GROHE managing product certifications for water systems. Engaging with internal and external partners to ensure compliance with standards and norms.
Forward Deployed Engineer embedded with enterprise clients at WRITER, optimizing AI deployment while serving as a technical liaison. Requires in - depth AI expertise and software development skills.