Proactively monitoring and managing our AWS/Cloud production environments and reacting swiftly to prevent or reduce customer visible impact
Escalation and communication of production issues to key stakeholders
Troubleshooting, reproducing, and mitigating complex system and infrastructure issues within the AWS environment
Incident management of high severity issues impacting our sites and services 24x7
Developing and implementing automation and tooling (e.g., leveraging CloudFormation, Ansible, or Terraform) in collaboration with the Site Reliability Engineering team to improve cloud management processes
Working on the engineering team backlog
Supporting service prior to go-live through pre-launch reviews
Providing technical support for internal products, requiring strong investigation, analysis, and resolution skills
Monitoring and checking of systems
Execution of daily system operations tasks, including maintenance and optimisation of the cloud infrastructure
Deployment, configuration, and management of cloud-native or updated solutions, utilizing Gitlab CICD
As a member of our Technology team, you will be working closely with the Development team and business units to deliver a high level of support and service
Requirements
Strong troubleshooting, problem-solving, and investigative skills applied across diverse operating systems and networked environments
Extensive experience operating and managing critical cloud infrastructure and production environments
Experience of working in an agile environment to deliver software
Knowledge and practical experience with scripting (including Shell scripting and Python) for automation and system management
Experience working in a Microsoft stack environment including Windows Server Operating system, Internet Information Services (IIS), Active Directory (AD) and database servers such as Microsoft SQL Server.
Operating knowledge of UNIX or Linux, including proficiency in Shell scripting and Python scripting for automated task scheduling and infrastructure management
Demonstrated experience with Amazon Web Services (AWS), specifically in managing core services (e.g., EC2, VPC, S3)
Sound knowledge of basic networking such as IPs, TCP/IP and Firewall
Proficient in quickly learning new technologies and ability to analyze business needs and recommend effective solutions
Senior Financial App Systems Analyst managing the transition to SAP S4 Hana for the City of Toronto, ensuring effective system support and design implementation.
System Analyst facilitating projects and supporting operational efficiency for a leading retail company. Engaging in agile methodologies and collaboration across business and IT sectors.
Ausbildung zum Fachinformatiker Systemintegration bei Liebherr - Baumaschinen in Dettingen. Planung, Konfiguration und Support von IT - Systemen und Hardware.
Senior Systems Engineer responsible for managing site Network Implementations at CEVA Logistics. Coordinate with contractors and oversee equipment installation to facilitate global logistics solutions.
Vehicle Cyber Security Systems Engineer at Ford focusing on smart vehicle security and reliability. Collaborating within cross - functional teams to enhance automotive technology and performance.
SET1 HW Systems Engineer handling automotive complex systems from concept to production support at Ford. Leading cross - functional projects and engaging with stakeholders for system requirements.
Feature Systems Engineer leading the development of innovative electric vehicle features at Ford. Collaborating with cross - functional teams to ensure high - quality implementations and user satisfaction.
Lead Systems Engineer for Boeing developing aerospace systems. Overseeing engineering efforts, system requirements, and cross - functional collaboration in a Missouri - based environment.
Lead Services Systems Engineer at GE Vernova responsible for engineering and system optimizations. Engaging in design changes and driving efficiency in wind turbine operations.
Senior Systems Engineer at GE Vernova responsible for execution of turbine operation changes and AI - enhanced analytics. Collaborating with teams to improve turbine availability and reduce service costs.