Systems Engineer managing AWS/Cloud production environments at Ticketek Entertainment Group. Proactively mitigating issues and collaborating with engineering teams for optimal performance.
Responsibilities
Proactively monitoring and managing our AWS/Cloud production environments and reacting swiftly to prevent or reduce customer-visible impact
Escalation and communication of production issues to key stakeholders
Troubleshooting, reproducing, and mitigating complex system and infrastructure issues within the AWS environment
Incident management of high-severity issues impacting our sites and services, 24x7
Developing and implementing automation and tooling (e.g., leveraging CloudFormation, Ansible, or Terraform) in collaboration with the Site Reliability Engineering team to improve cloud management processes
Working on the engineering team backlog
Supporting services prior to go-live through pre-launch reviews
Providing technical support for internal products, requiring strong investigation, analysis, and resolution skills
Monitoring systems and performing routine system checks
Execution of daily system operations tasks, including maintenance and optimisation of the cloud infrastructure
Deployment, configuration, and management of new or updated cloud-native solutions using GitLab CI/CD
Requirements
Strong troubleshooting, problem-solving, and investigative skills applied across diverse operating systems and networked environments
Extensive experience operating and managing critical cloud infrastructure and production environments
Experience working in an agile environment to deliver software
Knowledge and practical experience with scripting (including Shell scripting and Python) for automation and system management
Experience working in a Microsoft stack environment, including the Windows Server operating system, Internet Information Services (IIS), Active Directory (AD), and database servers such as Microsoft SQL Server
Operating knowledge of UNIX or Linux, including proficiency in Shell scripting and Python scripting for automated task scheduling and infrastructure management
Demonstrated experience with Amazon Web Services (AWS), specifically in managing core services (e.g., EC2, VPC, S3)
Sound knowledge of networking fundamentals such as IP addressing, TCP/IP, and firewalls
Ability to learn new technologies quickly, analyze business needs, and recommend effective solutions
Excellent verbal and written communication skills
Proven experience in Incident Management for high-severity issues, preferably within a large-scale production environment
Experience working within GCP
Experience using CI/CD tools (such as GitLab CI/CD or Jenkins) to streamline cloud deployments
Experience managing or administering specialized databases, particularly cloud-native services like DynamoDB and Snowflake
Experience working effectively in a demanding and fast-paced production environment
Expertise with logging, monitoring, and Application Performance Monitoring (APM) tools used for proactive system health checks
Previous experience working a shift pattern or managing substantial weekend work or on-call responsibilities