Systems and Infrastructure Engineer managing technology infrastructure and providing DevOps support for system reliability. Collaborating with development teams to implement solutions and enhance system performance.
Responsibilities
Coordinate system support and performance by ensuring support queue issues are resolved.
Identify the root causes of issues.
Drive suppliers to resolve issues related to products in accordance with service level agreements.
Document the resolution of issues and perform escalation procedures.
Handle core DevOps team duties, including on-call support to ensure system reliability, oversight of new launch reviews and onboarding, and effective incident management and troubleshooting.
Proactively manage dashboards and alert systems, and provide consultation support to expedite development team processes during release and post-deployment phases.
Track alert analysis trends, identify weekly focal points, and collect data-driven evidence for specific issues.
Collaborate with development teams to understand and address problem context.
Implement permanent solutions to prevent recurring alerts, and communicate implemented fixes transparently to stakeholders for enhanced system reliability.
Contribute to automation and efficiency gains by reducing manual effort.
Demonstrate up-to-date expertise in Information Systems Division (ISD) infrastructure and apply it to the development, execution, and improvement of action plans by providing expert advice and guidance to others in the application of information and best practices; supporting and aligning efforts to meet customer and business needs; and building commitment for perspectives and rationales.
Requirements
Master's degree or equivalent in computer science, computer engineering, information systems, information technology, or related area; OR Bachelor's degree or equivalent in computer science, computer engineering, information systems, information technology, or related area and 2 years of experience in technology infrastructure engineering across areas with compute, storage, network, mobility or virtualization-related technologies.
Experience automating tasks and managing system configurations using Python and Bash to streamline operations and reduce manual intervention.
Experience overseeing containerized technologies including Docker and orchestrating complex applications using Kubernetes, ensuring optimal deployment, scaling, and high availability in an infrastructure management environment.
Experience designing, implementing, and automating distributed systems solutions.
Experience managing Linux environments, including system configuration, maintenance, troubleshooting, and security management.
Experience creating, maintaining, and updating infrastructure monitoring dashboards and alerting mechanisms using Grafana, ELK stack, Dynatrace, Spotlight, Prometheus, and xMatters.
Experience troubleshooting network issues, identifying root causes, and implementing permanent fixes using Wireshark, Dynatrace, and networking protocols.
Experience maintaining and administering source code version control and branching strategies, and creating governance rules for Git organizations, using Git, GitHub, Bitbucket, and GitHub Actions.
Experience maintaining streaming application flows including Apache Kafka and AWS Kinesis.
Demonstrated knowledge of SSL certificates at various root levels and of certificate authorities, including certificate renewal management using ServiceNow, Venafi, OpenSSL, DigiCert, GlobalSign, and RapidSSL.
Experience with code analysis and quality gate control measures using SonarQube, Jenkins, Looper, and Concord.
Experience developing APIs for various applications and working with databases, using technologies including Redshift, Cassandra, MySQL, Flask, Docker, and Python.
Benefits
Health benefits include medical, vision and dental coverage.
Financial benefits include 401(k), stock purchase and company-paid life insurance.
Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty and voting.
Other benefits include short-term and long-term disability, education assistance with 100% company-paid college degrees, company discounts, military service pay, adoption expense reimbursement, and more.
Data Transport Infrastructure Engineer at Leidos supporting U.S. Air Force Cloud One Architecture. Involves developing scalable cloud-native solutions and mentorship roles in a hybrid remote setting.
Principal Software Engineer on Walmart's AI Security team analyzing threats and implementing robust security architectures. Collaborate across domains and mentor on AI safety and secure engineering practices.
Data Center Infrastructure Architect designing scalable and resilient optical cabling for hyper-scale data centers. Implementing physical solutions and automating fiber mapping for efficiency.
Infrastructure Engineer managing IT infrastructure projects and operational tasks for the MHRA. Collaborating with teams to ensure service stability and performance in the Digital and Technology group.
AI Infrastructure Engineer designing and implementing AI/ML solutions for infrastructure use cases at Xsolla. Collaborating with teams to enhance the security posture of infrastructure systems.
AI Infrastructure Engineer at Xsolla designing AI/ML solutions for multi-cloud infrastructure. Collaborating on automation workflows and observability systems for improved infrastructure management.
Cloud Infrastructure Engineer managing Azure environments and supporting cloud infrastructure processes in a credit market servicing organization. Collaborating with DevOps teams and ensuring compliance with security standards.
Cloud Infrastructure Architect managing AWS and Azure environments for fintech clients. Leading architectural governance and security compliance in a hybrid infrastructure setup.
Infrastructure Engineer responsible for managing GCP infrastructure and supporting cloud operations. Seeking skills in Terraform, Kubernetes, Ansible, and incident response in enterprise settings.
Senior Infrastructure Specialist designing Azure landing zones and automation for Vitrolife Group. Collaborating with security and virtualization teams to ensure compliance and governance in cloud infrastructure.