Infrastructure Engineer ensuring seamless and secure cloud infrastructure improvements for Rithum's e-commerce platform. Collaborating with teams to automate cloud solutions and maintain physical datacenter operations.
Responsibilities
Support and operate physical datacenter infrastructure, including servers, storage, networking, and rack-and-stack activities.
Manage and maintain VMware vSphere / ESXi environments, including host builds, upgrades, patching, and troubleshooting.
Administer enterprise storage platforms such as Pure Storage and/or NetApp, including provisioning, snapshots, replication, and performance monitoring.
Perform hardware lifecycle activities: installs, replacements, firmware upgrades, and decommissioning.
Participate in planned maintenance, change control, and incident response for datacenter systems.
Build infrastructure-as-code software and other tools providing a foundation for software teams to rapidly build large scale cloud systems
Establish a golden AMI pipeline and maintain a continuous patching schedule for all cloud resources.
Operationalize the resiliency and disaster recovery processes for IaaS landscape
Automation of key operating and security processes in IaaS workloads
Infrastructure management via Configuration Management platform to ensure security and full stack automation for applications, drive the use of the platform and standards for all applications
Drive investigation and resolution of security incidents impacting cloud infrastructure
Work closely with security team to address vulnerabilities, evidence gathering and securing our infrastructure
Handle the security hardening efforts for major OS distributions to ensure overall compliance and a streamlined service for internal clients
Participate in the rotating on-call schedule. Ensure that user emergencies, platform alerts, and support requests are addressed.
Identify and solve problems in cloud and hybrid environments and the ability to create and implement a new solution from scratch.
Mentor and develop less experienced engineers.
Requirements
3+ years’ hands-on experience working in a physical datacenter environment.
Minimum of 2 days on-site in our data center, additional may be required based on business needs
Based in the Raleigh-Durham (RDU) area, North Carolina
Strong experience with VMware (vSphere / ESXi) in a production setting.
Experience administering enterprise storage platforms, preferably Pure Storage and/or NetApp.
Familiarity with server hardware (Dell, HPE, or similar), RAID, BIOS/firmware management.
Experience following change management and operational best practices.
Knowledge and experience of high availability and scalability
Experience with AWS foundations, including computer, networking, storage, and security
Experience architecting containerization solutions in cloud environments like ECS or EKS. A strong background in systems engineering, especially with tools like Docker and Kubernetes in Linux and containerized environments
Expertise in implementing content delivery solutions at the edge
A thorough understanding of IAM roles, access management, DNS, load balancing, routing, firewalls, and monitoring tools for cloud and hybrid environments
Experience deploying and supporting AWS network services, including VPC, Subnets, Route Tables, NACLs, Security Groups, TGW, GWLB, VPC Endpoints, Route53.
Knowledge of AWS compute, data sources, security technologies, services including EC2, S3, IAM, ECS, EKS, Load Balancers, SCP, RAM, CloudWatch, CloudTrail, WAF
Experience architecting and securing regulated enterprise-class cloud services with compliance frameworks like SOC2, NIST, and ISO
Experience with logging and monitoring systems like Datadog and Cloudwatch
Familiarity and ability to diagnose large systems - how they work and can be operated on a large scale, edge cases, failure modes, behaviors.
Proficient in writing code/scripting in languages like
Automation of infrastructure with CDK, Terraform, and Ansible, as well as containerization using EKS or ECS.
Ability to diagnose complex distributed systems problems whether it be system, network or code
Benefits
Medical, dental and vision benefits: Affordable health care plans and company HSA contributions, starting on Day 1
A 6% 401(k) match
Competitive time off package with 20 days of Paid Time Off, 9 Company-Paid holidays, 2 paid floating holidays, 7 paid sick days, 2 Wellness days, and 1 Paid Volunteer Day; at 3 years of service PTO increases to 22 days, and at 5 years it increases to 25 days
Kubernetes Infrastructure Engineer focusing on Quantum Key Distribution software in Vienna. Responsible for secure and reliable software infrastructure operations.
Kubernetes Infrastructure Engineer focused on developing software infrastructure for Quantum Key Distribution as a service. Join zerothird in Vienna, a leader in quantum cryptography technology.
IT Infrastructure Engineer managing on - prem and cloud infrastructure in aviation data solutions. Collaborating in a well - coordinated team for flexible project work and customer impact.
Infrastructure Architect designing and implementing scalable solutions at Regions. Collaborating with teams on enterprise - wide architecture and infrastructure improvements.
Cloud Infrastructure Engineer at EVENTIM designing AWS infrastructure and implementing DevOps practices. Collaborating with teams on scalability, security, and automation initiatives.
Infrastructure Engineer at BAE Systems Digital Intelligence designing and maintaining enterprise - grade infrastructure platforms. Role involves Linux, Windows, cloud environments, and security responsibilities.
Sr. AWS and Infrastructure Engineer defining and owning AWS infrastructure architecture for scalable production environments. Leading security architecture and compliance implementation with a focus on cost optimization and CI/CD.
Senior Infrastructure Architect at Cambio, leading IT solutions in healthcare transformation. Driving architecture and infrastructure initiatives for e - health solutions in Sweden.
Staff ML Infrastructure Engineer building and scaling robust Compute platforms for Simulation and data workflows at GM. Collaborating with engineers to drive efficiency and reliability in AI infrastructure.