Associate Site Reliability Engineer supporting the reliability and performance of global IT infrastructure at Exegy. Engage with senior engineers and learn foundational systems engineering skills.
Responsibilities
Support uptime across compute, storage, virtualization, and network infrastructure
Assist in managing production systems across data centers, colocation, and cloud environments
Participate in a 24x7 on-call rotation with escalation support
Assist in incident response, root cause analysis, and post-mortem documentation
Provision and deprovision users across Active Directory, Okta, and Exchange
Manage access requests and permissions across on-prem and cloud systems
Troubleshoot Tier 1/2 support issues across hardware, software, connectivity, and accounts
Maintain email distribution lists and shared mailboxes
Provide general office and A/V technology support
Execute and maintain scripts (PowerShell, Bash, Python)
Assist in automating operational tasks and workflows
Support Infrastructure-as-Code (IaC) efforts under guidance of senior engineers
Use AI tools to support troubleshooting, documentation, and operational efficiency
Monitor systems and respond to alerts using observability tools
Perform log analysis and update system health dashboards
Assist in proactive performance checks across hardware and OS layers
Support physical and virtual infrastructure, including hardware lifecycle tasks
Perform patching, upgrades, and routine maintenance
Assist with backup operations and validation
Participate in disaster recovery and business continuity testing
Maintain secure system configurations aligned with standards
Partner with Infrastructure, Network, Security, and DevOps teams
Create and maintain runbooks, SOPs, and documentation
Support security controls, access management, and compliance activities
Requirements
Foundational knowledge of IT systems including:
Networking (TCP/IP, DNS, DHCP)
Operating systems (Windows and/or Linux)
Identity and access management concepts
Comfort working in a command line environment
Exposure to scripting or automation (PowerShell, Bash, Python, or similar)
Basic understanding of monitoring and alerting concepts
Strong problem-solving ability; able to break down unfamiliar issues and work toward solutions
Clear written and verbal communication skills
Ability to work in a team environment, including participation in an on-call rotation
Demonstrated curiosity and initiative in learning new technologies.
Cloud Engineer at MetroStar focusing on building and securing cloud - native systems. Managing Kubernetes workloads and CI/CD pipelines in Agile teams with an emphasis on security.
Senior Engineer Cloud Engineering role focused on AWS migration and automation. Collaborating with teams to innovate cloud patterns and infrastructure best practices.
Senior Operations Engineer driving efficiency and reliability in NVIDIA's global business operations. Collaborating with IT subsystems and automating operational workflows for organizational impact.
Lead or Senior DevOps Developer joining Boeing Defense, Space and Security for advanced technology missions. Involves CI/CD, cloud systems design, and collaboration with government customers.
Site Reliability Engineer ensuring high availability and performance for digital platforms in retail. Collaborating with engineering teams for automation and observability practices.
Site Reliability Engineer driving innovation and growth for Banking Solutions, Payments, and Capital Markets business. Responsible for application reliability and incident response in a hybrid work environment.
DevSecOps role at Tiime ensuring implementation of security practices in products. Collaborate with teams for cloud security and incident management in a hybrid workspace.
Senior Site Reliability Engineer responsible for designing reliable infrastructure supporting Fixify's SaaS platform. Collaborating with product engineering teams and maintaining operational standards for infrastructure performance.
DevOps Engineer working with critical infrastructure systems for Swedish internet services. Focused on building and managing robust systems and contributing to automation and operational improvements.
DevSecOps Consultant integrating security into IT development and operational processes. Advising clients on seamless integration of security requirements into DevOps workflows.