Site Reliability Engineer ensuring reliability and performance of FreeWheel systems. Collaborating with engineering and operations teams for optimization and troubleshooting.
Responsibilities
Design and implement monitoring and alerting systems to ensure the stability, reliability, and performance of data platforms.
Join in on-call shift to quickly respond to and resolve issues.
Develop and maintain automation tools and scripts for deployment, monitoring, backup and disaster recovery.
Analyze and optimize the performance of data storage, query performance, and data flows to ensure efficient processing of large-scale datasets, reduce latency, and improve processing speed.
Respond quickly to platform failures, perform troubleshooting, and coordinate cross-team efforts to resolve issues and ensure high availability and reliability.
Work with engineering teams to analyze and forecast capacity requirements, ensuring the system can handle traffic growth and scale infrastructure accordingly.
Document the architecture, configurations, and operational procedures for platforms, ensuring knowledge is shared across the team and providing relevant training.
Ensure platforms meet security standards and compliance requirements to prevent breaches or misuse.
Collaborate with engineering team, product team, and project management team to support product design and implementation, solving reliability-related issues.
Requirements
3+ years of experience as an SRE, DevOps or Operations Engineer.
Experience with cloud platforms (e.g. AWS, OCI, GCP, Azure) is a plus.
Hands-on experience with Terraform and infrastructure as code principle is a huge plus.
Experience with an automation tool or framework such as Ansible, Terraform, Kubernetes, Docker for automating system deployment.
Proficient in at least one programming language, such as Python, Go, Java, or Scala, with the ability to write efficient scripts and automation tools.
Familiar with using monitoring and log management tools such as Prometheus, Grafana, ELK Stack, or other similar tools.
Excellent communication skills with the ability to convey technical information clearly and concisely to both technical and non-technical stakeholders.
Proactive learner eager to grow in operations and governance.
Benefits
Best-in-class Benefits to eligible employees.
Array of options, expert guidance and always-on tools to support you physically, financially and emotionally.
DevOps Engineer at AddSecure designing and developing modern cloud infrastructure. Involved with IoT solutions and scaling services using AWS, Azure, and Terraform.
Engineer responsible for designing and maintaining SCM, CI/CD, and Software Delivery processes for an international engineering services company. Collaborate in a hybrid environment with advanced technology projects.
Offer Level 1 operational support for DevOps platforms and CI/CD pipelines. Monitor build pipelines, assist in troubleshooting, and provide 24/7 operational support in a hybrid environment.
DevOps Engineer managing and optimizing on - premises infrastructure while supporting cloud and hybrid environments. Building CI/CD pipelines and ensuring system reliability with a focus on collaboration.
Product Reliability Engineer focusing on data analysis and reporting within the reliability function at MineSense. Collaborating with teams to enhance mining technology for a sustainable future.
Dev Ops Engineer managing applications and cloud technologies at DATAGROUP. Collaborating with clients to transform IT landscapes with modern tools and technologies.
Dev Ops Engineer responsible for managing applications and databases, and supporting customer IT transformation into cloud technologies at DATAGROUP. Collaborating with a team in an innovative environment.
AWS Architect developing scalable and resilient cloud infrastructure for Nordcloud clients. Join Nordcloud to enhance cloud migration and security efforts with advanced solutions.
DevOps Engineer supporting major customer(s) container and automation environments for UltraViolet Cyber. Focus on collection, curation, and delivery processes with a collaborative approach.
Join Protolabs as a Senior DevOps Engineer to support business applications and enhance reliability. This hybrid role involves collaboration with IT and development teams in Maple Plain, MN.