iugu is looking for a Senior DevOps professional to manage system reliability and performance in a dynamic environment, collaborating with development teams and automating processes for efficiency.
Responsibilities
Ensure high availability and performance of systems;
Monitor system performance in real time to identify issues before they impact users;
Configure and manage monitoring tools;
Collaborate with the Development team to continuously build and improve applications;
Analyze logs and metrics to identify patterns and trends;
Diagnose root causes of issues and implement solutions;
Collaborate with other teams to resolve complex problems, documenting incidents to prevent recurrence;
Implement automation tools and automate repetitive and manual tasks to free up time for more strategic activities;
Create CI/CD pipelines to automate the software delivery process;
Share knowledge and best practices with other teams;
Stay up to date with the latest SRE and DevOps technologies and trends;
Propose improvements to systems;
Contribute to the development of more junior team members.
Requirements
Strong knowledge of operating container-based applications (Kubernetes/EKS);
Strong knowledge of public cloud architectures (preferably AWS);
Advanced knowledge of observability tools for distributed systems (Datadog, Grafana, Prometheus, Alertmanager, Zabbix, CloudWatch);
Advanced knowledge of SRE and DevOps practices;
Advanced knowledge of SDLC (Gitflow, GitHub Actions);
Advanced knowledge of IaC (Terraform) - mandatory;
Proven experience building and maintaining highly available, resilient, observable, scalable, and secure systems;
Strong knowledge of operating infrastructure for distributed environments;
Experience developing scripts to automate operational processes;
Advanced knowledge of the Linux platform.
Benefits
Swile card credit of R$ 1,689.59 distributed as follows:
Meal and Food: R$ 772.99.
Mobility: R$ 408.60 for your commute, which can be used for any service within this category (Uber, 99, transit card top-up, parking, among others).
Flexible balance: R$ 508.00 to allocate as you see fit across meal, food, or mobility.
Bradesco Health and Dental Plan: we offer ways to help you take care of your health more conveniently.
TotalPass: a platform with over 250 training modalities available at thousands of partnered gyms.
Vittude: schedule psychological consultations with the professional of your choice, with a 40% discount on all sessions.
Life Insurance: we offer solutions that help you better handle everyday challenges.
Daily breakfast: so our iuguers can start the day well.
Fresh fruit: available every day in the office.
Mini-market: everyday convenience with a selection of snacks, ready meals, and beverages for all tastes.
Childcare assistance: for moms and dads.
Extended maternity leave: 180 days.
Parental leave: extended to 30 days.
Short-term incentive: based on target achievement and individual performance evaluation.
Day off: during your birthday month.
iu+go: semi-annual performance management program with follow-up on your Individual Development Plan (PDI).
iugu academy: a learning environment created by iugu, giving you access to regulatory content and specific learning tracks.
Partnerships: agreements with outstanding educational institutions, such as language schools and universities, with exclusive discounts for iuguers and their dependents.