DevOps Coordinator overseeing AWS cloud infrastructure and CI/CD pipelines at BMW TechWorks Romania. Leading operational stability efforts while managing technical teams and enhancing system reliability.
Responsibilities
Ensure operational stability and high availability
Lead and coordinate the external operations team to meet service level agreements and operational requirements
Manage and improve CI/CD pipelines for seamless deployment
Oversee AWS cloud infrastructure including ECS instances, containerization solutions, and related cloud services
Implement and maintain monitoring systems (Prometheus, Grafana, ELK) for proactive issue identification and resolution
Manage Snowflake database operations including performance monitoring, backup strategies, and access controls
Analyze and resolve data flow issues to maintain seamless integration between applications and data integrity
Implement and maintain security measures for authentication, authorization, and data protection across the application stack
Coordinate between internal teams and external partners as the primary point of contact for operational matters
Identify infrastructure issues and work with development teams to implement timely resolutions
Monitor resource utilization and plan capacity needs to support application growth and performance
Develop and maintain disaster recovery plans and procedures to ensure business continuity
Anticipate challenges and implement preventive measures to enhance system reliability before problems arise
Maintain comprehensive documentation of system configurations, processes, and troubleshooting procedures
Monitor cloud resource costs and provide recommendations for optimization without compromising performance
Work with development teams to identify and implement performance improvements across the application stack
Requirements
Proactive mindset with a hands-on attitude towards problem-solving
Bachelor's degree in Computer Science, Information Technology, or a related field
Minimum of 5-7 years professional work experience in IT operations management
Experience managing technical teams, preferably in a vendor management capacity
Proven experience in IT operations, particularly in managing Java-based backend services and modern frontend applications within AWS cloud environments
Strong understanding of CI/CD practices and tools (GitHub Actions, Jenkins, Terraform, etc.)
Knowledge of database operations, particularly with Snowflake or similar cloud data warehouses
Experience with monitoring tools (Prometheus, Grafana, ELK stack) and APM solutions
Excellent communication skills, with the ability to coordinate effectively with diverse teams and stakeholders
Familiarity with infrastructure and security requirements, acting as the first contact for external requests
Reliability Engineering Manager at Nestlé driving improvements in maintenance and engineering processes. Leading teams in establishing a zero loss culture for sustainable production efficiency.
Associate DevSecOps Engineer supporting R&D tools deployment in Bologna. Hands - on exposure to DevSecOps and containerized services in a growing tech environment.
Senior Reliability Engineer responsible for maintaining and improving plant asset reliability processes while ensuring safe operations and high product quality. Requires collaboration with clients and complex problem - solving skills.
Senior Site Reliability Engineer at PulseRise Technologies building and scaling reliability foundations for a fintech platform. Leading incident response and designing resilient AWS architectures in a hybrid environment.
Senior DevOps Platform Engineer at Humana responsible for designing and maintaining cloud infrastructure on Azure and GCP. Driving CI/CD pipeline development and ensuring security compliance for healthcare tech.
DevSecOps Engineer focusing on automation and Active Directory management at Saab. Collaborating within the IAM team to enhance secure access and infrastructure management.
Technology Lead - SRE at Broadridge managing service delivery agreements and client satisfaction through project management. Focusing on improving processes for efficient service delivery in financial solutions.
Director of DevSecOps and SRE at Allegion overseeing infrastructure reliability and CI/CD pipelines. Leading and mentoring SRE and cloud infrastructure teams in a global organization.
Platform Engineer (SRE) responsible for implementing cloud - native infrastructure and automation. Join UOL EdTech to transform education using technology in Brazil.