DevOps Architect defining and evolving AgencyBloc’s cloud and DevOps strategy. Leading the design of infrastructure and CI/CD frameworks for secure and scalable SaaS platforms.
Responsibilities
Define and own the DevOps and cloud architecture strategy across all environments and platforms.
Establish and maintain architectural standards, patterns, and best practices for infrastructure, CI/CD, and platform services.
Design scalable, secure, and resilient cloud architectures, including compute, networking, storage, and security models.
Define Infrastructure-as-Code standards, modularization strategies, and reuse patterns across teams.
Establish CI/CD architecture, including pipeline design standards, promotion strategies, and release governance.
Lead tool and platform selection decisions (CI/CD, observability, IaC, security, etc.), ensuring alignment with long-term strategy.
Establish reliability engineering practices, including SLO frameworks, capacity planning, and resilience patterns.
Partner with security teams to define and enforce secure-by-design principles, including identity, access, encryption, and compliance standards (SOC 2, etc.).
Define disaster recovery and business continuity architecture, including RTO/RPO targets and validation strategies.
Guide multi-region and multi-environment deployment strategies.
Conduct architecture reviews and provide guidance on complex or high-impact infrastructure initiatives.
Collaborate with engineering leadership to align DevOps capabilities with product and business priorities.
Define and own the observability strategy, including standards for metrics, logging, tracing, alerting, APM, and Real User Monitoring (RUM) across all platforms.
Design alerting and incident signal strategies aligned to SLOs and business impact, ensuring high-quality, actionable alerts with minimal noise.
Lead selection and standardization of observability tooling (e.g., OpenTelemetry, Datadog, New Relic), driving consistency and visibility across teams.
Drive standardization and reduce tool and pattern fragmentation across teams.
Mentor engineers and senior engineers, providing technical leadership and architectural guidance.
Define the organization’s AI approach within DevOps, including tool selection, governance, and adoption strategy.
Establish patterns for integrating AI into DevOps workflows (e.g., pipeline optimization, anomaly detection, automated remediation).
Define standards for secure and compliant use of AI tools across engineering teams.
Evaluate emerging AI capabilities and incorporate them into the platform roadmap where they provide measurable value.
Requirements
Bachelor’s degree in Computer Science or equivalent experience preferred.
10+ years of experience in DevOps, cloud engineering, or infrastructure engineering.
Proven experience designing and implementing large-scale cloud architectures (Azure preferred).
Deep expertise in Infrastructure as Code (Bicep, Terraform, CDK, etc.) and modular design patterns.
Strong experience designing CI/CD systems and release management strategies at scale.
Expertise in cloud networking, security architecture, and identity/access management.
Experience defining and implementing observability and reliability frameworks (SLOs, SLIs, SLAs).
Strong understanding of compliance and security frameworks (SOC 2, OWASP, etc.).
Experience evaluating and selecting engineering tools and platforms.
Strong background in automation, scripting, and platform engineering principles.
Experience leading cross-team technical initiatives and influencing engineering direction.
Strong communication skills, with the ability to translate complex technical concepts to non-technical stakeholders.
Senior DevOps Engineer at One Pass redefining health engagement, managing scalable cloud infrastructure and enhancing automation. Collaborate across teams to ensure system reliability and performance.
DevOps Engineer at One Pass building and improving cloud infrastructure in AWS. Collaborating with engineers on deployments, reliability, and automation in a fast-paced environment.
Senior Release Engineer designing CI/CD pipelines for Kaseware’s mission-critical software. Collaborating with engineering, security, and operations teams to ensure fast and reliable deployments.
Site Reliability Engineer maintaining cloud infrastructure reliability for Tecsys solutions. Collaborating across teams to support services and implement automation, observability, and frameworks.
DevOps Engineer managing Kubernetes and cloud infrastructure for an innovative legal software startup. Collaborating with development teams and ensuring smooth deployment processes.
DevOps Engineer at VERBI Software GmbH managing AWS-centric infrastructure and driving reliability, scalability, and modernization. Hands-on role applying SRE principles to evolve towards cloud-native best practices.
Sr. DevSecOps Engineer I at MetroStar ensuring integration of security best practices across the development and operations lifecycle. Collaborating on delivering high-quality solutions for federal government applications.
DevOps Engineer automating software delivery processes for energy systems in Sweden. Collaborating with development teams and enhancing operational environments for a growing organization.
Site Reliability Engineer at Red Hat designing Python and Golang solutions for managed services. Involves onboarding services, maintaining reliability, and fostering team excellence.