DevOps Engineer building and maintaining cloud infrastructure and data pipelines at Pickle Robot Company. Join the team revolutionizing warehouse automation with robotic unload systems.
Responsibilities
Design cloud architecture applied using repeatable Terraform for GCP, including IAM, Artifact Registry, Cloud Run/Compute, load balancers/IAP, and Cloud DNS.
Evolve our GitHub Actions CI/CD pipelines with reusable workflows, intelligent caching, secrets/attestations, and WIF/OIDC integration to GCP.
Level-up observability using Prometheus/VictoriaMetrics, Grafana, and Alertmanager, establishing sensible SLOs and quiet, actionable alerts.
Harden the platform with least-privilege IAM, Secret Manager/Vault integration, service-to-service authentication, and software provenance tracking.
Ship slimmer, reproducible Docker images and define base-image policies with automated scanning for vulnerabilities.
Design, build, and maintain robust data pipelines to capture logs and telemetry from robots in the field, enabling real-time monitoring and analysis.
Build small internal tools and CLIs (usually Python) to eliminate toil, such as DNS checks, release helpers, and deployment automation.
Develop and maintain automation scripts to bootstrap laptops, servers, and robot controllers, streamlining deployment and configuration management.
Work cross-functionally with software engineering, hardware, and deployment teams to ensure our robots are operationally ready to serve customers with minimal downtime.
Participate in an on-call rotation to provide support for infrastructure and deployment issues.
Write crisp documentation, PR templates, and lightweight runbooks that people actually use.
Establish monitoring, alerting, and observability systems to proactively identify and resolve issues before they impact customers.
Requirements
5–7+ years in software engineering; you've shipped production systems and have a strong software foundation.
Proficiency with Linux, Git, Docker, and Python for automation and infrastructure management.
Hands-on experience with public cloud infrastructure management (GCP preferred): IAM, compute (Cloud Run or GCE), networking/load balancing, storage, PubSub, and Artifact Registry.
Strong experience creating CI/CD and build pipelines that integrate different providers securely, utilizing technologies such as IAM, service account impersonation, and Workload Identity Federation.
Security-minded with deep understanding of least privilege, secrets management, and OIDC/SAML concepts. SecOps experience is a plus.
Bias to automate and document; comfortable taking a loosely defined task to done with light guidance.
Pragmatic about trade-offs; you can explain the "why," not just the "what."Familiarity with Robot Operating System (ROS) and the unique challenges of deploying software to robotic systems.
Experience managing a fleet of robotic devices or IoT systems, including remote monitoring, updates, and troubleshooting.
Excellent troubleshooting skills with the ability to diagnose complex issues across distributed systems.
Strong interpersonal and technical communication skills, with the ability to collaborate effectively across engineering, hardware, and deployment teams in a hybrid environment with distributed teammates.
Detail-oriented problem-solver with a strong sense of urgency and willingness to help colleagues and customers in their time of need.
Self-motivated with high ownership mentality; personal responsibility with a collective mindset.
Benefits
health, dental, & vision insurance
unlimited vacation
all federal and state holidays
401K contributions of 5% your salary
travel supplies
other items to make your working life more fun, comfortable, and productive
DevOps Manager responsible for managing a team for multi - cloud solutions supporting the USAF Cloud One project. Focus on scalable cloud - native solutions and CI/CD practices.
Lead Site Reliability Engineer overseeing SRE practices across Azure and GCP platforms. Driving reliability improvements and leading a team at Lloyds Banking Group.
DevOps Engineer responsible for managing Microsoft Intune operations at Bundesdruckerei GmbH. Focused on ensuring secure digital solutions for identity and data protection in Berlin.
Senior Site Reliability Engineer driving observability and reliability for business - critical systems at Incedo. Collaborating with engineering teams to enhance system resilience and performance.
DevSecOps Specialist securing the software development lifecycle at Vanguard. Collaborating with teams to improve application security tooling and processes, and provide development guidance.
Site Reliability Engineer automating infrastructure deployment for Scaleway's sovereign cloud products. Collaborating with product teams to enhance observability and reliability of the platform.
Reliability Engineer responsible for equipment reliability and safety using data - driven analysis for Wood in Aberdeen. Focus on proactive maintenance and operational efficiency.
Principal Safety and Reliability Engineer developing and supporting safety design for mission - critical aerospace systems. Engaging in design reviews and ensuring compliance with requirements.
Cloud DevOps Engineer playing a pivotal role in developing migration plans for Coast Guard Cloud Architecture. Collaborating with teams to ensure effectiveness and best practices in cloud implementation.