DevOps Engineer building and maintaining cloud infrastructure and data pipelines at Pickle Robot Company. Join the team revolutionizing warehouse automation with robotic unload systems.
Responsibilities
Design cloud architecture applied using repeatable Terraform for GCP, including IAM, Artifact Registry, Cloud Run/Compute, load balancers/IAP, and Cloud DNS.
Evolve our GitHub Actions CI/CD pipelines with reusable workflows, intelligent caching, secrets/attestations, and WIF/OIDC integration to GCP.
Level-up observability using Prometheus/VictoriaMetrics, Grafana, and Alertmanager, establishing sensible SLOs and quiet, actionable alerts.
Harden the platform with least-privilege IAM, Secret Manager/Vault integration, service-to-service authentication, and software provenance tracking.
Ship slimmer, reproducible Docker images and define base-image policies with automated scanning for vulnerabilities.
Design, build, and maintain robust data pipelines to capture logs and telemetry from robots in the field, enabling real-time monitoring and analysis.
Build small internal tools and CLIs (usually Python) to eliminate toil, such as DNS checks, release helpers, and deployment automation.
Develop and maintain automation scripts to bootstrap laptops, servers, and robot controllers, streamlining deployment and configuration management.
Work cross-functionally with software engineering, hardware, and deployment teams to ensure our robots are operationally ready to serve customers with minimal downtime.
Participate in an on-call rotation to provide support for infrastructure and deployment issues.
Write crisp documentation, PR templates, and lightweight runbooks that people actually use.
Establish monitoring, alerting, and observability systems to proactively identify and resolve issues before they impact customers.
Requirements
5–7+ years in software engineering; you've shipped production systems and have a strong software foundation.
Proficiency with Linux, Git, Docker, and Python for automation and infrastructure management.
Hands-on experience with public cloud infrastructure management (GCP preferred): IAM, compute (Cloud Run or GCE), networking/load balancing, storage, PubSub, and Artifact Registry.
Strong experience creating CI/CD and build pipelines that integrate different providers securely, utilizing technologies such as IAM, service account impersonation, and Workload Identity Federation.
Security-minded with deep understanding of least privilege, secrets management, and OIDC/SAML concepts. SecOps experience is a plus.
Bias to automate and document; comfortable taking a loosely defined task to done with light guidance.
Pragmatic about trade-offs; you can explain the "why," not just the "what."Familiarity with Robot Operating System (ROS) and the unique challenges of deploying software to robotic systems.
Experience managing a fleet of robotic devices or IoT systems, including remote monitoring, updates, and troubleshooting.
Excellent troubleshooting skills with the ability to diagnose complex issues across distributed systems.
Strong interpersonal and technical communication skills, with the ability to collaborate effectively across engineering, hardware, and deployment teams in a hybrid environment with distributed teammates.
Detail-oriented problem-solver with a strong sense of urgency and willingness to help colleagues and customers in their time of need.
Self-motivated with high ownership mentality; personal responsibility with a collective mindset.
Benefits
health, dental, & vision insurance
unlimited vacation
all federal and state holidays
401K contributions of 5% your salary
travel supplies
other items to make your working life more fun, comfortable, and productive
DevOps Engineer for designing and maintaining Azure - based hybrid cloud infrastructure for a company specializing in nature - based smart city solutions. Leading cloud architecture and mentoring engineers as part of a high - impact team.
SRE responsible for ensuring reliability and performance of IT systems at a digital transformation company specializing in public sector efficiency. Collaborating on system health, incident response, and automation tasks.
DevOps Senior role at Beyond Soluções managing CI/CD for .NET and Kubernetes applications. Collaborating on cloud solutions while fostering a culture of innovation and quality.
Senior Software Engineer at PayPal managing cloud infrastructure and DevOps solutions. Delivering complete SDLC solutions and guiding engineering teams for scalable and reliable services.
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast - paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross - functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high - growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.
Engineering Manager leading teams for observability platforms at LexisNexis. Owns operational excellence across software delivery lifecycle in Raleigh, NC.
Reliability Engineer optimizing site facility infrastructure and utility systems at Roche. Conducting root cause analyses and developing maintenance plans to enhance reliability and efficiency.