Software Engineer contributing to the observability team's development of visibility systems. Implementing a high-performance telemetry platform and supporting AI tools for engineering teams.
Responsibilities
Contribute to the development and maintenance of the systems that provide visibility into Shipt’s technical ecosystem.
Implement and support a high-performance telemetry platform for engineering teams to monitor, debug, and optimize their services effectively.
Work closely with senior engineers to ensure metrics, logs, and traces are reliable and actionable.
Help bridge the gap between traditional monitoring and intelligent diagnostics.
Utilize and support AI-enhanced tools and interfaces to streamline interaction with telemetry data.
Help integrate autonomous agents and predictive models into workflows for a proactive, self-healing infrastructure environment.
Requirements
Bachelor’s degree in Computer Science, Software Engineering, or a related field (or equivalent practical experience).
3+ years of professional experience in software engineering, with exposure to observability, SRE, or infrastructure-focused roles.
Proficiency in at least one programming language, such as Golang or Python, for building and automating infrastructure tooling.
Familiarity with modern log, metric, and trace observability stacks, such as Prometheus, OpenTelemetry, and structured logging, or similar tooling.
Basic understanding of how AI and machine learning can be applied to time-series data for anomaly detection.
Experience working with containerized environments like Kubernetes and cloud platforms like GCP.
Strong analytical and problem-solving skills, with a commitment to providing high-quality visibility for engineering teams.
Benefits
Employees (and eligible family members) are covered by medical, dental, vision and more.
Employees may enroll in our company’s 401k plan.
Exempt team members are also eligible for discretionary vacation.
Paid holidays throughout the calendar year and paid sick leave.
Other compensation includes eligibility for an annual bonus and the potential for restricted stock units based on role.