Senior Engineer responsible for building and scaling infrastructure at Personio, a SaaS company. Focused on improving the reliability and performance of services for the HR tech industry.
Responsibilities
Engage in and improve the full service lifecycle from initial design through deployment, operation, and continuous improvement.
Prepare services for production by engaging in system design reviews, developing shared frameworks and platforms, planning capacity and conducting launch assessments.
Operate, monitor, and maintain live services, designing observability stacks and dashboards to track key metrics and improve operational insight.
Ensure sustainable scalability through automation, driving continuous evolution to increase reliability and delivery speed.
Collaborate with product and engineering teams to define SLOs and error budgets, and to ensure services are reliable, scalable and observable.
Lead incident management processes, including on-call rotations, managing outages, driving post-mortems and conducting root cause analysis.
Identify and reduce toil through process automation, creating playbooks and automated runbooks to reduce MTTR.
Define resilience strategies and implement chaos testing to proactively uncover weaknesses and validate recovery strategies.
Mentor, train and grow the community, guiding engineers across teams in reliability best practices and tooling.
Requirements
Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
8+ years of experience with SaaS software development in distributed systems using languages such as Kotlin/Java, TypeScript, Python, and technologies like IaC, Docker, and Kubernetes.
2+ years’ experience in an SRE or similar role designing, operating, analyzing and troubleshooting distributed systems in agile environments.
Strong knowledge of modern application and infrastructure monitoring concepts (Datadog and/or AWS experience advantageous).
Systematic problem-solving and debugging skills with a strong sense of ownership and a bias towards establishing mechanisms that can scale across the entire company.
Excellent written, verbal, and documentation skills.
Collaborative team player, able to communicate effectively across disciplines.
Benefits
Receive a competitive reward package – reevaluated each year – that includes salary, benefits, and pre-IPO equity.
Enjoy 28 days of paid vacation, plus an additional day after 2 and 4 years.
Make an impact on the environment and society with 1 (fully paid) Impact Day.
Receive generous family leave, child support, mental health support, and sabbatical opportunities.
We enjoy gathering for meals, cultural initiatives, and events like local Summer Sessions and year-end celebrations. There are also healthy snacks, drinks, and a weekly catered lunch.
DevSecOps / Platform Engineer at Obviant designing secure cloud-native infrastructure. Collaborating with teams to build high-reliability shipping across various platforms.
Junior DevOps Engineer supporting IT security improvements using an in-house developed platform. Working within a cross-functional team to enhance IT and OT environments focused on cybersecurity.
Senior Business Analyst managing technical initiatives and acting as liaison for stakeholders in a tech organization. Supporting Agile frameworks and handling multiple priorities in a dynamic setting.
DevOps Engineer developing and managing cloud and on-prem infrastructure for AI-powered cyber-risk platform. Automating deployments and collaborating with data engineers to enhance cyber-security.
DevOps Engineer working with Deloitte's investment banking clients on data protection automation. Involves hands-on engineering, integration with third-party APIs, and Agile collaboration.
Senior DevOps Specialist ensuring reliability, scalability, and efficiency of SaaS platforms at Experlogix. Collaborating with development and operations teams to optimize infrastructure performance and deployment processes.
DevSecOps Engineer at coni+partner AG ensuring security and development throughout the software life cycle in banking projects. Involves participation in automation projects and close collaboration with development teams.
DevOps Engineer at Aifano GmbH developing AI-driven enterprise solutions. Involves CI/CD pipeline management, cloud infrastructure setup, and collaboration with development teams.
Lead Infrastructure Engineer at U.S. Bank responsible for managing and configuring cloud systems and infrastructure technologies while promoting automation practices.
Site Reliability Engineer focused on automation and optimization of software application performance. Collaborating with cross-functional teams to enhance scalability and reliability in Chennai/Bangalore.