Senior Site Reliability Engineer maintaining reliability and user experience of AI services for Woven by Toyota. Collaborating with engineering teams to ensure service availability and performance.
Responsibilities
Develop and maintain site reliability tools and processes within SRE and Enterprise AI’s engineering and support teams
Implement new technologies and infrastructure upgrades and configuration
Collaborate with other site reliability and support teams at Woven to build and maintain an integrated system
Take part in on-call rotations to ensure platform availability
Requirements
7+ years of experience in software engineering, with at least 3 years in a site reliability engineering or related role
Kubernetes cloud infrastructure and services in AWS and/or GCP, Python/Go, and Terraform experience
On-call support and monitoring/alerting tools (such as Pagerduty, Statuspage, Grafana, etc.), processes (such as on-call, incident management, post-mortems, release/change management, etc.), structure, and best practice experience
Business level English or higher
Experience working with customers and clients in Japan
Experience working with machine learning and/or AI
Japanese language skills
Benefits
Competitive Salary - Based on experience
Work Hours - Flexible working time
Paid Holiday - 20 days per year (prorated)
Sick Leave - 6 days per year (prorated)
Holiday - Sat & Sun, Japanese National Holidays, and other days defined by our company
Japanese Social Insurance - Health Insurance, Pension, Workers’ Comp, and Unemployment Insurance, Long-term care insurance
Housing Allowance
Retirement Benefits
Rental Cars Support
In-house Training Program (software study/language study)
DevOps Specialist supporting the engineering and operational enablement of next - gen data center platforms at KONE. Involves Infrastructure - as - Code deployments and daily DevOps workflows.
GitHub Enterprise Specialist managing KONE's GitHub ecosystem, ensuring secure and scalable workflows. Collaborating with teams to enhance developer productivity through AI - powered capabilities.
Senior Software Engineer responsible for designing microservices and enhancing LLM performance for Fortanix's Generative AI platform. Collaborating with data science and ML Infrastructure teams for security and optimization.
Reliability Engineering Technician conducting various verification tests and collaborating with reliability engineers. Preparing technical documentation in a well - equipped laboratory environment in Poland.
Reliability Engineer ensuring quality and reliability of products. Conducting various verification tests in a well - equipped laboratory in Mierzyn, Poland.
Senior SRE driving incident management and operational excellence in financial software solutions. Working with innovation and technology in Brazil's leading software company's team.
Salesforce DevOps Engineer focused on CI/CD pipeline management for Salesforce at S&P Global Mobility. Collaborating with cross - functional teams to ensure stable and secure releases.
Senior DevOps Engineer designing and building infrastructure for AI workloads across cloud and edge environments. Collaborating with engineering teams to implement scalable, automated solutions.
Mid - level Site Reliability Engineer at WEX managing Azure Cloud systems and driving reliability practices. Collaborating with teams to enhance performance, reduce toil and automate processes.
Reliability Engineer II improving efficiencies and safety in copper mining operations at Freeport - McMoRan. Developing recommendations for engineering projects and collaborating with Operations and Maintenance teams.