Engineer ensuring stability and reliability of workflows built on Palantir software. Engage with problems directly and advocate for product enhancements.
Responsibilities
Ensure stability and reliability of mission-critical workflows built on Palantir software
Gather signal by going on call — resolving problems before the customer is impacted
Drive product change, shape internal tooling, and refine operational processes
Rapidly address issues as they arise with quick and effective solutions
Advocate for workflow or product improvements after immediate issues are resolved
Engage directly with problems, from writing a script to automate a manual task to finding creative workarounds
Build a case for product enhancement
Synthesize learnings from support into best practices for others to follow
Create clear, actionable documentation and share best practices to elevate team and company-wide reliability
Requirements
Background in Computer Science, Engineering, Information Systems, or other technical field.
Ability to work independently and collaboratively to solve ambiguous technical and operational challenges
Excellent written and verbal communication skills, capable of interacting effectively with both technical and non-technical stakeholders.
Proficiency in Python, Java, and SQL
Familiarity with parallel data processing and Spark job optimization
Strong organizational skills and attention to detail, with the ability to prioritize effectively
Resourcefulness and creativity in fast-paced dynamic environments
Experience with root cause analysis and documenting solutions for broader impact
Enthusiasm for hands-on problem solving, continuous improvement, and knowledge sharing
Benefits
Employees (and their eligible dependents) can enroll in medical, dental, and vision insurance as well as voluntary life insurance
Employees are automatically covered by Palantir’s basic life, AD&D and disability insurance
Commuter benefits
Take what you need paid time off, not accrual based
2 weeks paid time off built into the end of each year (subject to team and business needs)
10 paid holidays throughout the calendar year
Supportive leave of absence program including time off for military service and medical events
Paid leave for new parents and subsidized back-up care for all parents
Fertility and family building benefits including but not limited to adoption, surrogacy, and preservation
Stipend to help with expenses that come with a new child
DevOps Engineer assisting developers in leveraging DevOps tooling and best practices for Cat Digital applications. Collaborating closely with development teams to optimize delivery and troubleshooting.
Reliability Engineer providing strategic support at Y12 National Security Complex. Enhancing equipment reliability and maintainability through proactive maintenance strategies.
Upper Steering System Design and Release Engineer responsible for managing steering components and suppliers. Engaging in design and development of upper steering systems for Ford vehicles in a hybrid capacity.
Senior DevOps Engineer implementing CI/CD solutions for software projects. Requires expertise in Docker, Azure, and IAC tools in a hybrid work environment.
DevOps Engineer ensuring the stability and scalability of the justtrack platform. Collaborate with development teams managing the cloud infrastructure for a SaaS solution.
Site Reliability Intern ensuring smooth operation of Compute services and collaborating on tooling development. Participate in teams for system performance and reliability improvements in a global tech company.
Site Reliability Engineer at ING enhancing BTP platform services with a focus on reliability and scalability. Collaborating with cross - functional teams to drive continuous improvement and implement effective monitoring solutions.
DevOps role at Vodafone responsible for designing and maintaining decisioning workflows for automated credit vetting using DataView360 platform. Collaborate with analysts to translate requirements into technical solutions.
SRE Lead responsible for driving reliability and performance across Platform Engineering ecosystem at Birlasoft. Leading capacity planning, incident management, and mentoring SRE engineers.
Senior Director of Engineering leading the DevSecOps Platform team. Championing developer experiences and integrated practices to enhance security and effectiveness at FIS.