Senior Manager leading engineering systems for reliability, scalability, and automation at Nextiva. Collaborating across teams to enhance cloud and application performance in a multi-cloud environment.
Responsibilities
Begin as a hands-on technical leader, with team leadership responsibilities scaling over time
Drive reliability, performance, and stability efforts across the engineering organization
Collaborate with developers to promote shift-left reliability practices and improve service health
Partner with Cloud Engineering, SRE, Observability, and Database/Middleware teams on cross-functional initiatives
Lead the evolution and maturation of observability, monitoring, alerting, logging, tracing, and incident management
Act as a technical partner to engineering teams, improving scalability and performance through design reviews, architecture discussions, and direct engagement
Participate in the production on-call rotation for critical services
Mentor and support the growth of engineers, fostering strong technical development and autonomy
Champion automation-first approaches across systems, processes, and tooling
Introduce and operationalize AI-driven improvements across reliability, performance, and developer workflows.
Requirements
Bachelor’s degree in Computer Science or related field, or equivalent experience
7+ years of experience in software engineering roles supporting production systems
3+ years of experience managing or leading teams responsible for production engineering or reliability
Hands-on expertise in SRE/DevOps tooling, practices, and incident management
Deep experience with at least two major cloud providers (AWS, GCP, Azure)
Strong experience with Kubernetes and modern container orchestration
Proven success building or maturing reliability, performance engineering, or cloud infrastructure programs
Direct experience with observability platforms (metrics, logs, traces) and service health instrumentation
Demonstrated ability to work closely with developers and influence design, architecture, and operational practices
Required experience using modern AI tools and techniques to enhance engineering workflows and operational efficiency
Excellent communication skills and demonstrated success leading multi-team initiatives.
Experience with build and release pipelines and engineering productivity tooling
Experience with cloud cost optimization or performance engineering initiatives
Familiarity with service mesh technologies and distributed systems patterns
Experience improving developer experience and internal platform capabilities.
Benefits
Medical 🩺 - Medical insurance coverage is available for employees, their spouse, and up to two dependent children with a limit of 500,000 INR, as well as their parents or in-laws for up to 300,000 INR. This comprehensive coverage ensures that essential healthcare needs are met for the entire family unit, providing peace of mind and security in times of medical necessity.
Group Term & Group Personal Accident Insurance 💼 - Provides insurance coverage against the risk of death / injury during the policy period sustained due to an accident caused by violent, visible & external means.
Coverage Type - Employee Only
Sum Insured - 3 times of annual CTC with minimum cap of INR 10,00,000
Free Cover Limit - 1.5 Crore
Work-Life Balance ⚖️ - 15 days of Privilege leaves per calendar year, 6 days of Paid Sick leave per calendar year, 6 days of Casual leave per calendar year. Paid 26 weeks of Maternity leaves, 1 week of Paternity leave, a day off on your Birthday, and paid holidays
Financial Security 💰 - Provident Fund & Gratuity
Wellness 🤸 - Employee Assistance Program and comprehensive wellness initiatives
Growth 🌱 - Access to ongoing learning and development opportunities and career advancement.
Senior Specialist, Engineering providing automation support for clinical manufacturing processes at FLEx Center in Rahway, NJ. Ensuring reliable operation of automation systems and leading capital projects.
Automation Engineer providing engineering support for sterile clinical manufacturing operations at FLEx Sterile facility in Rahway, NJ. Ensuring operation compliance and reliability of automation systems with a focus on process automation.
SPS programmer developing and implementing various control systems for automation in industrial processes. Collaborating closely with engineering and project management teams in a global manufacturer setting.
AI Application Engineer responsible for developing and maintaining backends for AI applications using Python and relevant frameworks. Collaborating on full software lifecycle with cloud deployment expertise.
Document Developer responsible for developing and maintaining document templates for legal workflows at LexisNexis. Engaging with attorneys and managing regional account specifications to ensure compliance and quality.
Rotational Engineering program at GE HealthCare for high potential talent. Commitment to growing engineers through technical training and leadership assignments across various teams.
Automation Programmer at Keystone Clearwater Solutions responsible for maintaining automated assets and ensuring system functions. Involves troubleshooting, software version management, and field operations.
Automation Programmer responsible for maintaining SCADA and PLC systems at Keystone Clearwater Solutions. Involves field travel for troubleshooting and upgrades while overseeing automation assets.
Engineering Services Coordinator managing contract administration activities within Florida Department of Transportation. Overseeing agreements, coordinating with consultants, and ensuring project development is efficient.