DevOps Engineer responsible for building and maintaining infrastructure for SaaS platform. Ensuring high availability, security, and performance for growing customer base in Ahmedabad, Gujarat.
Responsibilities
Design, implement, and maintain scalable AWS infrastructure using EC2, VPC, and related services
Build and optimize CI/CD pipelines for rapid, reliable software delivery
Automate deployment and configuration management using Ansible
Develop automation scripts using Python and Bash for operational efficiency
Manage and optimize Linux-based server environments
Monitor system performance using ELK Stack (Elasticsearch, Logstash, Kibana) for logging and observability
Optimize PostgreSQL database performance, backups, and high availability configurations
Respond to incidents and participate in on-call rotations
Conduct root cause analysis and implement preventive measures
Optimize AWS resource utilization and cost efficiency
Implement security best practices across infrastructure and deployment pipelines
Manage access controls, secrets management, and security scanning
Ensure compliance with relevant security standards and regulations
Conduct regular security audits and vulnerability assessments
Work closely with development teams to improve application architecture and deployment strategies
Provide guidance on infrastructure considerations during feature development
Document infrastructure, processes, and runbooks for the team
Participate in technical discussions and contribute to architectural decisions
Requirements
8+ years of experience in DevOps, Site Reliability Engineering, or related roles
Strong proficiency with AWS services, particularly EC2, VPC, IAM, S3, RDS, CloudWatch, and Auto Scaling
Extensive Linux system administration experience (RHEL, Ubuntu, or similar distributions)
Hands-on experience with PostgreSQL administration, optimization, and backup strategies
Proficiency in configuration management using Ansible (playbooks, roles, and inventory management)
Strong scripting skills in Python and Bash for automation and tooling
Experience implementing and managing ELK Stack for centralized logging and monitoring
Solid understanding of CI/CD tools and practices (Jenkins, GitLab CI, GitHub Actions, or similar)
Strong understanding of networking, security, and AWS best practices
DevOps Engineer for designing and maintaining Azure - based hybrid cloud infrastructure for a company specializing in nature - based smart city solutions. Leading cloud architecture and mentoring engineers as part of a high - impact team.
SRE responsible for ensuring reliability and performance of IT systems at a digital transformation company specializing in public sector efficiency. Collaborating on system health, incident response, and automation tasks.
DevOps Senior role at Beyond Soluções managing CI/CD for .NET and Kubernetes applications. Collaborating on cloud solutions while fostering a culture of innovation and quality.
Senior Software Engineer at PayPal managing cloud infrastructure and DevOps solutions. Delivering complete SDLC solutions and guiding engineering teams for scalable and reliable services.
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast - paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross - functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high - growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.
Engineering Manager leading teams for observability platforms at LexisNexis. Owns operational excellence across software delivery lifecycle in Raleigh, NC.
Reliability Engineer optimizing site facility infrastructure and utility systems at Roche. Conducting root cause analyses and developing maintenance plans to enhance reliability and efficiency.