Assist in managing and maintaining cloud environments (AWS and Google Cloud), ensuring uptime, performance, and scalability
Work with the Senior Cloud Operations Manager to ensure that all operational procedures and infrastructure meet industry and federal government security guidelines and requirements
Monitor cloud infrastructure, detect issues, and troubleshoot problems to minimize downtime and ensure system stability
Implement automation tools and CI/CD pipelines to streamline operations, deployments, and system updates
Support security initiatives such as patching, vulnerability scanning, and remediation in compliance with industry standards
Respond to infrastructure incidents, resolve issues quickly, and provide reports on root causes and resolution strategies
Ensure backup systems are operational and disaster recovery plans are in place and regularly tested
Assist in maintaining comprehensive documentation of system configurations, operational procedures, and compliance-related activities
Work closely with the development and security teams to ensure seamless integration between infrastructure and applications
Requirements
3+ years of experience in cloud operations, infrastructure management, or DevOps, with a strong emphasis on AWS
Familiarity with SOC 2, ISO 27001, FedRAMP, or other government cloud security frameworks is highly desirable
Strong knowledge of AWS services such as EC2, S3, RDS, Cognito, and Load Balancer; familiarity with Google Cloud services such as GCE, Cloud Run, and BigQuery is a plus
Experience with automation tools such as Terraform, CloudFormation, or Ansible, and familiarity with CI/CD pipelines
Proficiency in scripting languages (Python, Bash, etc.) for automation and system management
Understanding of cloud security principles, encryption, and compliance requirements
Experience with monitoring tools (CloudWatch, Prometheus) for tracking infrastructure performance and resolving issues
Strong troubleshooting skills with the ability to resolve technical issues quickly and efficiently
Ability to collaborate effectively with cross-functional teams and take direction from senior leadership
Bachelor’s degree in Computer Science, Information Systems, or a related field, or equivalent work experience
DevOps Product Manager working on complex platform and infrastructure projects. Consulting on DevOps best practices and ensuring scalable, efficient digital ecosystems for clients.
Site Reliability Engineer optimizing large-scale Linux environments at Bumble Inc. Troubleshooting incidents and driving performance improvements on platforms such as Kafka and Kubernetes.
Senior DevOps Engineer at mylo, managing multi-cloud infrastructure and CI/CD pipelines. Promoting DevOps culture while ensuring compliance and automating system maintenance.
Lead Site Reliability Engineer at S&P Global's Cloud Engineering team. Responsible for designing and maintaining cloud infrastructure and ensuring the performance of cloud-based systems.
Site Reliability Engineer responsible for monitoring and improving the reliability of satellite operations infrastructure. Collaborating with teams to automate processes in a dynamic environment.
DevOps Analyst providing high-quality, reliable solutions within multifunctional teams at a technology-focused financial organization. Automating build and deployment solutions in a hybrid work environment.
Network & Datacenter Deployment Engineer at Cloudflare, focused on building and expanding its global network infrastructure in collaboration with multiple engineering teams and vendors.
Senior DevOps Engineer leading cloud-native solutions at Sparksoft Corporation. Driving automation and system reliability within a fast-paced Agile team.