DevOps/MLOps Engineer designing, automating, and maintaining scalable infrastructure for federal client. Collaborating with software engineers and data scientists for resilient solutions.
Responsibilities
Design, implement, and maintain robust CI/CD pipelines to support continuous integration and delivery of both application code and AI/ML models across development, testing, and production environments.
Automate infrastructure provisioning, configuration management, and deployment processes using Infrastructure as Code (IaC) tools to ensure consistency, scalability, and repeatability.
Manage and optimize cloud-based environments, leveraging platforms such as AWS, Azure, or GCP to support high availability, fault tolerance, and cost efficiency.
Implement and manage containerization and orchestration technologies (e.g., Docker, Kubernetes) to support scalable, portable, and resilient application and model deployments.
Monitor system performance, availability, and reliability using centralized logging, metrics, and alerting tools; proactively identify and resolve performance bottlenecks and system issues.
Ensure seamless integration and promotion of code and models across development, testing, staging, and production environments through automated workflows and release management processes.
Collaborate with data scientists and ML engineers to operationalize machine learning models, enabling versioning, reproducibility, and continuous model delivery through MLOps best practices.
Implement and enforce security best practices across the DevOps lifecycle, including secure configurations, vulnerability management, and compliance with federal security standards.
Support system reliability engineering (SRE) practices, including incident response, root cause analysis, and continuous improvement of system resilience.
Document infrastructure, pipelines, and operational procedures to support maintainability, auditability, and compliance with federal standards and accreditation requirements.
Requirements
US Citizenship with ability to obtain a Public Trust.
Bachelor’s degree or higher in Computer Science, Engineering, Information Technology, or a related technical discipline from an accredited institution.
Minimum of 4 years of experience in DevOps, Site Reliability Engineering (SRE), MLOps, or a related field supporting enterprise or mission-critical systems.
Hands-on experience designing and maintaining CI/CD pipelines using tools such as Jenkins, GitLab CI/CD, GitHub Actions, or similar.
Experience with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Ansible.
Experience working with cloud platforms such as AWS, Microsoft Azure, or Google Cloud Platform (GCP).
Proficiency in containerization technologies such as Docker and orchestration platforms such as Kubernetes.
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack, or similar).
Familiarity with supporting AI/ML workflows and implementing MLOps practices is highly preferred.
Understanding of security best practices and experience working within regulated or federal environments (e.g., NIST, FedRAMP) is preferred.
Strong problem-solving, troubleshooting, and communication skills, with the ability to collaborate across cross-functional teams.
Local to Ashburn, VA and available to work onsite as needed.
Benefits
Flexible Work Hours : Life doesn’t always fit into a 9-to-5 schedule. We offer flexibility to help you manage your work-life balance effectively.
Remote Work : Niyam understands the value of flexibility. We offer remote work.
Career Growth : Niyam is not just a job; it’s a career journey. We provide a supportive environment for your professional development and offer fully paid opportunities for training and advancement within the company.
Great People : Our people are the blueprint of who Niyam is to the industry and community.
Great Environment : Niyam fosters a great environment where innovation, collaboration, and personal growth thrive.
Diversity & Inclusion : We believe in the strength of diverse perspectives. Your unique ideas are welcomed and celebrated every day at Niyam.
Senior DevSecOps Engineer/Developer responsible for building Humana's software security platform. Modernizing architecture and managing CI/CD pipelines as part of core engineering team.
Senior Information Security Analyst focusing on DevSecOps for Unidas, a major mobility company in Brazil. Responsible for optimizing security governance processes and delivering secure software.
Back - End & DevOps Software Developer contributing to building digital products to change the world. Specializing in back - end development and command of DevOps ecosystem for robust infrastructure.
DevOps Manager overseeing scaling for Seekr's AI platform using Kubernetes, Terraform, and Ansible. Leading a hands - on team and collaborating with engineering for efficiency.
Lead DevOps Developer at Boeing, focusing on CI/CD and cloud infrastructure management. Collaborating with teams to automate processes and improve system performance across environments.
Vulnerability & Configuration Management Engineer responsible for vulnerability management and remediation processes at Relax Gaming. Collaborate with IT teams to improve security measures across various platforms.
DevOps Engineer for designing and maintaining Azure - based hybrid cloud infrastructure for a company specializing in nature - based smart city solutions. Leading cloud architecture and mentoring engineers as part of a high - impact team.
SRE responsible for ensuring reliability and performance of IT systems at a digital transformation company specializing in public sector efficiency. Collaborating on system health, incident response, and automation tasks.
DevOps Senior role at Beyond Soluções managing CI/CD for .NET and Kubernetes applications. Collaborating on cloud solutions while fostering a culture of innovation and quality.