Senior ML Ops/DevOps developing MLOps platform components at Capco Poland for financial digital transformation. Responsibilities include CI/CD, model deployment, monitoring, and team collaboration.
Responsibilities
Design, build, and improve MLOps platform components that support the full model lifecycle (development à validation à deployment à monitoring).
Create reusable templates and standardized pipelines to reduce time-to-production and improve consistency across teams.
Implement robust deployment patterns for credit risk models (primarily batch; other patterns as required).
Build & maintain CI/CD pipelines using Jenkins and GitHub, with appropriate quality gates and traceability.
Automate environment configuration and repeatability using Ansible.
Implement model and pipeline monitoring covering operational health, data quality signals, and model performance/drift indicators.
Establish dashboards, alerting, and runbooks; partner with stakeholders to ensure alerts are actionable and aligned to business impact.
Drive continuous improvement through post-release reviews and reliability enhancements (no on-call requirement).
Work closely with credit risk modellers to productionise models built with tools such as TensorFlow, MLFlow, and similar.
Translate modelling needs into scalable engineering solutions, balancing pace with control expectations.
Mentor junior team members (nice-to-have) and contribute to shared engineering standards and documentation.
Requirements
5+ years’ experience across MLOps/DevOps/Platform Engineering, with a track record of delivering production-grade ML or data solutions.
Strong experience building CI/CD and automation using Jenkins and GitHub.
Strong experience with Airflow (Bash), Bash itself, and Groovy for pipeline automation.
Hands-on configuration automation using Ansible.
Strong coding/scripting capability in Python (including PySpark), plus working knowledge of Spark.
Experience with ML tooling such as MLFlow, TensorFlow, and similar, including model packaging and deployment considerations.
Proven ability to implement observability (metrics/logs/dashboards/alerting), with tooling flexibility (e.g., Grafana, Splunk, or similar).
Comfortable working in hybrid environments; experience with Hadoop and an ability to integrate with cloud services (preference for GCP).
Benefits
Flexible collaboration model based on a B2B contract
Principal Safety and Reliability Engineer developing and supporting safety design for mission - critical aerospace systems. Engaging in design reviews and ensuring compliance with requirements.
Cloud DevOps Engineer playing a pivotal role in developing migration plans for Coast Guard Cloud Architecture. Collaborating with teams to ensure effectiveness and best practices in cloud implementation.
Reliability Engineer III at Daimler Truck developing propulsion technology solutions for electrified and conventional axle components. Leading testing and validation for complex powertrain systems.
Electrical Reliability Engineer at Marathon Petroleum maintaining electrical equipment and systems. Collaborating with cross - functional teams and ensuring compliance with electrical codes and standards.
Senior DevOps Engineer focused on GCP platform engineering at healthtech startup. Collaborating with teams to enhance compute and networking capabilities.
SME DevOps Engineer delivering enhancements for enterprise data and analytics products across DoD organizations. Collaborating with government and industry partners to translate strategic requirements into scalable solutions.
DevOps Engineer designing CI/CD pipelines and managing Azure cloud infrastructure for leading organizations. Collaborating with global teams and automating deployment processes across projects.
Senior DevOps professional at iugu managing system reliability and performance in a dynamic environment. Collaborating with development teams and automating processes for efficiency.
Site Reliability Engineer ensuring platform stability and managing AWS migration. Focused on hands - on maintenance work and engineering automation for healthcare staffing platform.
Site Reliability Engineer maintaining the ShiftKey Marketplace platform while ensuring its stability and availability. Collaborating on infrastructure projects and support with a remote - first approach.