Senior Data Engineer joining Financial Crime team to build data pipelines for fraud detection. Working with complex datasets in Databricks and collaborating with cross-functional teams.
Responsibilities
Build and optimize data ingestion pipelines using Python and PySpark to collect and transform data from multiple sources (transactions, KYC, AML, authentication, devices, logs, etc.).
**Proficiency in SQL (PostgreSQL preferred).**
**Design and maintain data models that support Financial Crime/Fraud detection, profiling, and entity resolution.**
**Implement data quality checks and ensure data reliability across environments.**
**Collaborate closely with Data Scientists, Analysts, Compliance, Operations and our Product/Feature teams to operationalize models and rules.**
Utilize jobs, workflows, APIs and streaming to manage large-scale data processing workloads.
Integrate with external systems (e.g. sanctions, ID&V, biometrics and authentication systems) to enrich risk and identity data.
Support **automation and monitoring** of ETL processes to improve operational efficiency.
Requirements
Bachelor’s degree.
**5+ years of experience**
**Strong skills in Python, PySpark, Scala, and advanced SQL (preferably PostgreSQL)**
**Hands-on experience with Databricks, Snowflake, Fabric or similar**
**Experience working with structured and unstructured data in a production environment.**
**Experience with agentic AI, MLflow, ML models, and model evaluation**
**Secure coding practices – testing/QA**
**Comfortable with cloud-based data platforms (preferably AWS).**
**Good communication skills in English, with the ability to collaborate with cross-functional teams in an international environment.**
**Proficiency in working with Text, Delta, Parquet, JSON, CSV, and XML data formats.**
**Working knowledge of Spark Structured Streaming.**
**AWS infrastructure experience, specifically working with S3.**
**Solid understanding of Git-based version control, DevOps, and CI/CD.**
**Experience working with the Atlassian stack is a plus.**
**Knowledge of common web API frameworks and web services.**
Strong teamwork, relationship, and client management skills, and the ability to influence peers and senior management to accomplish team goals.
Willingness to embrace modern technology, best practices, and ways of working.
**Nice to Have:**
Experience in **Financial Crime/AML, KYC,** or **fraud detection** systems.
Familiarity with **Entity Resolution frameworks** (e.g., Quantexa, Senzing, open source Entity Resolution tools).
Experience with **data streaming frameworks** (Kafka, Spark Streaming, MQ).
Benefits
Be part of a **mission-driven** team tackling real-world financial crime problems.
Work with a **modern data tech stack** with Agentic AI and advanced ML.
**Hybrid working model** with flexible hours.
International and collaborative culture — working with colleagues across **Vietnam, Singapore, Philippines and South Africa**.
Competitive salary, performance bonuses, and learning support.