Senior Data Engineer joining Financial Crime team to build data pipelines for fraud detection. Working with complex datasets in Databricks and collaborating with cross-functional teams.
Responsibilities
Build and optimize data ingestion pipelines using Python and PySpark to collect and transform data from multiple sources (transactions, KYC, AML, authentication, devices, logs, etc.).
**Proficiency in SQL (PostGres preferred) **
**Design and maintain data model that support Financial Crime/Fraud detection, profiling, and entity resolution. **
**Implement data quality checks and ensure data reliability across environments.
**Collaborate closely with Data Scientists, Analysts, Compliance, Operations and our Product/Feature teams to operationalize models and rules. **
Utilize jobs, workflows, APIs and streaming to manage large-scale data processing workloads.
Integrate with external systems (e.g. sanctions, ID&V, biometrics and authentication systems) to enrich risk and identity data.
Support **automation and monitoring** of ETL processes to improve operational efficiency.
Requirements
Bachelor’s degree.
**5+ years of experience **
**Strong skills in Python, PySpark, Scala and Advanced SQL (preferably PostGres) **
**Hands-on experience with Databricks, Snowflake, Fabric or similar **
**Experience working with structured and unstructured data in a production environment. **
**Experience with Agentic AI, MLFlow, ML models, Eval **
**Secure Coding practices – testing/QA **
**Comfortable with cloud-based data platforms (preferably AWS). **
**Good communication skills in English — able to collaborate with cross-functional teams in an international environment. **
**Proficiency in working with Text, Delta, Parquet, JSON, CSV, and XML data formats. **
**Working knowledge of Spark structured streaming. **
**AWS infrastructure experience, specifically working with S3. **
**Solid understanding of git-based version control, DevOps, and CI/CD. **
**Experience of working on Atlassian stack a plus. **
**Knowledge of common web API frameworks and web services. **
Strong teamwork, relationship, and client management skills, and the ability to influence peers and senior management to accomplish team goals.
Willingness to embrace modern technology, best practice, and ways of work.
**Nice to Have: **
Experience in **Financial Crime/AML, KYC, **or** fraud detection** systems.
Familiarity with **Entity Resolution frameworks** (e.g., Quantexa, Sensing, open source Entity Resolution tools).
Experience with **data streaming frameworks** (Kafka, Spark Streaming, MQ).
Benefits
Be part of a **mission-driven** team tackling real-world financial crime problems.
Work with **modern data tech stack** with Agentic AI and advanced ML.
**Hybrid working model **with flexible hours.
International and collaborative culture — working with colleagues across **Vietnam, Singapore, Philippines and South Africa**.
Competitive salary, performance bonuses, and learning support.
Data Engineer building modern Data Lake architecture on AWS and implementing scalable ETL/ELT pipelines. Collaborating across teams for analytics and reporting on gaming platforms.
Chief Data Engineer leading Scania’s Commercial Data Engineering team for growing sustainable transport solutions. Focused on data products and pipelines for BI, analytics, and AI.
Data Engineer designing and building scalable ETL/ELT pipelines for enterprise - grade analytics solutions. Collaborating with product teams to deliver high - quality, secure, and discoverable data.
Entry - Level Data Engineer at GM, focusing on building large scale data platforms in cloud environments. Collaborating with data engineers and scientists while migrating systems to cloud solutions.
Data Engineer responsible for data integrations with AWS technology stack for Adobe's Digital Experience. Collaborating with multiple teams to conceptualize solutions and improve data ecosystem.
People Data Architect designing and managing people data analytics for Gen, delivering actionable insights for HR. Collaborating across teams to enhance data - driven decision - making.
Data Engineer role focused on shaping future connectivity for customers at Vodafone. Involves solving complex challenges in a diverse and inclusive environment.
VP, Senior Data Engineer responsible for designing and developing cloud data solutions for insider risk in Information Security at SMBC. Collaborating with multiple teams to enhance cybersecurity data platform.
Data Engineer responsible for architecting, developing, and maintaining Allegiant’s enterprise data infrastructure. Overseeing transition to cloud hosted data warehouse and developing next - generation data tools.
Senior Data Engineer developing Azure - based data solutions for clients in the Data & AI department. Collaborating with architects and consultants to enhance automated decision making.