Projects

Data engineering engagements across financial services, retail, and high-tech — real-time pipelines, cloud migrations, and lakehouse platforms on Azure.

Now
2023
Active Oct 2023 – Present

Real-time Processing of Machine Data

ASML Senior Data Engineer

Migrated the machine-data pipeline from Databricks Connect to PySpark Streaming on Azure Databricks with Unity Catalog, enabling near-real-time processing of equipment telemetry.

Also:

  • Stabilised pipeline under production load and introduced end-to-end observability.
  • Configured pytest-based unit and integration testing with pre-commit hooks and mandatory PR quality gates.
  • Delivered documentation-as-code for the project using MkDocs.
  • Migrated dev environment from Windows to Ubuntu via WSL2.
Azure Databricks Unity Catalog Python Kubernetes Terraform
2022
Completed Mar 2022 – Oct 2023

Retail Lakehouse Analytics Platform

Ahold Delhaize Senior Data Engineer

Part of the Fundament team building the reusable data ingestion and processing platform underpinning Ahold Delhaize's Lakehouse medallion architecture.

  • Designed and delivered config-driven, YAML-based universal pipelines (Kappa/Streaming-only) for Bronze ingestion from multiple source systems and Silver-layer deduplication and cleanup transformations.
  • Onboarded 400+ data sources onto the platform.
  • Handled high-throughput real-time ingestion from thousands of POS terminals and warehouse movement systems — processing millions of semi-structured records per hour under production load.
  • Implemented data masking and transparent data encryption for sensitive personal and financial data.
Azure Databricks Kafka Streaming GitHub Actions Terraform Bicep
2021
Completed Mar 2021 – Mar 2022

Legacy Teradata to Cloud Data Mesh Migration

ABN AMRO Bank N.V. Senior Data Engineer

Migrated a legacy Teradata solution to a cloud-native Data Mesh architecture on Azure. Rebuilt analytical pipelines using PySpark on Databricks with Apache Hive and Data Lake storage, orchestrated via Airflow and Azure Data Factory.

Azure Databricks PySpark Apache Hive Teradata Apache Airflow Azure Data Factory
2020
Completed Feb 2020 – Mar 2021

Azure DevOps Automation & Cloud Database Migration

Rabobank Azure DevOps / Database Engineer

Built Azure DevOps deployment pipelines and automation for cloud migrations covering Azure SQL Database, Data Factory, and Storage. Developed reusable PowerShell modules and introduced telemetry and measurement tooling across the data platform.

Azure DevOps Azure SQL Database Data Factory PowerShell Azure Storage
Collaboration

Have a data challenge?

I work with teams across financial services, retail, and technology to deliver data platforms that scale. Let's talk about your project.