Senior Data Engineer · Azure Databricks

Building scalable lakehouse pipelines & AI-powered data products.

5+ years specializing in Azure Databricks — Delta Lake, Delta Live Tables, Unity Catalog, Asset Bundles, and Genie AI — across insurance, digital media, and analytics.

Toronto, ON, Canada · 5+ years experience · Databricks Certified (Professional) · Open to opportunities

Platforms & tooling I build with

Databricks · Microsoft Azure · AWS · Google Cloud · Apache Spark · Python · Apache Airflow · PostgreSQL · Neo4j · GitHub

01 — Selected work

Open source · GitHub

Databricks AI Demo

A hands-on Databricks demo exploring AI-powered data workflows on the lakehouse.


02 — Capabilities

Databricks

Delta Lake · Delta Live Tables · Unity Catalog · Workflows · Asset Bundles · Auto Loader · Genie AI · Lakebase · Databricks Apps

Languages

Python · PySpark · Spark SQL · SQL

Cloud

Azure ADLS · Azure DevOps · ADF · AWS S3 · GCP BigQuery · Pub/Sub · Dataflow

Tools & strengths

GitHub · Jira · Streamlit · Airflow · Medallion Architecture · CI/CD · Data Governance · Performance Tuning

03 — Experience

Zurich Canada · Apr 2024 – Present

Senior Data Engineer

Toronto, ON

  • Built an AI-powered insurance policy & claims app using Databricks Genie and Lakebase — natural-language querying for underwriters, reducing insight turnaround from days to seconds.
  • Architected medallion lakehouse pipelines (bronze → silver → gold) with Delta Lake, Unity Catalog governance, and Auto Loader.
  • Designed config-driven PySpark ETL frameworks across 10+ programs, cutting new source onboarding time by ~60%.
  • Modernized CI/CD with Databricks Asset Bundles and Azure DevOps across dev/staging/prod.
  • Optimized Spark pipelines via partitioning, caching, and Z-ordering — reducing job runtimes by ~40%.
GroupM · Aug 2021 – Apr 2024

Data Engineer

Toronto, ON

  • Built Databricks and PySpark pipelines on GCP integrating Pub/Sub, Dataflow, BigQuery, Bigtable, and Cloud Functions — improving reporting performance by 60%.
  • Delivered multi-cloud solutions across GCP, Azure, and AWS; led data governance for enterprise data warehousing.
Clue · Jan 2021 – Jul 2021

Data Engineer

Toronto, ON

  • Designed graph data models with Neo4j and Cypher; built ETL pipelines on AWS (S3, Athena, SageMaker, DataBrew) for digital marketing analytics.
Clue · Feb 2020 – Dec 2020

Data Analyst

Toronto, ON

  • Built Azure analytics services ingesting Salesforce data for media campaign reporting with near-zero downtime.
Mindshare · May 2019 – Feb 2020

Digital Analyst

Toronto, ON

  • Architected Datorama solutions for Nestlé and CPG clients, improving campaign ROI measurement.

04 — Education & certifications

M.Eng, Computer Engineering

University of Ottawa

B.E, Computer Science

Visvesvaraya Technological University

Databricks Certified Data Engineer Professional

Databricks · Dec 2024 – Dec 2026

AWS Certified Cloud Practitioner

Amazon Web Services

Let's build something