✏️ Portfolio Guide Update your links, project repos, certifications & personal details

Portfolio Edit Guide

Your Name & Title

Line 6
Browser tab title — replace Linda M-Okoronkwo with your name
Line 386
Nav bar brand name — replace with your display name
Line 400
Hero headline — replace name and Data Engineer role
Line 620
Footer name — update to match

Bio, Stats & Availability

Lines 415–418
Hero stat panel — update numbers: projects, years, platforms, uptime
Lines 434–436
About bio paragraphs — rewrite with your own story and specialisms
Line 578
Contact section availability blurb — update with your own message

Project GitHub Links — one per project

Line 490
Project 01 — dbt + Snowflake
href="https://github.com/YOUR-USERNAME/repo-name"
Line 522
Project 02 — Real-Time Pipeline
href="https://github.com/YOUR-USERNAME/repo-name"
Line 554
Project 03 — DW Migration
href="https://github.com/YOUR-USERNAME/repo-name"
Line 554
Project 04 — ADF Pipeline
href="https://github.com/YOUR-USERNAME/repo-name"
Line 554
Project 05 — Customer Analytics
href="https://github.com/YOUR-USERNAME/repo-name"

Certifications — click each card to verify

SnowPro
Replace YOUR-SNOWPRO-BADGE-ID with your Credly badge ID — or link to a PDF: href="certs/snowpro.pdf"
DP-203
Replace YOUR-DP203-BADGE-ID with your Microsoft / Credly badge ID
dbt
Replace YOUR-DBT-CERT-ID with your credential.net URL
PL-300
Replace YOUR-PL300-BADGE-ID — change cs-prog to cs-active when complete

Contact Links

CV Button
Nav + CV card — replace linda-mokoronkwo-cv.pdf with your uploaded CV filename
GitHub
Replace URL and display text with your real GitHub profile link
LinkedIn
Replace URL and display text with your real LinkedIn profile link
Email
Replace mailto:linda.mokoronkwo@email.com and the visible text with your real email

Publishing to GitHub Pages

Step 1
Rename this file to index.html
Step 2
Go to your GitHub repo → Add file → Upload files → drag in index.html
Step 3
Click Commit changes — site updates within 1–2 minutes at your GitHub Pages URL
Step 4
Hard refresh: Ctrl+Shift+R (Windows) or Cmd+Shift+R (Mac)
Data Engineer · Modern Data Stack

Linda
M-Okoronkwo.

I architect cloud-native data pipelines that transform raw, scattered data into clean, trusted intelligence — powering decisions that matter.

// at a glance
5+
Projects Shipped
3+
Years Experience
4
Cloud Platforms
99%
Pipeline Uptime
PythonAirflow PySparkSnowflake dbt CoreAzure Power BISQL
About Me

Engineering data that businesses trust.

I specialise in end-to-end data engineering on the modern data stack — from orchestrating robust Airflow ingestion workflows to building tested, documented dbt models and delivering clean Snowflake data marts that power real decisions.

With hands-on experience across Azure, Snowflake, PySpark, and dbt, I bring both engineering rigour and analytical thinking to every pipeline I build — ensuring data is not just available, but trustworthy and actionable.

Data Engineering
⚡ PySpark 🌬 Apache Airflow 🐍 Python 🗄 SQL
Cloud & Warehouse
❄️ Snowflake ☁️ Azure 🔷 Azure Data Factory 🔷 Azure Databricks
Transform & BI
🔧 dbt Core 📊 Power BI
DevOps & Tooling
🐳 Docker 🔀 Git 🐧 Linux ✅ Great Expectations
Featured Work

Projects

Five production-grade data engineering solutions — each grounded in a real problem, a deliberate approach, and a measurable outcome.

01 ELT · Analytics Engineering

End-to-End Stock Market Data Pipeline

Production-grade ELT pipeline centralising five source systems into Snowflake, transformed via a layered dbt architecture, orchestrated daily in Airflow, and served to business via Power BI.

⛔ The Problem

Manual download of stock price files, that requires cleaning them up in Excel, and uploading somewhere for the team to query. This takes hours, it's error-prone, and eventually delays decision making.

✅ The Approach

Three-layer dbt architecture (staging → intermediate → business-ready) on Snowflake. Daily Airflow DAGs. Automated dbt tests and documentation enforce quality end-to-end.

Snowflakedbt CoreApache Airflow PythonApache KafkaPower BI DockerMinIO
50+ dbt models
40% cost reduction
99.8% uptime
GitHub →
02 Streaming · Real-Time

Real time Ecommerce with Kafka and Snowflake

Event-driven streaming pipeline ingesting high-frequency data, processing with Apache Kafka Structured Streaming, and delivering live Power BI dashboards for operational monitoring.

⛔ The Problem

Modern ecommerce businesses generate thousands of customer events every second — product views, add-to-cart actions, checkouts, payments, and cancellations. Without a real-time data pipeline, these businesses face critical challenges.

✅ The Approach

Captured every ecommerce event instantly using Apache Kafka as the streaming backbone Processes and store events in Snowflake using a Bronze → Silver → Gold layered architecture for clean data delivery. With live business insights to stakeholders through a Power BI dashboard.

Apache KafkaPython SnowflakeGithubPower BI
24hr → 15min latency
Real-time alerting
Auto-scaling
GitHub →
03 Migration · Cloud Modernisation

Insurance Snowflake ELT Pipeline

Full migration from a MongoDB Atlas operational database, loads raw data into Snowflake via Airbyte.

⛔ The Problem

Manual processing of data prone to errors — slow queries, costly maintenance, no scalability, zero self-serve capability for business analysts.

✅ The Approach

Redesigned schemas for Snowflake's columnar engine. Easy transfer of incremental customer data with reliable pipelines with dbt for data transformation .

MongoDBSnowflake dbtPythonAirbyte
Zero-downtime cutover
60% faster queries
35% cost saving
GitHub →
04 Analytics · Insights

Power BI Healthcare Dashboard

An interactive Power BI dashboard designed to support the analysis of brain cancer patient data. This transforms complex clinical and demographic datasets into clear, actionable visual insights

⛔ The Problem

Identify which age groups or demographics are most affected by brain cancer. Delay in Spotting delays in the care pathway that may be affecting patient survival

✅ The Approach

Track tumour grade distribution across a patient population and filtering patient data by age, gender, diagnosis date, and treatment type

Power BIDAX Power QueryGitHub
3 sources unified
Metadata-driven
GitHub →
Let's Connect

Open to new opportunities.

I am actively seeking Data Engineer roles where I can design scalable pipelines and work on meaningful data challenges. Recruiter, hiring manager, or fellow data professional — I would love to hear from you.

Send a message