Portfolio

47+ projects.
Measurable outcomes.

A selection of enterprise engagements spanning data engineering, AI, and cloud — each defined by the business problem we solved and the quantified impact we delivered.

47+

Total Projects

19

Data Engineering

6

AI & ML

22

Cloud Platforms

Featured Case Studies — Full Detail

Additional Engagements

AI & MLInsurance

NLP Document Intelligence Platform

Automated extraction and classification of 2M+ policy and claims documents using Azure Document Intelligence and custom NLP models, eliminating 120 FTE-hours of manual processing per week.

Azure Document IntelligenceAzure OpenAIPython+1
120 FTE-hours saved per week
AI & MLManufacturing

Computer Vision Quality Control System

Deployed edge-optimised computer vision models on production lines to detect surface defects at 99.4% accuracy, replacing 100% manual inspection and reducing defect escape rate 91%.

Azure Custom VisionAzure IoT EdgePython+1
91% reduction in defect escapes
AI & MLTelecom

Customer Churn Prediction Engine

Built an XGBoost churn propensity model scoring 8M subscribers weekly, integrated with CRM for automated retention campaign triggering — reducing 90-day churn 22%.

Azure MLXGBoostDatabricks+2
22% reduction in 90-day churn
CloudFinancial Services

Azure Enterprise Landing Zone

Designed and deployed a enterprise-scale Azure Landing Zone for a FTSE-100 bank — hub-spoke network topology, Azure Policy guardrails, RBAC, and Private Endpoints — passing internal security review first time.

Azure Landing ZoneTerraformAzure Policy+2
Zero security findings on first audit
CloudTechnology

AWS EKS Containerised Data Platform

Migrated fragmented data workloads to a containerised platform on Amazon EKS with Helm-managed Spark operators, autoscaling node groups, and GitOps deployment via ArgoCD.

Amazon EKSApache SparkHelm+3
60% infrastructure cost reduction
CloudRetail

GCP Vertex AI MLOps Platform

Built an end-to-end MLOps platform on GCP Vertex AI with Kubeflow pipelines, model registry, automated retraining triggers, and Looker dashboards for model performance monitoring.

GCP Vertex AIKubeflow PipelinesBigQuery+3
Model deployment cycle: 3 weeks → 2 days
CloudManufacturing

Azure Arc Hybrid Cloud Management

Extended Azure management plane to 40 on-premise Kubernetes clusters across global manufacturing sites via Azure Arc — unified policy enforcement, monitoring, and GitOps deployment without full cloud migration.

Azure ArcKubernetesAzure Policy+2
40 sites under unified governance
CloudMedia & Entertainment

AWS Lake Formation Data Mesh

Implemented a domain-oriented data mesh on AWS Lake Formation — delegated admin to 8 business domains, fine-grained column-level access, and self-serve data cataloguing via AWS Glue.

AWS Lake FormationAWS GlueAmazon S3+3
8 autonomous data domains in production
CloudEnterprise

Azure FinOps & Cost Optimisation

Delivered a FinOps engagement reducing Azure spend 43% through Reserved Instance purchasing strategy, autoscale policies, orphaned resource cleanup, and a real-time cost anomaly alerting framework.

Azure Cost ManagementTerraformAzure Policy+2
43% Azure cost reduction in 90 days
CloudRetail

GCP Dataform Analytics Pipeline

Replaced ad-hoc BigQuery SQL scripts with a fully governed Dataform pipeline — version-controlled transformations, automated testing, dependency graphs, and scheduled incremental loads across 120 tables.

GCP DataformBigQueryCloud Composer+2
120 tables governed with zero manual SQL
CloudFinancial Services

Multi-Cloud Disaster Recovery Architecture

Designed and tested a cross-cloud DR architecture spanning Azure Primary and AWS Standby — achieving RPO <1 hour and RTO <4 hours for Tier-1 trading applications, satisfying FCA operational resilience requirements.

Azure Site RecoveryAWS Route 53Terraform+2
RPO <1 h · RTO <4 h — FCA compliant
CloudFinancial Services

Snowflake Enterprise Data Cloud

Migrated a tier-2 investment bank from on-premise Oracle EDW to Snowflake — zero-copy cloning for dev/test, column-level masking for PII, and automated data sharing with 6 external counterparties.

SnowflakedbtAzure Data Factory+2
Oracle EDW replaced; 6 external data shares live
CloudHealthcare

Azure Databricks Unified Analytics

Consolidated fragmented Jupyter notebooks and legacy R scripts into a governed Databricks Unity Catalog environment — enabling reproducible clinical analytics across 4 hospital trusts with full audit lineage.

Azure DatabricksUnity CatalogDelta Lake+2
4 hospital trusts on single governed platform
CloudTechnology

AWS Well-Architected Review & Remediation

Conducted a full AWS Well-Architected review across 5 pillars for a SaaS company and executed a 90-day remediation roadmap — eliminating 38 high-risk findings and reducing blast radius of a potential security incident by 80%.

AWS Security HubAWS ConfigAWS Organizations+2
38 high-risk findings resolved in 90 days
CloudPharmaceuticals

Azure Kubernetes Service Data Platform

Designed an AKS-based containerised data platform for clinical trial data processing — Helm-managed Spark jobs, Vault secrets management, and fully automated CI/CD with compliance-grade audit trails.

Azure Kubernetes ServiceApache SparkHashiCorp Vault+3
FDA 21 CFR Part 11 compliant deployment
CloudBanking

Microsoft Purview Data Governance

Implemented Microsoft Purview across a 40-domain data estate — automated scanning, business glossary with 3,000+ terms, data lineage from source to report, and sensitivity labels enforced at column level.

Microsoft PurviewAzure Data FactoryPower BI+2
3,000+ terms; end-to-end lineage across 40 domains
CloudInsurance

Snowflake Analytics Platform on AWS

Built an actuarial and claims analytics platform on Snowflake (hosted AWS) for a Lloyd's of London syndicate — replacing Excel-based reserving models with governed, version-controlled dbt transformations.

SnowflakeAWS S3dbt+3
Reserving cycle cut from 5 days to 4 hours
CloudLogistics

GCP Anthos Hybrid Cloud Deployment

Extended GCP management and policy to 15 on-premise data centre locations using Anthos — unified observability via Cloud Monitoring, consistent Kubernetes policy via Config Sync, and zero-trust network controls.

GCP AnthosKubernetesCloud Monitoring+3
15 on-premise sites under unified cloud governance
CloudGovernment

AWS Control Tower Multi-Account Foundation

Established a secure multi-account AWS foundation for a central government department — Control Tower landing zone, Service Control Policies, centralised logging to S3, and NCSC cloud security principles compliance.

AWS Control TowerAWS OrganizationsAWS Config+3
NCSC cloud principles met on day one
CloudManufacturing

GCP Dataproc Spark Migration

Lifted and shifted 200+ on-premise Hadoop Spark jobs to GCP Dataproc with ephemeral clusters — eliminating always-on cluster costs and cutting job execution time 45% via Dataproc Metastore and persistent shuffle.

GCP DataprocApache SparkCloud Storage+3
200+ Spark jobs migrated; 45% faster execution
CloudRetail

Azure DevOps Data Engineering Platform

Introduced DataOps practices across a 30-engineer data team — Azure DevOps pipelines for dbt CI/CD, automated Great Expectations data quality gates, environment promotion, and branch-based development workflows.

Azure DevOpsdbtGreat Expectations+3
Deployment frequency: monthly → daily
CloudMedia & Entertainment

Snowflake Data Sharing Platform

Built a Snowflake-native data marketplace enabling secure, zero-copy audience data sharing between a media group and 12 advertiser partners — replacing manual file transfers and eliminating data replication costs.

SnowflakeSnowflake Data SharingPython+2
12 advertisers onboarded; zero data replication
CloudEnergy

Azure Monitor Unified Observability

Deployed a centralised observability stack across an energy group's 6 Azure subscriptions — Log Analytics workspaces, custom KQL dashboards, alert routing to PagerDuty, and cost anomaly detection.

Azure MonitorLog AnalyticsAzure Application Insights+3
MTTR reduced 65% across 6 subscriptions
Data EngineeringBanking

Change Data Capture Pipeline

Implemented near-real-time CDC from 12 SQL Server source systems to Azure Data Lake using Debezium on Azure Event Hubs, replacing 6-hour batch windows with sub-5-minute data freshness.

DebeziumAzure Event HubsAzure Data Factory+2
Data freshness: 6 h → <5 min
Data EngineeringInsurance

Enterprise Apache Airflow Platform

Centralised workflow orchestration replacing 5 disparate schedulers (SSIS, cron, Control-M, Autosys, and custom scripts) onto a single governed Airflow instance on Azure Kubernetes Service.

Apache AirflowAzure Kubernetes ServiceAzure DevOps+3
5 schedulers consolidated into 1
Data EngineeringFinancial Services

Data Vault 2.0 Implementation

Designed and implemented a DV2.0 model on Azure Synapse enabling full historical replay, satellite-based time travel, and regulatory auditability — replacing a brittle star-schema DW that failed monthly reconciliation checks.

Azure Synapse AnalyticsData Vault 2.0dbt+2
Passed 3 consecutive regulatory reconciliation checks
Data EngineeringE-Commerce

Azure Synapse Link for Cosmos DB

Enabled zero-ETL operational analytics over a 500M-document Cosmos DB collection using Synapse Link analytical store — allowing marketing and ops teams to query live transactional data without any pipeline latency.

Azure Synapse AnalyticsAzure Cosmos DBSynapse Link+2
Zero-ETL analytics over 500M live documents

Ready to add your project to this list?

Every project starts with a conversation. Tell us about your data challenge and we'll tell you how we'd approach it.

Start a Project