Portfolio
47+ projects.
Measurable outcomes.
A selection of enterprise engagements spanning data engineering, AI, and cloud — each defined by the business problem we solved and the quantified impact we delivered.
47+
Total Projects
19
Data Engineering
6
AI & ML
22
Cloud Platforms
Featured Case Studies — Full Detail
Additional Engagements
NLP Document Intelligence Platform
Automated extraction and classification of 2M+ policy and claims documents using Azure Document Intelligence and custom NLP models, eliminating 120 FTE-hours of manual processing per week.
Computer Vision Quality Control System
Deployed edge-optimised computer vision models on production lines to detect surface defects at 99.4% accuracy, replacing 100% manual inspection and reducing defect escape rate 91%.
Customer Churn Prediction Engine
Built an XGBoost churn propensity model scoring 8M subscribers weekly, integrated with CRM for automated retention campaign triggering — reducing 90-day churn 22%.
Azure Enterprise Landing Zone
Designed and deployed a enterprise-scale Azure Landing Zone for a FTSE-100 bank — hub-spoke network topology, Azure Policy guardrails, RBAC, and Private Endpoints — passing internal security review first time.
AWS EKS Containerised Data Platform
Migrated fragmented data workloads to a containerised platform on Amazon EKS with Helm-managed Spark operators, autoscaling node groups, and GitOps deployment via ArgoCD.
GCP Vertex AI MLOps Platform
Built an end-to-end MLOps platform on GCP Vertex AI with Kubeflow pipelines, model registry, automated retraining triggers, and Looker dashboards for model performance monitoring.
Azure Arc Hybrid Cloud Management
Extended Azure management plane to 40 on-premise Kubernetes clusters across global manufacturing sites via Azure Arc — unified policy enforcement, monitoring, and GitOps deployment without full cloud migration.
AWS Lake Formation Data Mesh
Implemented a domain-oriented data mesh on AWS Lake Formation — delegated admin to 8 business domains, fine-grained column-level access, and self-serve data cataloguing via AWS Glue.
Azure FinOps & Cost Optimisation
Delivered a FinOps engagement reducing Azure spend 43% through Reserved Instance purchasing strategy, autoscale policies, orphaned resource cleanup, and a real-time cost anomaly alerting framework.
GCP Dataform Analytics Pipeline
Replaced ad-hoc BigQuery SQL scripts with a fully governed Dataform pipeline — version-controlled transformations, automated testing, dependency graphs, and scheduled incremental loads across 120 tables.
Multi-Cloud Disaster Recovery Architecture
Designed and tested a cross-cloud DR architecture spanning Azure Primary and AWS Standby — achieving RPO <1 hour and RTO <4 hours for Tier-1 trading applications, satisfying FCA operational resilience requirements.
Snowflake Enterprise Data Cloud
Migrated a tier-2 investment bank from on-premise Oracle EDW to Snowflake — zero-copy cloning for dev/test, column-level masking for PII, and automated data sharing with 6 external counterparties.
Azure Databricks Unified Analytics
Consolidated fragmented Jupyter notebooks and legacy R scripts into a governed Databricks Unity Catalog environment — enabling reproducible clinical analytics across 4 hospital trusts with full audit lineage.
AWS Well-Architected Review & Remediation
Conducted a full AWS Well-Architected review across 5 pillars for a SaaS company and executed a 90-day remediation roadmap — eliminating 38 high-risk findings and reducing blast radius of a potential security incident by 80%.
Azure Kubernetes Service Data Platform
Designed an AKS-based containerised data platform for clinical trial data processing — Helm-managed Spark jobs, Vault secrets management, and fully automated CI/CD with compliance-grade audit trails.
Microsoft Purview Data Governance
Implemented Microsoft Purview across a 40-domain data estate — automated scanning, business glossary with 3,000+ terms, data lineage from source to report, and sensitivity labels enforced at column level.
Snowflake Analytics Platform on AWS
Built an actuarial and claims analytics platform on Snowflake (hosted AWS) for a Lloyd's of London syndicate — replacing Excel-based reserving models with governed, version-controlled dbt transformations.
GCP Anthos Hybrid Cloud Deployment
Extended GCP management and policy to 15 on-premise data centre locations using Anthos — unified observability via Cloud Monitoring, consistent Kubernetes policy via Config Sync, and zero-trust network controls.
AWS Control Tower Multi-Account Foundation
Established a secure multi-account AWS foundation for a central government department — Control Tower landing zone, Service Control Policies, centralised logging to S3, and NCSC cloud security principles compliance.
GCP Dataproc Spark Migration
Lifted and shifted 200+ on-premise Hadoop Spark jobs to GCP Dataproc with ephemeral clusters — eliminating always-on cluster costs and cutting job execution time 45% via Dataproc Metastore and persistent shuffle.
Azure DevOps Data Engineering Platform
Introduced DataOps practices across a 30-engineer data team — Azure DevOps pipelines for dbt CI/CD, automated Great Expectations data quality gates, environment promotion, and branch-based development workflows.
Snowflake Data Sharing Platform
Built a Snowflake-native data marketplace enabling secure, zero-copy audience data sharing between a media group and 12 advertiser partners — replacing manual file transfers and eliminating data replication costs.
Azure Monitor Unified Observability
Deployed a centralised observability stack across an energy group's 6 Azure subscriptions — Log Analytics workspaces, custom KQL dashboards, alert routing to PagerDuty, and cost anomaly detection.
Change Data Capture Pipeline
Implemented near-real-time CDC from 12 SQL Server source systems to Azure Data Lake using Debezium on Azure Event Hubs, replacing 6-hour batch windows with sub-5-minute data freshness.
Enterprise Apache Airflow Platform
Centralised workflow orchestration replacing 5 disparate schedulers (SSIS, cron, Control-M, Autosys, and custom scripts) onto a single governed Airflow instance on Azure Kubernetes Service.
Data Vault 2.0 Implementation
Designed and implemented a DV2.0 model on Azure Synapse enabling full historical replay, satellite-based time travel, and regulatory auditability — replacing a brittle star-schema DW that failed monthly reconciliation checks.
Azure Synapse Link for Cosmos DB
Enabled zero-ETL operational analytics over a 500M-document Cosmos DB collection using Synapse Link analytical store — allowing marketing and ops teams to query live transactional data without any pipeline latency.
Ready to add your project to this list?
Every project starts with a conversation. Tell us about your data challenge and we'll tell you how we'd approach it.
Start a Project