Selected system
Telecom network data lake
Designed and implemented data lake and warehouse foundations for mobile tower network events at petabyte scale and roughly five trillion events per day.
Open case studySelected work
A practical index of platform and data work: what problem it solved, my ownership, the system shape, the tradeoffs, and the outcomes that can be shared publicly.
Clear filtersResults
Use search or a focus filter when you want a narrower project list.
Selected system
Designed and implemented data lake and warehouse foundations for mobile tower network events at petabyte scale and roughly five trillion events per day.
Open case studySelected system
Built a real-time proximity pipeline that joined customer location events with points of interest so users could receive relevant offers when they came within roughly a one-kilometer radius.
Open case studySelected system
I architected and built a governed conversational data and visualization agent: it retrieves business knowledge, answers business questions, runs governed queries from that context, reasons over results, and builds charts without making the LLM the data boundary.
Open case studySelected system
Designed and built services that keep Hive metadata consistent across independent environments using real-time listener sync, daily reconciliation, expiry cleanup, one-time interval jobs, observability, and deployment hardening.
Open case studySelected system
Built browsing-log ingestion and analytics pipelines for safe-browsing classification, audience management, cohort creation, and pattern-based downstream data products.
Open case studySelected system
Built data-mesh platform capabilities around Kyuubi, custom engine routing, RBAC, secrets management, Trino query access, dbt transformations, DataHub metadata, and Metabase BI.
Open case studySelected system
Created an in-house YAML-driven CI/CD framework that let teams onboard projects with very little friction while keeping validation, security scans, deployment behavior, and Jira status updates standardized.
Open case studySelected system
Migrated 50+ Spark and data workloads to Red Hat OCP using Spark Operator, shared CI/CD foundations, containerized runtime patterns, and platform deployment conventions.
Open case studySelected system
Extended enterprise data access governance around Apache Ranger-based RBAC, an external attribute store, DataHub tag-driven policies, row-level security, masking, Trino integration, audit clarity, and local/containerized development paths.
Open case studySelected system
Changed Kyuubi engine selection so shared compute could route interactive or batch sessions using user group context.
Open case studyProject note
Built reusable analytics workflows for cross-shopping, category adjacency, aisle-flow, and store-flow analysis across departments, categories, and products.
Open case studyProject note
Built an in-house alerting and monitoring framework around Elastic Stack, Kafka, and custom services.
Open case studyProject note
Built a repeatable Apriori-based workflow for analyzing how products, categories, and departments are shopped together at scale.
Open case studyProject note
Built analysis workflows for measuring launch uplift, incremental sales contribution, and cannibalization within a retail category.
Open case studyProject note
Built this portfolio as an Astro app with structured projects, focused resume views, and an on-domain blog sourced from Wix.
Open case study