Skip to content
Vijay Work Resume Blog Contact

Resume

Governed data platform and enterprise AI resume

I architect and build governed data platforms, text-to-data systems, metadata services, access-governance integrations, workflow orchestration, in-house CI/CD onboarding, Kubernetes/Spark migrations, and network-scale analytics pipelines.

Switch views to reorder the same experience around backend, platform, data, or AI roles. Current content refreshed May 2026.

Fine tune by focus

Projects

Relevant projects

Governed conversational data platform

I architected and built a governed conversational data and visualization agent: it retrieves business knowledge, answers business questions, runs governed queries from that context, reasons over results, and builds charts without making the LLM the data boundary.

Python FastAPI LangGraph Langfuse

Moved analytics discovery toward self-service by connecting a multi-LLM graph runtime with knowledge retrieval, governed Trino execution, answer reasoning, chart generation, prompt tracing, and a richer Open WebUI experience. The natural-language analytics path reduced query time by about 40% for supported workflows.

Read case study

Telecom network data lake

Designed and implemented data lake and warehouse foundations for mobile tower network events at petabyte scale and roughly five trillion events per day.

Flink Kafka Java Elastic Stack

Created the analytics foundation for network data products, reporting, and large-scale downstream consumption.

Read case study

Hive metastore synchronization and metadata governance

Designed and built services that keep Hive metadata consistent across independent environments using real-time listener sync, daily reconciliation, expiry cleanup, one-time interval jobs, observability, and deployment hardening.

Java Spark Airflow Apache Hive

Turned fragile metadata drift into an owned synchronization path with real-time event propagation, reconciliation, recovery behavior, logs, and platform deployment controls.

Read case study

CI/CD onboarding and developer-experience framework

Created an in-house YAML-driven CI/CD framework that let teams onboard projects with very little friction while keeping validation, security scans, deployment behavior, and Jira status updates standardized.

In-house CI/CD YAML Jenkins Docker Kubernetes (OCP)

Improved developer experience, reduced CI/CD onboarding time by about 40%, and increased CI/CD adoption several-fold by replacing bespoke Jenkins pipelines with a small project-level YAML file, reusable templates, automated quality gates, and Jira-triggered delivery flows.

Read case study

Experience

Relevant experience

Data Platform Engineer / Architect · Airtel Digital

I architect and build governed data platforms, metadata services, workflow orchestration, access-governance integrations, in-house CI/CD onboarding, and network-scale analytics systems.

2021 - present

  • Architected and built a governed conversational data platform for text-to-data workflows: multi-LLM subgraphs, knowledge retrieval, controlled query execution, answer reasoning, chart generation, prompt tracing, and platform orchestration.
  • Designed and implemented Hive metadata synchronization for independent Hive environments, combining real-time listener sync, daily reconciliation, missed-addition/removal repair, sync-duration expiry cleanup, one-time interval jobs, observability, and deployment hardening.
  • Built governed self-service data platform capabilities around Kyuubi customizations, Trino query access, DataHub metadata, dbt transformations, Metabase BI, RBAC, and secrets management.

Data Science Engineer · dunnhumby

I worked between data science and big-data platform teams, turning statistical and ML analysis into reusable pipelines, data marts, reporting products, and client-ready analytics workflows.

2018 - 2021

  • Worked as a bridge between data science and big data platform teams by turning analytical work into reusable products and repeatable pipelines.
  • Built customer segmentation, data marts, reporting platforms, and retail optimization analytics for large datasets, including category/adjacency work associated with up to 20% sales-growth impact.
  • Created reusable retail analytics workflows across customer priority assortment, cross-shopping behavior, direct-mail promotion planning, category uplift, and seasonality analysis.

Software Engineer · Mphasis

I worked on mainframe and big-data systems for insurance and telecom clients, including Spark migration work and automation that saved 500+ manual hours per year.

2014 - 2018

  • Worked in both mainframe and big data ecosystems and contributed to migration from mainframe workloads to Spark-based processing.
  • Built automations that saved 500+ manual hours per year by removing repetitive workflow steps.
  • Worked as a developer for insurance and telecom clients across enterprise systems and data workflows.

Skill stack

Primary skills

Spark Flink Python Java SQL Kubernetes (OCP) Kafka Cloud (GCP, AWS, Azure)

Supporting skills

Elastic Stack Airflow Trino Power BI Spark Operator Scala

Additional skills

Linux Apache Kyuubi JavaScript Apache Iceberg Apache Hudi MongoDB