Skip to content
Vijay Work Resume Blog Contact

Resume

Platform and Reliability Engineer

I build reusable platform capabilities around orchestration, CI/CD onboarding, Kubernetes/Spark Operator migrations, access governance, metadata synchronization, observability, and controlled query execution.

Switch views to reorder the same experience around backend, platform, data, or AI roles. Current content refreshed May 2026.

Fine tune by focus

Projects

Relevant projects

Governed conversational data platform

I architected and built a governed conversational data and visualization agent: it retrieves business knowledge, answers business questions, runs governed queries from that context, reasons over results, and builds charts without making the LLM the data boundary.

Python FastAPI LangGraph Langfuse

Moved analytics discovery toward self-service by connecting a multi-LLM graph runtime with knowledge retrieval, governed Trino execution, answer reasoning, chart generation, prompt tracing, and a richer Open WebUI experience. The natural-language analytics path reduced query time by about 40% for supported workflows.

Read case study

CI/CD onboarding and developer-experience framework

Created an in-house YAML-driven CI/CD framework that let teams onboard projects with very little friction while keeping validation, security scans, deployment behavior, and Jira status updates standardized.

In-house CI/CD YAML Jenkins Docker Kubernetes (OCP)

Improved developer experience, reduced CI/CD onboarding time by about 40%, and increased CI/CD adoption several-fold by replacing bespoke Jenkins pipelines with a small project-level YAML file, reusable templates, automated quality gates, and Jira-triggered delivery flows.

Read case study

Kubernetes and Spark Operator migration

Migrated 50+ Spark and data workloads to Red Hat OCP using Spark Operator, shared CI/CD foundations, containerized runtime patterns, and platform deployment conventions.

Kubernetes (OCP) Spark Operator Spark In-house CI/CD YAML

Reduced infrastructure cost by about 30% while improving workload isolation, resource management, deployment repeatability, and scalability for data workloads.

Read case study

Ranger RBAC and policy-governance extensions

Extended enterprise data access governance around Apache Ranger-based RBAC, an external attribute store, DataHub tag-driven policies, row-level security, masking, Trino integration, audit clarity, and local/containerized development paths.

Java Apache Ranger Trino DataHub

Made access policy behavior more expressive and supportable by marrying tag-based governance with row-level security, masking, extensible attributes, query-engine integration, and audit/error visibility.

Read case study

Experience

Relevant experience

Data Platform Engineer / Architect · Airtel Digital

I architect and build governed data platforms, metadata services, workflow orchestration, access-governance integrations, in-house CI/CD onboarding, and network-scale analytics systems.

2021 - present

  • Built governed self-service data platform capabilities around Kyuubi customizations, Trino query access, DataHub metadata, dbt transformations, Metabase BI, RBAC, and secrets management.
  • Migrated 50+ Spark and data workloads to Red Hat OCP using Spark Operator, shared CI/CD, Docker/Helm packaging, and Kubernetes runtime conventions, reducing infrastructure cost by about 30%.
  • Extended Apache Ranger-based access governance with an external attribute store, DataHub tag-driven policies, row-level security, masking, Trino integration, clearer audit/error paths, and local/containerized runtime support.

Software Engineer · Mphasis

I worked on mainframe and big-data systems for insurance and telecom clients, including Spark migration work and automation that saved 500+ manual hours per year.

2014 - 2018

  • Built automations that saved 500+ manual hours per year by removing repetitive workflow steps.
  • Worked in both mainframe and big data ecosystems and contributed to migration from mainframe workloads to Spark-based processing.
  • Worked as a developer for insurance and telecom clients across enterprise systems and data workflows.

Data Science Engineer · dunnhumby

I worked between data science and big-data platform teams, turning statistical and ML analysis into reusable pipelines, data marts, reporting products, and client-ready analytics workflows.

2018 - 2021

  • Worked as a bridge between data science and big data platform teams by turning analytical work into reusable products and repeatable pipelines.
  • Built customer segmentation, data marts, reporting platforms, and retail optimization analytics for large datasets, including category/adjacency work associated with up to 20% sales-growth impact.
  • Created reusable retail analytics workflows across customer priority assortment, cross-shopping behavior, direct-mail promotion planning, category uplift, and seasonality analysis.

Skill stack

Primary skills

Kubernetes (OCP) Spark Operator Cloud (GCP, AWS, Azure) Elastic Stack Apache Ranger Linux Airflow Grafana

Supporting skills

Helm In-house CI/CD YAML Kafka Jenkins Apache Hive Apache Kyuubi

Additional skills

Docker DataHub Apache SeaTunnel Influx Jira Geospatial streaming