RIPUNJAY SINGH

I Don't Build Wrappers. I Build Infrastructure.

Production multi-agent systems. Custom LoRA-tuned LLMs. Distributed inference pipelines processing 10,000+ daily requests. I architect the AI infrastructure that enterprise clients trust to ship.

Multi-Agent Systems

LangGraph, LangChain, LlamaIndex

Custom LLM Training

LoRA, vLLM, KServe, MLflow

Cloud & DevOps

AWS, Kubernetes, Terraform, Docker

Production Backend

FastAPI, Neo4j, Redis, Celery

The Problem

Your AI Is Held Together With API Calls & Prayers.

Most “AI products” are thin wrappers around OpenAI. One rate limit, one policy change, or one outage, and your entire system folds. I build the opposite: custom-trained models you own, multi-agent orchestration that self-heals, and infrastructure that scales without vendor lock-in.

95% Validation Accuracy
-70% Inference Cost
10K+ Daily API Requests
40% Faster Inference
❌ What most do

openai.chat.completions.create()

Vendor-locked. No fallback. No ownership.

✓ What I build

LoRA → vLLM → KServe → K8s → Prometheus

Custom models. Your data. Your infrastructure. Zero lock-in.

Your stack vs. mine
Proof of Work

Built. Shipped. In Production.

Not concept demos. Not hackathon prototypes. These systems process real data, serve real users, and run 24/7 without supervision.

LIVE — processing documents
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
[Agent: DocProcessor] → Extracting fields...
[Agent: Normalizer] → Standardizing format...
[Agent: Validator] → Cross-referencing rules...
[Agent: RepairBot] → Auto-correcting entry...
[Agent: Reporter] → Generating audit trail...
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
✓ Document validated — 95% confidence
150+ reports generated today
01

Autonomous Document Validation

Visa2fly — Production System

LangGraph-powered multi-agent system that autonomously validates complex visa documents across global immigration rules. Seven specialized AI agents work in concert — processing, normalizing, validating, repairing, and reporting — without human intervention.

  • 95% validation accuracy
  • 100+ concurrent requests daily
  • 7 orchestrated AI agents
  • 150+ audit reports per day
LangGraph, FastAPI, Neo4j, Redis, Celery, MLX-VLM
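The validate-and-repair loop described above can be sketched in plain Python. This is a minimal illustration, not the production LangGraph graph: the `DocState` shape, the 8-character passport rule, and the `max_repairs` cap are all hypothetical stand-ins chosen to keep the example self-contained.

```python
from dataclasses import dataclass, field


@dataclass
class DocState:
    # Hypothetical state shape; the production schema is not shown here.
    fields: dict
    errors: list = field(default_factory=list)
    audit: list = field(default_factory=list)


def normalize(state: DocState) -> DocState:
    # Standardize format: trim whitespace and upper-case every string value.
    state.fields = {
        k: v.strip().upper() if isinstance(v, str) else v
        for k, v in state.fields.items()
    }
    state.audit.append("normalized")
    return state


def validate(state: DocState) -> DocState:
    # Toy rule (an assumption): a passport number is exactly 8 characters.
    passport = state.fields.get("passport_no", "")
    state.errors = [] if len(passport) == 8 else ["passport_no: bad length"]
    state.audit.append(f"validated ({len(state.errors)} errors)")
    return state


def repair(state: DocState) -> DocState:
    # Toy auto-correction: drop spaces and hyphens left in the number.
    cleaned = state.fields.get("passport_no", "").replace(" ", "").replace("-", "")
    state.fields["passport_no"] = cleaned
    state.audit.append("repaired")
    return state


def run_pipeline(state: DocState, max_repairs: int = 2) -> DocState:
    state = validate(normalize(state))
    attempts = 0
    # Route back through repair -> validate while errors remain, with a cap
    # so a document that cannot be fixed does not loop forever.
    while state.errors and attempts < max_repairs:
        state = validate(repair(state))
        attempts += 1
    state.audit.append("report: " + ("pass" if not state.errors else "fail"))
    return state


result = run_pipeline(DocState(fields={"passport_no": " ab-123456 "}))
print(result.fields["passport_no"], "|", result.audit[-1])
```

In a graph framework, the `while` loop would typically become a conditional edge from the validator back to the repair node; the cap on repair attempts plays the same role in either form.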
02

Custom LLM Training & Serving

Inference Cost Reduction

Stop burning API credits. Domain-specific language models fine-tuned with LoRA on proprietary travel and visa datasets, served on an optimized vLLM/KServe stack with intelligent caching via LMCache. Your model, your data, your infrastructure.

-70% Inference Cost
+40% Faster Response
LoRA, vLLM, KServe, MLflow, LMCache, Kubernetes
Before: OpenAI API, $12,000/mo
After: Custom vLLM, $3,600/mo
$8,400 saved per month
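The savings figure follows directly from the two monthly costs. A short check, using only the numbers quoted above (the helper name `monthly_savings` is illustrative):

```python
def monthly_savings(before: float, after: float) -> tuple[float, float]:
    """Return (absolute saving, percentage reduction) for two monthly costs."""
    saved = before - after
    return saved, 100 * saved / before


saved, pct = monthly_savings(12_000, 3_600)
print(f"${saved:,.0f} saved per month ({pct:.0f}% reduction)")
# prints: $8,400 saved per month (70% reduction)
```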
[Diagram: Query · Rules · Docs · Context · Response]
03

Context-Aware RAG System

Neo4j Knowledge Graph

Retrieval-Augmented Generation pipeline powered by a Neo4j knowledge graph storing 10,000+ validation rules and document relationships. Graph-based context retrieval delivers precise, regulation-compliant answers — not hallucinations.

  • 10,000+ validation rules indexed
  • Graph-based semantic retrieval
  • Vector + Knowledge Graph hybrid search
  • Regulatory compliance guaranteed
Neo4j, Qdrant, LangChain, FastAPI, Redis
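One common way to merge vector results with knowledge-graph results is reciprocal rank fusion (RRF). The sketch below is an assumption about how such a hybrid search could be combined, not a description of the fusion method this system uses; the rule IDs are made up, and `k = 60` is the constant conventionally used with RRF.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Merge several ranked lists; items ranked high in many lists win."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, item in enumerate(ranking, start=1):
            scores[item] += 1.0 / (k + rank)
    # Sort by descending score, breaking ties alphabetically for determinism.
    return sorted(scores, key=lambda item: (-scores[item], item))


# Hypothetical result lists from the two retrievers, best match first.
vector_hits = ["rule_12", "rule_7", "rule_3"]
graph_hits = ["rule_7", "rule_42", "rule_12"]

fused = reciprocal_rank_fusion([vector_hits, graph_hits])
print(fused)
# prints: ['rule_7', 'rule_12', 'rule_42', 'rule_3']
```

RRF only needs rank positions, so it avoids having to normalize similarity scores from the vector store against traversal scores from the graph.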
Production Infrastructure

Ship It.
Keep It Alive.

Building is 10% of the work. The other 90% is keeping it alive under load. Automated CI/CD, real-time observability, auto-scaling infrastructure, and zero-downtime deployments — all battle-tested at scale.

Daily Requests: 10K+
Latency P99: 42ms
Uptime: 99.9%
Active Agents: 7
Celery Workers: 8/8
Daily Reports: 150+
Docker, Kubernetes, Nginx, Jenkins, Terraform, Prometheus, Grafana, Flower, Supervisord, AWS
The Journey

Ripunjay Singh

AI Engineer & Systems Architect

Experience

Visa2fly · AI Engineer
Dec 2024 → Present
Multi-Agent Systems, Custom LLMs, Production ML
Visa2fly · Backend Engineer
Dec 2023 → Nov 2024
SpringBoot, Microservices, CI/CD
Visa2fly · SpringBoot Intern
Jun 2023 → Nov 2023
API Development, Agile

Education

Bennett University · B.Tech Computer Science
2022 → 2026
8.54 CGPA

Recognition

MLH Hackathon Winner — Asia-Pacific
AWS Certified Cloud Practitioner
2 Research Publications in Distributed AI
Microsoft Learn Student Ambassador
Available for Hire

Let's Build Something That Ships.

You've seen the infrastructure. I architect custom, production-grade AI systems — multi-agent pipelines, fine-tuned LLMs, and scalable cloud infrastructure. Open to relocation and remote opportunities worldwide.

10K+ Daily API Requests
7 AI Agents in Prod
AWS Certified
Zero Vendor Lock-in