Full-Stack AI Engineer

Buildingintelligentsystemsthatship.

Specialized in LLMs, RAG, AI agents, and production-ready ML systems. From prototype to production.

About

IturncomplexAIintoproductspeopleuse.

I'm a full-stack AI engineer based in Chennai, India. I specialize in building production-grade AI systems — from fine-tuned LLMs and RAG pipelines to multi-agent architectures and real-time ML infrastructure.

My approach is engineering-first: every AI feature I ship comes with evaluation harnesses, observability, and a clear path from prototype to production. No demos that break in the real world.

20+
Enterprise AI Systems
$500k+
Cloud Costs Saved
95%
RAG Accuracy @ Scale
10M+
Vectors Indexed
PythonTypeScriptJavaScriptSQLGoRustReactNext.jsTailwind CSSFramer MotionViteNode.jsNestJSFastAPIExpressLangChainLangGraphOpenAIGoogle GeminiAnthropicOllamavLLMHugging FaceTransformersSentenceTransformersPyTorchTensorFlowONNXFAISSPineconeWeaviateQdrantChromaGraphQLRESTgRPCWebSocketsPostgreSQLMySQLRedisMongoDBPrismaSQLAlchemyDockerKubernetesTerraformGitHub ActionsGitLab CINginxLinuxAWSGoogle CloudMicrosoft AzureS3CloudFrontLambdaECSECRRDSVPCCloudWatchIAMRabbitMQKafkaCeleryBullMQOpenTelemetryPrometheusGrafanaSentryPytestJestPlaywrightElectronFlutterOAuth2JWTPythonTypeScriptJavaScriptSQLGoRustReactNext.jsTailwind CSSFramer MotionViteNode.jsNestJSFastAPIExpressLangChainLangGraphOpenAIGoogle GeminiAnthropicOllamavLLMHugging FaceTransformersSentenceTransformersPyTorchTensorFlowONNXFAISSPineconeWeaviateQdrantChromaGraphQLRESTgRPCWebSocketsPostgreSQLMySQLRedisMongoDBPrismaSQLAlchemyDockerKubernetesTerraformGitHub ActionsGitLab CINginxLinuxAWSGoogle CloudMicrosoft AzureS3CloudFrontLambdaECSECRRDSVPCCloudWatchIAMRabbitMQKafkaCeleryBullMQOpenTelemetryPrometheusGrafanaSentryPytestJestPlaywrightElectronFlutterOAuth2JWT

Selected Work

AIProducts&Systems

Realtime Voice AI Agent
Voice AI

Realtime Voice AI Agent

Low-latency voice assistant with interrupt, tool calls, and call-quality UX

  • Barge-in + partial response streaming
  • Latency budget + adaptive buffering
  • Tool calling for real actions (CRM/tickets/payments)
  • PII redaction + transcript storage policy
  • Human handoff when confidence drops
PythonTypeScriptWebRTCFastAPIWebSocketsRedisPostgreSQLOpenAIGoogle GeminiAzure
Multimodal Document Intelligence
Document AI

Multimodal Document Intelligence

Tables, forms, stamps, signatures—structured extraction with confidence scoring

  • Layout-aware extraction (tables/forms)
  • Field-level confidence + audit trails
  • Human review workflow + corrections learning
  • Schema validation + versioned extraction
  • PII masking + secure storage
PythonFastAPIPostgreSQLRedisDockerTransformersOpenCVOCRLangChainOpenAI
LLM Observability & Evals Platform
AI Reliability

LLM Observability & Evals Platform

Trace prompts, tools, retrieval, cost, and quality—ship safe AI with confidence

  • Traces across tools + RAG steps
  • Regression eval suite + gold datasets
  • Safety checks + PII detection
  • Cost dashboards + token budgets
  • SLOs + alerting for quality drops
PythonTypeScriptOpenTelemetryPrometheusGrafanaFastAPIPostgreSQLRedisKubernetes
Secure Enterprise RAG with Permissions
Retrieval AI

Secure Enterprise RAG with Permissions

RAG that respects RBAC/ABAC and never leaks restricted documents

  • RBAC/ABAC-aware chunk retrieval
  • Tenant isolation + encrypted storage
  • Policy tests for leakage prevention
  • Audit logs for every access
  • Hybrid retrieval (dense + keyword)
PythonPostgreSQLRedisOpenAIPineconeFastAPIOAuth2JWTAWS
Real-time Recommendation Engine
Recommender Systems

Real-time Recommendation Engine

Personalized ranking with online + offline learning loops

  • Candidate generation + ranking pipeline
  • Feature store + realtime updates
  • A/B testing + metrics attribution
  • Cold-start strategy (content-based)
  • Explainability signals for trust
PythonPyTorchKafkaRedisPostgreSQLAirflowFastAPIDocker
Fraud & Anomaly Detection System
ML Systems

Fraud & Anomaly Detection System

Streaming anomaly detection with explainable risk scoring

  • Streaming pipeline with near-real-time scoring
  • Hybrid approach: rules + ML anomalies
  • Explainable risk factors for investigators
  • Model drift monitoring + retraining triggers
  • Case management workflow integration
Pythonscikit-learnPyTorchKafkaRedisPostgreSQLFastAPIAWS
LLM Inference Optimization (vLLM + Quantization)
AI Performance

LLM Inference Optimization (vLLM + Quantization)

Lower latency and cost with batching, caching, and quantized serving

  • vLLM serving with continuous batching
  • Quantization + performance benchmarks
  • Autoscaling + load tests
  • Cost per request tracking
  • Fallback routing to cheaper models
PythonvLLMONNXCUDADockerKubernetesPrometheusGrafanaAWS
Demand Forecasting & Inventory Optimizer
Time Series ML

Demand Forecasting & Inventory Optimizer

Forecast demand and optimize reorder points with constraints

  • Forecasting with seasonality + events
  • Constraint-based optimization (lead times)
  • Scenario planning dashboard
  • Backtesting + error analysis
  • API deployment with monitoring
PythonPyTorchPandasFastAPIPostgreSQLDockerAWS
Domain Copilot for Teams
Enterprise AI

Domain Copilot for Teams

A secure internal copilot that knows your docs, tickets, and codebase

  • Grounded answers with citations
  • Jira/Slack/Gmail integrations
  • Role-based access controls
  • Admin analytics + prompt templates
  • Incident summary + action items
TypeScriptNext.jsNode.jsPythonGraphQLPostgreSQLRedisOpenAIGoogle Cloud
Computer-Use Automation Agent
Agentic Automation

Computer-Use Automation Agent

Agent that operates web apps safely with guardrails and approvals

  • UI automation with approvals + rollback
  • Action logs + screenshot evidence
  • Guardrails to prevent dangerous actions
  • Retry logic + state recovery
  • Workflow templates for repeat tasks
PythonPlaywrightTypeScriptFastAPIRedisPostgreSQLDocker
Knowledge Graph + RAG Hybrid
Knowledge Systems

Knowledge Graph + RAG Hybrid

Graph reasoning + retrieval for better accuracy on complex queries

  • Entity extraction + graph construction
  • Multi-hop reasoning with traversal
  • Graph + text fusion answers
  • Eval suite focused on multi-hop queries
  • Explainable citations + relation paths
PythonNeo4jLangChainOpenAIFastAPIPostgreSQLDocker
Privacy-Preserving AI Pipeline
AI Security

Privacy-Preserving AI Pipeline

PII detection, redaction, encryption, and compliance-ready audit trails

  • PII detection + redaction pipeline
  • Encryption at rest + in transit
  • Retention policies + access auditing
  • Prompt/data boundary enforcement
  • Compliance-friendly reporting
PythonTypeScriptPostgreSQLRedisOAuth2JWTAWSAzure

Services

HowICanHelp

01

AI Product Engineering

Ship end-to-end AI features—RAG, agents, evaluation harnesses, and production MLOps with clear timelines.

02

Full-Stack MVPs

Design, build, and deploy fast MVPs with modern stacks, clean UX, and scalable architecture.

03

Automation & Data Systems

Build reliable pipelines, scrapers, and workflows that reduce cost and increase speed to insight.

04

Architecture & Scalability

Design robust system architecture—monorepos, microservices, caching, queues, and scaling plans that prevent rewrites.

05

API & Backend Engineering

Build secure APIs (REST/GraphQL), auth, RBAC, payments, and integrations with clean contracts and observability.

06

Cloud & DevOps Delivery

Productionize on AWS with CI/CD, Docker, monitoring, cost controls, and reliable release workflows.

07

AI Cost & Performance Optimization

Reduce token spend and latency with smart routing, caching, prompt refactors, batch jobs, and eval-driven tuning.

08

Product UX Engineering

Turn complex flows into simple UX—design systems, responsive UI, and interaction polish that boosts adoption.

09

Security & Compliance Basics

Harden apps with least-privilege access, secrets management, audit logs, and compliance-ready defaults.

10

Data & Analytics Foundations

Set up event tracking, dashboards, and metrics so you can measure activation, retention, and ROI from day one.

11

Codebase Rescue & Refactoring

Fix broken builds, reduce tech debt, and refactor safely with tests, linting, and step-by-step rollout plans.

12

Desktop & Mobile Apps

Ship cross-platform apps (Electron/Flutter) with offline support, sync, and production-grade packaging.

13

AI Integrations & Tooling

Integrate LLMs into your product—function calling, toolchains, vector DBs, and secure prompt/data boundaries.

14

RAG Quality & Evaluation

Build eval suites for retrieval and answers—gold datasets, scoring, regression tests, and continuous quality tracking.

15

Search & Recommendation Systems

Improve discovery with ranking, semantic search, personalization, and measurable relevance experiments.

16

Workflow Products for Teams

Build internal tools—admin panels, ops dashboards, approvals, and role-based flows that save hours every week.

17

Payments & Billing Systems

Implement subscriptions, invoicing, webhooks, retries, and dunning with clean ledgers and auditability.

18

Observability & Reliability

Add logs, metrics, tracing, SLOs, and incident-ready monitoring so production issues are found fast.

19

Performance Engineering

Speed up apps with profiling, caching strategies, DB tuning, and frontend performance optimization.

20

Data Modeling & Warehousing

Design schemas, ETL/ELT, and warehouses so your analytics stay accurate as your product scales.

21

Testing & QA Automation

Set up unit/integration/e2e tests, test data, and CI gates that keep releases stable.

22

Documentation & Developer Experience

Create clean docs, SDKs, and onboarding that make teams faster and reduce support load.

23

Migration & Modernization

Move legacy systems to modern stacks with minimal downtime—data migration, phased rollout, and risk control.

24

AI Safety & Guardrails

Add moderation, PII redaction, policy checks, and jailbreak-resistant patterns for safer AI features.

Why Work With Me

TheDifference

no fluff

Clear scope, fast delivery

You get honest estimates, reliable timelines, and steady progress updates—no surprises.

engineering-first

Systems that actually ship

I optimize for production reality: reliability, observability, and performance from day one.

quality bar

Evaluation baked-in

Every AI feature ships with measurable evals for accuracy, safety, and cost.

product thinking

Built for users, not demos

I focus on workflows, UX clarity, and adoption—so the feature is used daily, not just showcased.

cost control

Lower cloud + AI spend

I reduce waste with caching, batching, smart routing, and lean infra choices that keep margins healthy.

security baseline

Safe by default

RBAC, audit logs, secrets management, and sensible guardrails—so you can sell to serious customers.

ownership

Zero hand-holding required

I can run discovery → build → deploy end-to-end, and I document so your team can maintain it confidently.

speed + stability

Move fast without breaking

CI/CD, tests, and release gates make shipping frequent while keeping production calm.

integration-ready

Plays well with your stack

Payments, CRMs, emails, WhatsApp/SMS, analytics, and third-party APIs—integrated cleanly with retries and webhooks.

debugging muscle

I fix what others can’t

Build failures, flaky deployments, slow queries, messy codebases—I diagnose fast and stabilize the system.

future-proof

Architecture that won’t trap you

Modular design, clean contracts, and migration paths—so you can scale features and teams without rewrites.

founder-friendly

Startup pace, enterprise discipline

You get founder-speed execution with production-grade engineering standards and clear decision tradeoffs.