Applied AI Engineer

AyushKumar.

I build AI systems and infrastructure that scale — from fine-tuning models rooted in mathematical understanding, to production-grade distributed systems.

>
Traditional ML·Deep Learning·Computer Vision·LLMs·MLOps
Ayush Kumar
hey, welcome! 👋
AI Engineer
{ML·DL·CV·LLMs}
Scroll
Philosophy

"My standard as an engineer is to be the highest-value hire in any technical evaluation — not by chasing frameworks, but by going deeper than most are willing to: into the mathematics, the infrastructure, and the edge cases that separate good from irreplaceable."

— Engineering philosophy, Ayush Kumar
0%
Token Cost
Prompt optimization
<0%
False Positives
CV deepfake pipeline
<0ms
Voice Latency
V2V agent
0%
Deploy Cycles
CI/CD automation
+0%
Output Reliability
Structured outputs
0%
Debug Time
LLMOps observability
stack · live
// Section 02 — Capabilities

What I Build

// Real production systems, not toy demos

LLM & Generative AI

Production RAG pipelines, fine-tuned LLMs (LoRA/QLoRA), multi-turn agents with persistent memory. Optimized for token cost and p95 latency.

Voice agent with <800ms V2V latency

LangGraphLangChainGeminiOpenAIPineconeChromaDB

Multi-Agent Systems

DAG-based agent orchestration where specialized AI agents collaborate, debate, and route with their own memory, tools, and decision boundaries.

5-agent Competitor Intelligence platform

LangGraphCrewAITool CallingDAGs

Traditional ML & Statistical Learning

Linear/Logistic Regression, SVMs, Gradient Boosting (XGBoost/LightGBM), Ensemble methods, Bayesian models, k-NN, Decision Trees, and Random Forests. I understand the mathematics underneath, not just the API calls.

Feature engineering, cross-validation, hyperparameter tuning

Scikit-LearnXGBoostLightGBMNumPyPandasStatsmodels

Deep Learning

CNNs, RNNs, LSTMs, Transformers from scratch. Backpropagation, weight initialization, batch normalization, regularization — built and trained, not just loaded from a checkpoint.

Fine-tuned DistilBERT on SST-2 with self-healing DAG

PyTorchTensorFlowHugging FaceLoRA / QLoRA

Computer Vision

Object detection, image classification, semantic segmentation, and deepfake/proxy detection. Built CV pipelines for real-world production, not just Kaggle.

Deepfake detection with <2% false positives in production

PyTorchOpenCVDetectron2YOLO

MLOps & Infrastructure

I own the full delivery lifecycle: IaC, containerization, CI/CD, monitoring, and rollback. Every model I ship has metrics, alerts, and a path back.

Multi-node AWS deployment with automated CI/CD

TerraformAnsibleDockerKubernetesAWSGitHub ActionsPrometheusGrafana
workflow · 5 stages
// Section 03 — Method

How I Ship AI

Problem → Production, every time.

🎯

STEP 01

Problem Definition

Scope · Success Criteria · Constraints · Stakeholders

Before any model is chosen, I reverse-engineer from the business outcome. What does success look like numerically? What are the latency, cost, and accuracy constraints? Who are the stakeholders and what decisions will this system power? I write a one-page problem brief that everyone can sign off on — if that's unclear, no model will fix it.

  • Define evaluation metrics before touching data (accuracy, F1, BLEU, latency, cost/token)
  • Map the cost of failure: false positives vs false negatives have very different business consequences
  • Identify data availability, labeling feasibility, and legal/privacy constraints upfront
  • Set a baseline: can a rule-based system solve 80% of this? If so, start there.

Projects

Production systems, not toy demos

ayushk1233
@ayushk1233GitHub
Flagship

Competitor Intelligence Monitor

AI agent that tracks competitors automatically

Distributed AI platform using LangGraph, Gemini 1.5, and RAG to autonomously monitor competitors. Processes ~30K tokens/run extracting pricing shifts, hiring trends, and generating sales battlecards.

p95 ~70s60% debug time ↓30K tokens/run
LangGraphGemini 1.5RAGAWS EKSTerraformCeleryRedisPrometheus
ayushk1233
@ayushk1233GitHub

End-to-End MLOps Deployment

Full pipeline: Training → Deployment → Monitoring

Production-ready ML pipeline with Terraform, Ansible, Docker, and FastAPI on multi-node AWS with Nagios monitoring and automated CI/CD via GitHub Actions.

Multi-nodeAuto CI/CD
FastAPIDockerTerraformAnsibleNagiosAWSGitHub Actions
ayushk1233
@ayushk1233GitHub

K8s Flask + MongoDB Deployment

Production-grade Kubernetes microservice setup

Kubernetes deployment with StatefulSets, HPA autoscaling, persistent volumes, secure auth, and full microservice architecture.

Auto-scalingPersistent Volumes
KubernetesFlaskMongoDBStatefulSetsHPA
ayushk1233
@ayushk1233GitHub

Financial Advisor (CrewAI)

Multi-agent system for financial decisions

Multi-agent orchestration using CrewAI for automated financial analysis and decision-making workflows.

Multi-agent
CrewAILangChainPython
ayushk1233
@ayushk1233GitHub

Multi-Agent Debate System

AI agents debate with memory + context

DAG-based reasoning system where AI agents debate topics with persistent memory and contextual awareness using LangGraph.

Context-aware
LangGraphPythonDAG
ayushk1233
@ayushk1233GitHub

Self-Healing Classification DAG

Confidence-based fallback pipeline

Fine-tuned DistilBERT (SST-2) with confidence-based fallback to zero-shot backup model and human-in-the-loop corrections.

Self-healingHITL
DistilBERTHugging FaceZero-shot
@app.post("/predict")async def predict(req): result = await model(req) return {"score": result}
ayushk1233
@ayushk1233GitHub

Backend API Challenge

Enterprise-grade FastAPI backend

Production FastAPI backend with CI/CD pipelines and secure infrastructure.

Enterprise-grade
FastAPICI/CDDocker
$ ./run --model llama-3> loading weights...> ready ▮
ayushk1233
@ayushk1233GitHub

Local CLI Chatbot

Runs HF models locally on Apple Silicon

CLI chatbot running Hugging Face models locally, optimized for Apple Silicon with minimal memory footprint.

Optimized
Hugging FacePythonApple Silicon

// SKILLS UNIVERSE

Tech Stack

Drag to rotate · Hover any tech to highlight · Auto-rotates when idle

Drag to explore the skills universe
3
Languages
11
AI / ML
6
Backend
6
Cloud / DevOps
2
Observability

Experience

Where I've built production AI systems

Oct 2025 – Present

Applied AI Engineer Intern

Banao Technologies

  • Built multilingual AI interview platform — reduced token cost by 40%
  • Designed CV-based deepfake detection with <2% false positives
  • Built stateful voice agent (LangGraph + Gemini) with <800ms latency
  • Built RAG pipeline with Pinecone for cross-session memory
  • Deployed microservices via FastAPI + Docker + AWS ECS
  • Automated CI/CD → reduced deployment time by 25%
Aug 2025 – Sept 2025

AI & Decentralized Systems Intern

BlockseBlock

  • Ran local LLMs using Ollama on edge hardware
  • Fine-tuned models with LoRA / QLoRA — hosted weights on Hugging Face
  • Built LangChain workflows with structured JSON outputs
  • Improved structured output reliability by 35%
Available for opportunities
// Section 06 — Contact

Let's build somethingproduction-grade.

Looking for roles at the intersection of AI engineering and systems architecture. Open to complex distributed AI challenges.

Phagwara, Punjab, India

Ayush Kumar · 2026 · Phagwara, India