Hamad Khan

Building production
agentic AI systems.

I'm Hamad Khan — an AI/ML engineer shipping multi-agent backends, Agentic RAG, and observable LLM services. Peer-reviewed researcher in low-resource NLP.

CurrentlyAI/ML Engineer@Cointegration.ai·part of Capacity Analytics
FastAPIGeminiLangGraphQdrantCloud Run
0+
Years shipping
0+
Projects
3.63
GPA / 4.0
0
Peer-reviewed
01 / Selected Work

Featured projects.

Production AI, client web work, peer-reviewed research, and open-source dev tools — filter by what you're here for.

Self-Enroll AI
Live
Production

Self-Enroll AI

Conversational enrolment assistant for higher-ed. Five role-scoped agents under a LangGraph supervisor — replacing a static contact form with a stateful, observable assistant.

FastAPILangGraphGemini 2.5 FlashFirebase Auth+5
details
Pension Review
Live
Production

Pension Review

Public pension-eligibility assistant. Natural-language input parsed by Gemini, district lookup served by Typesense, FastAPI returns the verdict with citations.

FastAPIGemini 2.5 FlashTypesenseFirestore+2
details
In progress
Production

LawSync

Agentic RAG over Islamabad High Court rulings. Bounded ReAct loop, hybrid semantic-plus-fuzzy search, and a verifier pass enforcing citations on every answer. Private beta with a legal-research firm.

Gemini 2.5 Flashgemini-embedding-001QdrantLangGraph+3
details
Pension Calculations
Live
Production

Pension Calculations

Public-facing calculator paired with the Pension Review backend. Clean, fast Vercel deployment.

Next.jsTypeScriptTailwindVercel
details
For clients & freelance partners

Have something to build?

Production AI backends, RAG systems, custom chatbots, or full-stack web apps. One-off, ongoing, or agency collaborations.

02 / Experience

Where I've shipped.

Dec 2024 — Present
Full-time

AI/ML Engineer

Cointegration.ai · part of Capacity Analytics · Pakistan

Shipping production AI backends across three flagship products: Self-Enroll AI, Pension Review, and LawSync (IHC Agentic RAG).

  • Built a multi-agent FastAPI backend on LangGraph with five role-scoped agents (RBAC) under a supervisor graph, calling Gemini via google-genai with context caching.
  • Shipped an Agentic RAG service over IHC judgments — bounded ReAct loop, hybrid semantic+fuzzy search, gemini-embedding-001 in Qdrant, and a verifier pass that emits numbered citations.
  • Secured production APIs with API-Key auth, slowapi rate limiting, Pydantic validation; LangGraph checkpoints in Redis (Upstash); users in Firebase Auth + Firestore.
  • Instrumented end-to-end tracing in OpenTelemetry + Grafana Cloud + Langfuse with component-level versioning and per-model cost tracking.
  • Wrote pytest suites and offline eval datasets (regression, refusal, low-relevance) for the legal RAG agent. Shipped all services as Docker containers on Cloud Run.
FastAPILangGraphGemini 2.5 Flashgoogle-genai SDKQdrantTypesenseRedis / UpstashFirebase AuthFirestorePydanticOpenTelemetryGrafana CloudLangfuseDockerCloud Runpytest
May 2024 — Nov 2024
Remote

AI/ML Engineer (Remote)

Revolutionary AI Network · Texas, USA

Built an AI-powered American College Test (ACT) preparation tool for US students.

  • Led creation of an NLP dataset digitised from ACT test books — questions, answers, and rationales.
  • Fine-tuned a language model with LangChain, OpenAI API, and vector databases.
  • Engineered an interactive chatbot to let students review answers and practice new questions.
LangChainOpenAI APIVector DatabasesNLPEdTech
03 / Skills

Production-grade AI/ML stack.

The tools I reach for to ship reliable, observable AI systems.

01

AI / LLM Engineering

Agentic RAGMulti-agent systemsLangGraphLangChainReAct loopsSupervisor graphsRBACHyDETool-callingQ&ASummarizationFine-tuning
02

LLM Models & APIs

Gemini 2.5 Flashgoogle-genai SDKgemini-embedding-001OpenAIGroqLlama 3.1HuggingFace TransformersTransfer learning
03

Backend Engineering

PythonFastAPIPydantichttpxServer-Sent EventspytestAlgorithms & Data Structures
04

Retrieval & Persistence

QdrantPineconeChromaDBTypesenseHybrid searchRedis / UpstashFirestoreFirebase Auth
05

Cloud & Containers

Google Cloud RunDockerGoogle Cloud Platform
06

Observability & Eval

OpenTelemetryGrafana CloudLangfuseComponent versioningPer-model cost trackingEval datasets
04 / About

Engineer and researcher.

I build production-grade backends for multi-agent systems and Agentic RAG — observable LLM services on FastAPI, Gemini, LangGraph, and Qdrant, deployed to Cloud Run with end-to-end tracing and per-prompt cost tracking.

Peer-reviewed researcher in low-resource NLP (PeerJ Computer Science, 2024) — currently applying for fully funded graduate programs.

Academic

BSc Software Engineering

3.63 / 4.0 — University of Malakand

Published

PeerJ Computer Science · 2024

Pashto NLG with transformers

Production

3+ years shipping AI backends

Multi-agent · RAG · LLM APIs

Stack

FastAPI · Gemini · LangGraph

Qdrant · Cloud Run · Langfuse

Certified
Machine LearningStanford
NLP SpecializationDeepLearning.AI
Claude CodeAnthropic
Languages
EnglishExpert · IELTS 6
UrduExpert
PashtoNative
05 / Research

Peer-reviewed publications.

P/01Published

Pashto poetry generation: deep learning with pre-trained transformers for low-resource languages

PeerJ Computer ScienceVol. 10, Article e2163· Aug 30, 2024
DOI: 10.7717/peerj-cs.2163
Read paper →
P/02In Review

Pashto Poetry Dataset (PPD) for Machine and Deep Learning Applications

Under Review· 2025
Research focus
Low-Resource NLPTransformer ModelsText GenerationPre-trained ModelsPashto Language ProcessingDeep Learning
06 / Contact

Let's build something together.

AI/ML projects, research collaborations, or graduate opportunities — drop a line.

Open to opportunities
  • · Full-time & part-time positions
  • · Freelance & contract work
  • · Master's programs in AI / NLP / CS
Send a message