AI Stack Layer · 3 of 8

AI Frameworks

The orchestration layer above raw model APIs — chains, agents, RAG pipelines, prompt management, evaluation. The plumbing that turns "call an LLM" into a real product.

Quick Facts

At a Glance

Basic Concepts

  • Chain: a sequence of LLM + tool steps where each output feeds the next.
  • Agent: an LLM that picks tools and takes actions in a loop until done.
  • RAG (Retrieval-Augmented Generation): fetch relevant docs → stuff into the prompt → ask the model.
  • Tool calling: the LLM decides which function to invoke; the framework runs it and feeds results back.
  • Memory: conversation history (short-term) and stored facts (long-term).
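The "chain" concept above can be sketched without any framework at all. This is a minimal illustration, not any library's API: `call_llm` is a hypothetical stand-in for a real model call, and the step functions are invented for the example.

```python
# Framework-free sketch of a chain: each step's output feeds the next.
# call_llm is a stand-in for a real model API; here it echoes a canned reply.

def call_llm(prompt: str) -> str:
    return f"[model reply to: {prompt}]"

def summarize(text: str) -> str:
    return call_llm(f"Summarize in one sentence: {text}")

def translate(text: str) -> str:
    return call_llm(f"Translate to French: {text}")

def chain(text: str) -> str:
    # Step 1's output becomes step 2's input — that is the whole pattern.
    return translate(summarize(text))

print(chain("LLM frameworks wire model calls together."))
```

Frameworks add the surrounding machinery (retries, streaming, tracing), but the composition itself is this simple.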
Landscape

The Major Frameworks

Framework | Language | Best for
LangChain | Python, TS | Most popular; chains, agents, integrations to ~everything.
LangGraph | Python, TS | Stateful, graph-based agent workflows (LangChain's successor for agents).
LlamaIndex | Python, TS | RAG-first — ingestion, indexing, query engines.
Haystack | Python | Production RAG & search pipelines (deepset).
Semantic Kernel | C#, Python, Java | Microsoft's enterprise orchestrator with planners.
DSPy | Python | Programmatic prompts that auto-optimize against metrics.
Anthropic Agent SDK | Python, TS | Claude-native agents with built-in tool use & subagents.
OpenAI Agents SDK | Python | OpenAI-native agent framework with handoffs & guardrails.
Vercel AI SDK | TypeScript | Streaming UI helpers for Next.js / React.
CrewAI / AutoGen | Python | Multi-agent collaboration patterns.
Mechanics

Core Patterns

Retrieval-Augmented Generation (RAG)

The standard pattern for "ask my docs":

  1. Ingest: chunk documents, generate embeddings, store in a vector DB.
  2. Retrieve: embed the user's question, find top-K similar chunks.
  3. Augment: stuff those chunks into the prompt as context.
  4. Generate: ask the LLM to answer using only that context.

Modern RAG adds re-ranking, hybrid (keyword + vector) search, query rewriting, and metadata filters.
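The four steps can be shown end to end with toy bag-of-words "embeddings" so the example runs offline. This is a sketch, not a real system: production RAG would use a dense embedding model and a vector DB, and the documents here are invented.

```python
# Toy RAG pipeline — same four steps as the real thing, minus the model calls.
import math
import re
from collections import Counter

docs = [
    "The refund window is 30 days from purchase.",
    "Support is available Monday through Friday.",
    "Shipping takes 5 business days within the US.",
]

def embed(text: str) -> Counter:
    # Bag-of-words stand-in for a dense embedding model.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# 1. Ingest: embed each chunk (here, one doc = one chunk).
index = [(doc, embed(doc)) for doc in docs]

# 2. Retrieve: embed the question, take the top-K most similar chunks.
question = "How many days do I have to get a refund?"
q = embed(question)
top_k = sorted(index, key=lambda pair: cosine(q, pair[1]), reverse=True)[:2]

# 3. Augment: stuff the retrieved chunks into the prompt as context.
context = "\n".join(doc for doc, _ in top_k)
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"

# 4. Generate: send `prompt` to the LLM (omitted here).
print(prompt)
```

Swapping `embed` for a real embedding model and the final `print` for a model call turns this into the standard pipeline; re-ranking and hybrid search slot in between steps 2 and 3.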

Tool Use & Function Calling

You describe a function to the model (name, description, JSON parameter schema). The model decides when to call it and emits the arguments as JSON; the framework runs the function, feeds the result back, and the model continues. Used for:

  • Real-time data (weather, stock prices).
  • Database queries.
  • Math & computation (the LLM is bad at it; Python isn't).
  • External APIs (Stripe, Slack, GitHub).
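The framework's half of this handshake is mostly JSON plumbing. Below is a hedged sketch with the model's output hard-coded: `get_weather`, the schema, and the model reply are all invented for illustration, and a real loop would parse the tool call out of the provider's API response.

```python
# Sketch of the framework side of tool calling (model output is hard-coded).
import json

def get_weather(city: str) -> str:
    return f"18°C and clear in {city}"  # stand-in for a real weather API

TOOLS = {"get_weather": get_weather}

# What you send the model: a name + JSON schema describing the function.
tool_schema = {
    "name": "get_weather",
    "description": "Current weather for a city",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}

# What the model emits: a tool name plus JSON arguments.
model_output = '{"tool": "get_weather", "arguments": {"city": "Oslo"}}'

call = json.loads(model_output)
result = TOOLS[call["tool"]](**call["arguments"])

# The framework feeds `result` back to the model for the final answer.
print(result)
```

The dispatch table (`TOOLS`) plus schema validation is essentially what every framework's "tool" abstraction wraps.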
Agents & Loops

An agent is an LLM in a loop: think → act → observe → repeat. The framework handles:

  • Planning — break the task into steps.
  • Tool selection — pick the right action.
  • State management — what's been tried, what worked.
  • Termination — when to stop (max steps, completion check).

Newer frameworks (LangGraph, Anthropic Agent SDK) model agents as explicit state machines instead of opaque loops.
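The loop itself fits in a few lines. In this sketch the "model" is a hard-coded script so the example runs offline; the tools, the script, and `max_steps` are invented, and a real agent would ask the LLM which tool to use based on the observations so far.

```python
# Minimal agent loop: think → act → observe → repeat, with termination.

def tool_search(query: str) -> str:
    return "Paris"  # stub tool; a real one would hit a search API

def tool_finish(answer: str) -> str:
    return answer

TOOLS = {"search": tool_search, "finish": tool_finish}

# Scripted decisions standing in for LLM calls.
SCRIPT = [("search", "capital of France"), ("finish", "The capital is Paris.")]

def run_agent(max_steps: int = 5) -> str:
    observations = []                      # state management
    for step in range(max_steps):          # termination: step cap
        tool, arg = SCRIPT[step]           # "think": pick the next action
        result = TOOLS[tool](arg)          # "act": run the tool
        observations.append(result)        # "observe": record the outcome
        if tool == "finish":               # termination: completion check
            return result
    return "gave up after max_steps"

print(run_agent())
```

Graph-based frameworks replace the implicit `for` loop with named nodes and edges, which makes the same control flow inspectable and resumable.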

Prompt Management & Evaluation
  • Prompt templates — versioned, parameterized, often kept outside code.
  • Evals — golden datasets + scoring (LLM-as-judge, human, exact match).
  • Tracing — Langfuse, LangSmith, Weave, Phoenix capture every prompt & response.
  • A/B testing prompts & models in production.
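An eval is just a labeled dataset plus a scoring function run over model outputs. A minimal exact-match version, with `fake_model` and the golden examples invented as stand-ins for a real model call and a real dataset:

```python
# Tiny eval harness: score a "model" against a golden dataset by exact match.

golden = [
    {"input": "2+2", "expected": "4"},
    {"input": "capital of France", "expected": "Paris"},
    {"input": "3*3", "expected": "9"},
]

def fake_model(prompt: str) -> str:
    # Stand-in for an LLM call; deliberately wrong on the last case.
    canned = {"2+2": "4", "capital of France": "Paris", "3*3": "6"}
    return canned[prompt]

def exact_match(pred: str, expected: str) -> bool:
    return pred.strip() == expected.strip()

scores = [exact_match(fake_model(ex["input"]), ex["expected"]) for ex in golden]
accuracy = sum(scores) / len(scores)
print(f"exact-match accuracy: {accuracy:.2f}")
```

LLM-as-judge evals keep the same harness but replace `exact_match` with a second model call that grades the answer.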
Pick a Framework
If you want… | Reach for
Maximum integrations & ecosystem | LangChain
Graph-based stateful agents | LangGraph
RAG done well, fast | LlamaIndex
Microsoft / .NET shop | Semantic Kernel
Provider-native agent | Anthropic / OpenAI Agent SDK
Streaming UI on the web | Vercel AI SDK
Auto-optimized prompts | DSPy