AI Labs
Small interactive AI experiments. Embeddings, retrieval, and tokenization run in your browser; LLM calls go through a rate-limited server route. Try them — they're real, not write-ups.
Multi-Agent Orchestrator
Three agents — Researcher, Writer, Critic — collaborate in sequence on any task you give them. Watch each step stream live.
Tool-Use Simulator
An agent reasons and calls tools — calculator, weather, search — to answer a question. Every THOUGHT → TOOL → RESULT step is visible.
Agent Trace Viewer
What an agent run looks like under the hood — every LLM call and tool invocation with its duration, cost, and tokens. Inspect a recorded trace.
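As a rough illustration of what such a trace could contain, the sketch below models each step as a record with a duration, cost, and token count, then totals them. The `Step` fields, the `summarize` helper, and the sample values are all hypothetical and are not taken from the viewer's actual trace format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One entry in a recorded agent run (hypothetical schema)."""
    kind: str          # "llm" for a model call, "tool" for a tool invocation
    name: str
    duration_ms: float
    cost_usd: float
    tokens: int


def summarize(trace):
    """Aggregate per-step metrics into run-level totals."""
    return {
        "steps": len(trace),
        "llm_calls": sum(1 for s in trace if s.kind == "llm"),
        "tool_calls": sum(1 for s in trace if s.kind == "tool"),
        "duration_ms": sum(s.duration_ms for s in trace),
        "cost_usd": sum(s.cost_usd for s in trace),
        "tokens": sum(s.tokens for s in trace),
    }


# Illustrative values only; not measurements from a real run.
example_trace = [
    Step("llm", "plan", 800.0, 0.0020, 420),
    Step("tool", "calculator", 120.0, 0.0, 0),
    Step("llm", "answer", 650.0, 0.0015, 310),
]
summary = summarize(example_trace)
```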
RAG Playground
Paste any document, ask a question, and watch retrieval + generation work end-to-end — chunking, embedding, and similarity all run in your browser.
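The retrieval half of that pipeline can be sketched in a few lines. This is a minimal illustration, assuming fixed-size word chunks and a bag-of-words vector as a stand-in for a real embedding model; the playground itself may chunk and embed differently, and every function name here is hypothetical.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=40):
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text):
    """Toy embedding: lowercase word counts (a stand-in for a neural model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(document, question, k=2, max_words=40):
    """Return the k chunks most similar to the question, best first."""
    query = embed(question)
    scored = [(similarity(embed(c), query), c) for c in chunk_text(document, max_words)]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


doc = "The cat sat on the mat. Paris is the capital of France."
top = retrieve(doc, "What is the capital of France?", k=1, max_words=6)
```

In a full RAG loop the retrieved chunks would then be placed in the prompt sent to the LLM; that generation step is omitted here.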
Embeddings Playground
Compare any two phrases by meaning. See cosine similarity from MiniLM running entirely on-device, no server round-trip.
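For reference, cosine similarity is the dot product of two vectors divided by the product of their lengths. The sketch below computes it for plain Python lists; it assumes the embedding vectors have already been produced (by MiniLM or any other model), and the function name is illustrative.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here: a zero vector is similar to nothing
    return dot / (norm_a * norm_b)
```

Values near 1 indicate vectors pointing in nearly the same direction, values near 0 indicate unrelated directions, and negative values indicate opposing directions.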
Prompt Cost Calculator
Type a prompt, set the expected output length and monthly call volume, and see per-call and per-month cost across seven OpenRouter models, sorted cheapest first.
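The arithmetic behind such a calculator is simple enough to sketch. The example assumes per-million-token pricing with separate input and output rates; the model names and prices below are placeholders, not real OpenRouter listings.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    name: str
    input_per_million: float   # USD per 1M input tokens
    output_per_million: float  # USD per 1M output tokens


def cost_per_call(model, input_tokens, output_tokens):
    """Cost in USD of one call with the given token counts."""
    return (input_tokens * model.input_per_million
            + output_tokens * model.output_per_million) / 1_000_000


def rank_models(models, input_tokens, output_tokens, calls_per_month):
    """Return (name, per_call, per_month) rows sorted cheapest first."""
    rows = []
    for m in models:
        per_call = cost_per_call(m, input_tokens, output_tokens)
        rows.append((m.name, per_call, per_call * calls_per_month))
    return sorted(rows, key=lambda row: row[1])


# Placeholder prices for illustration only.
models = [
    ModelPrice("model-a", 3.0, 15.0),
    ModelPrice("model-b", 0.5, 1.5),
]
rows = rank_models(models, input_tokens=1000, output_tokens=500, calls_per_month=10_000)
```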
Prompt Cache Simulator
How much does prompt caching actually save? Adjust the sliders (system prompt size, number of calls, output length) and see the break-even point at any volume.
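One way to model the break-even is sketched below. It assumes the first call pays a premium to write the system prompt to the cache and every later call reads it at a discount; the default multipliers (1.25x write, 0.10x read) are assumptions that vary by provider, and the model ignores cache expiry and per-call user input. All function names are hypothetical.

```python
import math


def cost_without_cache(system_tokens, calls, output_tokens, in_price, out_price):
    """Total USD when the system prompt is billed in full on every call."""
    return calls * (system_tokens * in_price + output_tokens * out_price) / 1_000_000


def cost_with_cache(system_tokens, calls, output_tokens, in_price, out_price,
                    write_mult=1.25, read_mult=0.10):
    """Total USD when call 1 writes the cache and later calls read from it."""
    if calls == 0:
        return 0.0
    write = system_tokens * in_price * write_mult
    reads = (calls - 1) * system_tokens * in_price * read_mult
    output = calls * output_tokens * out_price  # output cost is unaffected by caching
    return (write + reads + output) / 1_000_000


def break_even_calls(write_mult=1.25, read_mult=0.10):
    """Smallest call count n where write + (n - 1) * read <= n, or None if never."""
    if read_mult >= 1:
        return None
    return max(1, math.ceil((write_mult - read_mult) / (1 - read_mult)))
```

Under these assumed multipliers a single call costs more with caching than without, and caching pays off from the second call onward. Output cost is identical in both cases, so it changes the totals but not the break-even.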
Tokenization Visualizer
See how the GPT-4 tokenizer (cl100k_base) splits any text into tokens, with running counts for characters, tokens, and chars-per-token.