Free forever · Token & AI-spend tracking

See your team's AI spend.
Then cut it.

CDLX Token Tracker is a free background agent that monitors your team's token usage and True Cost — what you'd pay at full-freight API rates instead of a subsidized Max plan — across every tool, project, and member. When you're ready to bring spend down, Optimize — our upcoming MCP server — cuts agent tokens up to 40% — same agent, no workflow changes.

Free forever — full team dashboard, unlimited members
A background agent that tracks token usage & True Cost across tools, projects, and people
5-minute setup — install the CLI, run cdlx init
Weekly & monthly email digests with the insights that matter
Coming soon: Optimize cuts agent tokens up to 40% with an MCP server — join the waitlist

Start tracking — free Explore Track

Works with the agent you already use

Claude Code

Cursor

GitHub Copilot

Codex

Gemini CLI

The cost problem

The agentic free lunch is over.

Anthropic moved programmatic Claude usage off subscription pools onto full API rates. GitHub Copilot moved agent mode to a metered pool. The subsidies that were quietly propping up headless agent economics are gone.

Stanford research found ~70% of agent tokens are waste — re-reading files, exploring dead ends, snowballing context. A single SWE-Bench Pro task on Opus 4.7 can burn $10–$15 in tokens for one PR.

The teams that thrive in the next phase won't have the biggest budgets. They'll be the ones who treat tokens like a resource.

See the benchmark

~70%

of agent tokens are waste — re-grep'ing, re-reading, exploring dead ends. (Stanford)

$10–$15

API tokens burned per complex SWE-Bench Pro task on Opus 4.7.

25×

effective cost increase for teams running claude -p on subsidized Max plans after the June 2026 shift.

Where to start

First see your spend. Then cut it.

Two steps to control your AI bill — start free, optimize when you're ready.

Start here · Free

Track your spend

A free background agent monitors token usage and True Cost across every tool, project, and member — and emails you weekly digests on exactly what's driving the bill. Five-minute setup, unlimited team members.

Explore Track

Optimize · Coming soon

Cut your spend

Optimize plugs into your agent and cuts tokens up to 40% — same agent, higher first-pass quality, no workflow changes. Backed by SWE-Bench Pro benchmarks.

See the benchmark

How it works

Persistent context, delivered just-in-time

Index your codebase once. Plug into your agent. Your agent stops paying to re-discover your architecture on every task.

Index once

Connect your repos. Code Lexica builds a deep, persistent model of your architecture, dependencies, and conventions.

Plug into your agent

One MCP config block. Works with Claude Code, Cursor, Copilot, Codex, Gemini CLI — or any MCP client.

Just-in-time context

Your agent asks for exactly what it needs. Scoped, structured answers — not a firehose of repo content.

Learn more about the MCP server

Platform

More than just the MCP server

Code Lexica's persistent context model also powers reports, codebase chat, spec generation, and inline PM workflows.

MCP Server

Coming Soon

Persistent context layer for your coding agent. Lower tokens, higher quality, no agent swap. Part of the upcoming Optimize plan.

Learn more →

Reports

Five report types — architecture, technical debt, user journeys, onboarding, best practices. All grounded in your actual code.

Learn more →

Specs & Chat

Context-aware specs, codebase Q&A, and JIRA/Linear integration. Every answer references source.

Learn more →

Product Workflow

@CodeLexica in your PM tool turns intent into multi-ticket epics and AI-ready specs in minutes.

Learn more →

Why quality also improves

Cheaper tokens are only half the story.

Persistent, structured context doesn't just save money — it makes the agent's first attempt more likely to be correct.

Evidence-backed answers

Every Code Lexica response references exact files and functions. Your agent stops inventing APIs that don't exist.

Standards before generation

Org conventions, preferred libraries, and error-handling patterns ship into the prompt — so the first draft is already idiomatic.

Higher first-pass quality

Persistent, structured context beats grep'd file dumps. Fewer retries, fewer rejected PRs, faster merge.

Who it's for

Built for the people paying the agent bill

Engineering Leaders

CTOs, VPs, Directors

Cut agent token spend up to 40% across the org
Board-ready ROI math on AI tool investment
Higher first-pass quality, fewer retry cycles
Reproducible benchmark data for procurement

Platform Teams

Platform, DevEx, AI Tooling Leads

One MCP endpoint for every agent your team uses
Standardize context across Claude Code, Cursor, Copilot
Enforce org patterns and guardrails per repo
Self-hosted and BYO-cloud deployment options

Developers

Engineers, Tech Leads

Stop your agent paying to re-discover your repo
Get architecture-aware suggestions, not invented APIs
15-minute setup — no editor swap, no SDK
Evidence-backed answers traceable to source

Know your AI spend. Then take control of it.

Start tracking your team's token usage and True Cost free in five minutes — then cut it with the MCP server when you're ready.

Start tracking — free See the benchmark

See your team's AI spend.Then cut it.

The agentic free lunch is over.

First see your spend. Then cut it.

Track your spend

Cut your spend

Persistent context, delivered just-in-time

Index once

Plug into your agent

Just-in-time context

More than just the MCP server

MCP Server

Reports

Specs & Chat

Product Workflow

Cheaper tokens are only half the story.

Evidence-backed answers

Standards before generation

Higher first-pass quality

Built for the people paying the agent bill

Engineering Leaders

Platform Teams

Developers

Know your AI spend. Then take control of it.

See your team's AI spend.
Then cut it.