Smaller, precise context.
Less spend.
More signal.
We benchmarked nine code-intelligence operations with and without Constellation's MCP tools. Same model, same tasks, same codebase. Constellation carries the load, not your context window.
Where Constellation decisively beats grep-and-guess.
The three analyses below are where engineering hours actually get burned. A pre-indexed knowledge graph turns them from long, expensive exploration runs into deterministic single queries.
The full breakdown. Honest data.
All nine operations, sorted by cost savings. The pattern is clean: the harder the analysis, the more decisively the graph wins. Simple symbol lookups are roughly a wash — but those aren't where engineering hours live.
getSymbolDetails metrics: Baseline wins on this one because the operation
is trivial; grab a single file, read a function. Constellation's overhead exceeds its benefit for this operation, especially at scale.
We're showing it anyway. Constellation provides non-trivial context; cyclomatic complexity, exported state, etc. Constellation doesn't win based on tokens everywhere; it wins where the
work is hard, and that's where token budgets break.
Two ways to answer "what depends on this?"
The difference isn't model capability. It's information architecture. Baseline rediscovers your codebase on every query. Constellation already knows it.
Baseline: repetitive rediscovery
The model has tools — grep, read_file, glob — and no map of the territory. Every query reconstructs structural knowledge that gets thrown away the moment the session ends.
- Wide context loads. Reading whole files to find a few relevant lines.
- Speculative greps. "Maybe it's called X? No, try Y." Each miss bills tokens.
- No persistence. Yesterday's discoveries don't carry into today's session.
Constellation: code understanding
The codebase is indexed into a knowledge graph. Symbols, references, dependencies, and call edges live in structured form. The model writes one piece of code to query the graph.
- Code Mode. A single
code_intelcall composes multiple operations in parallel. - Structured returns. Real symbol IDs, not regex hits. No re-parsing.
- Persistent. The graph survives sessions. Every engineer shares the same map.
- Subject
- Claude Code (Sonnet 4.6)
with @constellationdev/mcp - Operations
- 9 code-intelligence tools
searchSymbols → impactAnalysis - Sample size
- 3 iterations per tool
n = 54 total runs - Measured
- Billable tokens, USD cost,
turn count, duration