MCP Server for Code Retrieval · v1.80.1 · 1,700+ ★

Stop paying AI to read the whole file.
jCodeMunch cuts code-reading tokens 95%+

Model-agnostic MCP server that indexes your repo once and hands any AI agent — Claude, GPT, Gemini — the exact function, class, or constant. No more 200K-token file dumps. More wins, fewer timeouts, lower bill.

LIVE TELEMETRY SHOWING OVER
198,411,982,953
Tokens Saved by Participating Users
Since 3/3/2026
MORE THAN $1,002,972.57 SAVED!
$966.55 average savings by our top 1,000 active clients!
O(1)
Retrieval Speed
Run the 60-Second Proof → · Install Free · See Benchmarks · Commercial Pricing
Break-even in days of normal use. Calculate your payback →
Model-agnostic. Works with Claude Code, Cursor, Antigravity, Gemini, Windsurf, Codex, and any MCP-compatible client.
For Developers → For Engineering Leads → For Finance →
↓ See Who It's For
Mentioned by Artur Skowroński · VirtusLab · Traci Lim · AWS ASEAN · Geeky Gadgets · Sion Williams · Eric Grill Wall of Love →
Who This Is For

Built for AI-Powered Development Teams

jCodeMunch serves anyone whose AI agents read code — from solo developers to platform teams managing token budgets across entire organizations.

AI Engineers Using Claude Code
Build with MCP-native code retrieval instead of reading entire files into context. Ship faster with precise symbol access.
🔨
Teams Using MCP Clients
Works with Antigravity, Cursor, Gemini, and any MCP-compatible client. Drop-in setup, immediate token savings.
📚
Developers Onboarding to Large Repos
Navigate unfamiliar codebases by searching for symbols directly — no need to read through thousands of lines to find what matters.
💰
Organizations Managing LLM Token Costs
Cut code-reading token spend by 95%+ across your engineering org. The savings compound with every developer and every query.
🌐
Open-Source Maintainers
Enable contributors to explore your codebase efficiently with AI agents. Lower the barrier to meaningful contributions.
Target Codebase

Benchmark Codebase: fastapi/fastapi

A production Python web framework with 80k+ GitHub stars — routing, dependency injection, automatic OpenAPI generation, and security middleware — indexed directly from GitHub and queried verbatim for this benchmark.

📁
156
Source .py files
Excluding tests and docs
📝
~857 KB
Total source code
Raw characters
🪙
~214K
Tokens to read all files
tiktoken cl100k_base
🏗️
12
Core modules
routing, dependencies, security, openapi…
⚙️
883
Lines — routing.py alone
The heart of the framework
Benchmark

AI Code Retrieval Benchmark: File Reads vs Symbol Retrieval

Compare a standard agent's file-based exploration vs. jCodeMunch symbol retrieval for the query: "How does dependency injection work?"

Standard Agent (File-based)
// Iterating through likely files...
jCodeMunch MCP (Symbol-based)
// Querying symbol index...
214,312
Tokens — Old Way
~480
Tokens — jCodeMunch
99.8%
Token Reduction

Real AI Code Retrieval Results from fastapi/fastapi

These are verbatim results from the jCodeMunch MCP server querying the indexed fastapi/fastapi codebase.

Query search_symbols("dependency injection")
Without MCP — Read Each File 214,312 tokens
READ: fastapi/routing.py
→ 8,836 tokens consumed
READ: fastapi/dependencies/utils.py
→ 5,218 tokens consumed
READ: fastapi/applications.py
→ 2,100 tokens consumed
READ: fastapi/security/oauth2.py
→ 2,640 tokens consumed
... 152 more files ...
TOTAL: 214,312 tokens used
Time to first result: ~4.2 seconds
With jCodeMunch MCP ~480 tokens
→ code_search_symbols({
repo: "fastapi/fastapi",
query: "dependency injection"
})
✓ solve_dependencies(…) [dependencies/utils.py:154]
✓ get_dependant(…) [dependencies/utils.py:83]
✓ Depends(dependency) [params.py:4]
✓ run_in_threadpool(func, …) [concurrency.py:12]
Symbol source (solve_dependencies):
→ 428 chars, 14 lines retrieved
→ Exact function body, no noise
TOTAL: ~480 tokens used
Time to first result: 0.01 seconds
Token Usage by File

Where the Tokens Go: File-by-File Breakdown

Each file read floods the context window. jCodeMunch retrieves only the symbol requested.

| File | Lines | Tokens (Traditional) | Tokens (jCodeMunch) | Savings |
|---|---|---|---|---|
| fastapi/routing.py | 883 | 8,836 | ~0 (not needed) | 100% |
| fastapi/dependencies/utils.py | 580 | 5,218 | ~310 (one function) | 94.1% |
| fastapi/security/oauth2.py | 290 | 2,640 | ~90 (one helper) | 96.6% |

Token Costs Add Up: The Dollar Impact

The actual dollar impact of context-window waste on a 214K-token codebase.

Traditional Way
$1.083
per query
214,312 tokens × $5.055/1M
100 queries/day = $108.30/day
Monthly cost = $3,249
Annual cost = $38,988
jCodeMunch MCP
$0.0024
per query
~480 tokens × $5.055/1M
100 queries/day = $0.24/day
Monthly cost = $7.29
Annual cost = $87.48
You Save
$1.0806
per query (99.8%)
At 100 queries/day:
Save $108.06/day
Save $3,242/month
Save $38,901/year
$38,901
Saved per year at 100 AI queries/day — on a single codebase

Scale to multiple projects and more queries per day, and the savings multiply accordingly.
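The figures above follow directly from the per-token price. A quick back-of-envelope sketch reproduces them, using the token counts and the $5.055/1M blended price from the tables on this page:

```python
# Back-of-envelope cost model using the figures from the tables above:
# $5.055 per 1M tokens, 100 queries/day, 30-day months.

PRICE_PER_TOKEN = 5.055 / 1_000_000  # USD per token

def query_cost(tokens: int) -> float:
    """Dollar cost of one query consuming `tokens` input tokens."""
    return tokens * PRICE_PER_TOKEN

traditional = query_cost(214_312)  # read every file
jcodemunch = query_cost(480)       # pull one symbol

per_query = traditional - jcodemunch
monthly = per_query * 100 * 30     # 100 queries/day, 30-day month

print(f"traditional: ${traditional:.4f}/query")
print(f"jcodemunch:  ${jcodemunch:.4f}/query")
print(f"saved:       ${per_query:.4f}/query, ${monthly:,.0f}/month")
```

Swap in your own model's per-token price and query volume to get your payback numbers.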

Architecture

How jCodeMunch Indexes Code and Retrieves Exact Symbols

A pre-built symbol index lets the MCP server answer code queries in milliseconds with surgical precision.

1

Index Once

Run index_code_folder(path) to index code symbols, or index_doc_local(path) (jDocMunch) to index documentation sections. Both build a persistent local index — happens once per project.

2

AI Queries by Intent

Instead of reading files, the AI calls search_symbols(query) or get_symbol(id). The MCP server performs semantic + keyword search against the index in milliseconds.

3

Surgical Retrieval

Only the matching symbol's source code and metadata are returned — not the surrounding file, not unrelated classes. A 6,000-token file read is replaced by a 400-token symbol pull.
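jCodeMunch's internals aren't shown here, but the core idea — a persistent index mapping each symbol to a byte offset and length, so retrieval is a seek-and-read rather than a full-file parse — can be sketched in a few lines. The index-building logic and field layout below are illustrative, not the actual on-disk schema:

```python
import io

# Illustrative sketch of byte-offset symbol retrieval. The real
# jCodeMunch index format differs; this only demonstrates why a
# symbol pull stays O(1) in file size: seek + read, no parsing.

source = (
    "def get_dependant(path):\n    ...\n\n"
    "def solve_dependencies(request):\n    return {}\n"
)

def make_index(src: str) -> dict:
    """Built once at index time: symbol name -> (byte offset, byte length)."""
    index = {}
    for chunk in src.split("\n\n"):
        name = chunk.split("(")[0].removeprefix("def ")
        index[name] = (src.index(chunk), len(chunk.encode()))
    return index

INDEX = make_index(source)

def get_symbol_source(name: str) -> str:
    offset, length = INDEX[name]
    buf = io.BytesIO(source.encode())
    buf.seek(offset)                   # jump straight to the symbol
    return buf.read(length).decode()   # read only its body

print(get_symbol_source("solve_dependencies"))
```

Because the index stores where each symbol lives, retrieval cost is tied to the symbol's size, not the file's.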

Model-agnostic in the strongest sense. Symbol summaries fall through a 4-tier chain — Anthropic → Gemini → OpenAI-compatible (Ollama, LM Studio, OpenRouter) → signature-only fallback — so it works with whatever LLM you have a key for, or none. Embeddings ship with a bundled ONNX local model (all-MiniLM-L6-v2, 384-dim) for zero-config semantic search out of the box.

9 generations of index format, fully backward-compatible. Indexes built on v1.20 still load on v1.80 without re-parsing.
Use Cases

Common Use Cases for MCP Code Retrieval

Symbol-level retrieval changes how AI agents interact with code. Here are the workflows where jCodeMunch and jDocMunch deliver the most value.

🔍
Onboarding to an Unfamiliar Codebase
Search for key symbols and retrieve their implementations directly. Understand architecture without reading every file in the repo.
🔒
Tracing Authentication Flows
Find auth middleware, token validators, and permission checks by symbol search. Follow the call chain through exact function bodies.
🔄
Understanding Dependency Injection
Retrieve DI containers, providers, and resolver functions by name. See how dependencies wire together without reading framework internals.
💡
Impact Analysis Before Refactoring
Search for all references to a symbol before changing it. Understand the blast radius of a refactor with precise, low-token lookups.
🎯
Retrieving Exact Implementation Details
When an AI agent needs the body of a specific function, it retrieves just that function — 400 tokens instead of 6,000 for the full file.
📃
Documentation Retrieval with jDocMunch
Pair jCodeMunch with jDocMunch to give AI agents surgical access to both code and documentation. Search sections, not files.
Instant Framework Context

Starter Packs: Query Frameworks You've Never Cloned

Pre-built symbol indexes for popular frameworks. A 932 MB React repo becomes a 3 MB pack. A 1.4 GB Node.js monorepo becomes 10.6 MB. Install a pack and your AI agent gets symbol-level access — without cloning the repo or waiting for an index build.

📚
Learn Without Cloning
Explore React's fiber reconciler or Django's query compiler. Your agent searches thousands of symbols — no git clone, no local checkout.
🔎
Debug From Stack Traces
Got a confusing framework error? Search the pack for that symbol, read the implementation, understand the call path. Faster than GitHub search.
🔗
Cross-Repo Analysis
Install a framework pack + index your app. Now find_importers and get_blast_radius trace across the boundary into framework code.
Zero-Config Trial
No API key. No repo. No config. The free Node.js pack (76,700+ symbols) lets you experience real symbol retrieval in under 60 seconds.
10 packs available. Node.js (free), FastAPI, Django, Flask, React, LangChain, Anthropic SDK, MCP SDK, Laravel, and Spring Boot. Indexes are rebuilt weekly from the latest tagged release — always current, always stable API surfaces.

jcodemunch-mcp install-pack nodejs   —   Browse all Starter Packs ↗
Companion Tool

jDocMunch MCP: Documentation Retrieval to Match

jCodeMunch munches code. jDocMunch munches documentation — the same surgical retrieval approach, applied to Markdown, READMEs, specs, and any text-based docs in your repo.

📚
index_doc_local()
Index any local folder of Markdown, RST, or plain-text docs. One call, persistent index.
🔍
search_sections()
Semantic search across headings and content. Returns only the matching section, not the whole file.
📋
get_section()
Pull a specific doc section by ID. Same O(1) byte-offset retrieval as jCodeMunch symbols.
🗺
get_toc_tree()
Retrieve the full table of contents structure without loading any content — orient first, fetch later.
Same philosophy, different domain. When an AI agent needs to answer "how does the auth flow work?" it shouldn't read 40 README files. jDocMunch gives it search_sections("auth flow") — one call, the right section, nothing else.
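The "orient first, fetch later" idea behind get_toc_tree() is easy to picture: return the heading structure of a document without any of its body text. A minimal sketch of that extraction for Markdown — the output shape here is invented for illustration, not jDocMunch's actual payload format:

```python
import re

# Illustrative only: extract a heading outline from Markdown --
# titles and levels, no body text -- the way get_toc_tree() lets an
# agent see structure before fetching any content.

def toc_tree(markdown: str) -> list[dict]:
    return [
        {"level": len(m.group(1)), "title": m.group(2).strip()}
        for m in re.finditer(r"^(#{1,6})\s+(.*)$", markdown, re.MULTILINE)
    ]

doc = "# Auth\n\nIntro text.\n\n## Token flow\n\nDetails.\n\n## Refresh\n"
print(toc_tree(doc))
```

An agent that sees the outline first can request exactly one section by ID instead of paging through the file.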

pip install git+https://github.com/jgravelle/jdocmunch-mcp.git   —   Learn more about jDocMunch on GitHub ↗
New — Now Available

jDataMunch MCP: Instant Answers From Your Data

No code required. jDataMunch indexes your spreadsheets, databases, and data files so AI assistants can answer data questions precisely — without reading entire files or guessing at column names.

📊
describe_dataset()
Get an instant summary of any CSV or Excel file — row counts, column names, types, and nulls. No formulas needed.
🔍
search_data()
Search across column values to find matching rows instantly — without opening the file or writing a filter.
📈
describe_column()
Distribution, min/max, top values, histogram — everything about a column, retrieved surgically.
🔗
join_datasets()
Server-side SQL JOIN across two indexed datasets — inner, left, right, cross. Per-side filters and column projection.
Σ
aggregate()
GROUP BY with count, sum, avg, min, max, count_distinct, median. Pre-filter support, 1,000-group cap.
📈
get_correlations()
Pairwise Pearson correlations across numeric columns, sorted by strength. Detects relationships your agent would otherwise miss.
🛡
get_data_hotspots()
Rank columns by data-quality risk — null rate, cardinality anomalies, numeric outlier spread. Quality triage in one call.
18 tools. 4 formats. SQLite-backed. Built for data people, not just developers. Supports CSV, Excel, Parquet, and JSONL — index once, then ask questions. If you want your AI assistant to actually understand your data — not hallucinate column names or miss rows — jDataMunch is the missing piece.   —   See pricing & license details ↗
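Since jDataMunch is SQLite-backed, the aggregate() behavior described above maps naturally onto a SQL GROUP BY over the backing store. A hedged sketch of the equivalent query — the table and column names are invented for this example, and the real tool's parameters and output format differ:

```python
import sqlite3

# Illustrative only: the kind of GROUP BY query an aggregate() call
# implies, run against an in-memory SQLite table. Table and column
# names are invented for this example.

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE orders (region TEXT, amount REAL)")
con.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("east", 120.0), ("east", 80.0), ("west", 200.0)],
)

# Roughly: aggregate(group_by="region", metrics=["count", "sum", "avg"])
rows = con.execute(
    "SELECT region, COUNT(*), SUM(amount), AVG(amount) "
    "FROM orders GROUP BY region ORDER BY region"
).fetchall()

for region, n, total, avg in rows:
    print(region, n, total, avg)
```

The agent never sees the raw rows — only the grouped result, which is why answers stay small even on large datasets.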
Open Standard

Built on jMRI — an open retrieval spec

jCodeMunch, jDocMunch, and jDataMunch all conform to jMRI v1.0 — the Munch Retrieval Interface, a published Apache-2.0 spec for token-efficient retrieval. That means deterministic payload shapes, per-tool token budgets, and a stable contract that tools, dashboards, and agents can rely on. No black-box behavior, no breaking changes between minor versions.

Why an open spec? Imitators ship proprietary protocols and break agents on every release. jMRI is versioned, public, and Apache-2.0. If we ever stop maintaining it, the spec doesn't go with us.   —   Read the spec on GitHub ↗
Drop-in for Claude Code

One command. Every guardrail wired up.

Run jcodemunch-mcp init and the onboarding pipeline detects your MCP clients, writes the config, installs Claude Code hooks, generates Cursor and Windsurf rules, and indexes your repo. Every Read on a large code file auto-suggests symbol retrieval. Every Edit re-indexes the touched file. Every spawned subagent gets a condensed repo briefing.

🛡
PreToolUse
Intercepts Read on large code files and routes the agent to search_symbols + get_symbol_source instead.
🔄
PostToolUse
Auto-reindexes any file the agent edits or writes — the index is never stale across a session.
📂
PreCompact
Generates a session snapshot before context compaction so the next turn picks up with full state.
TaskCompleted
Post-task diagnostics: dead code, untested symbols, dangling references — surfaced before you ship.
🤖
SubagentStart
Injects a condensed repo orientation into every spawned agent so subagents don't burn tokens rediscovering structure.
Groq Remote MCP
Cloud-hosted, low-latency option for teams that don't want a local Python runtime — ships with the gcm CLI.
One more thing — the embedding-drift canary. check_embedding_drift pins canary embeddings on first index, then re-checks them periodically. When your provider silently swaps models (Gemini, OpenAI, bundled ONNX), we catch it before your retrieval quality quietly degrades. No competitor advertises this. jCodeMunch does it by default.
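The drift check itself reduces to comparing pinned canary vectors against freshly computed ones. A minimal sketch of that comparison — the 0.99 threshold and function names are assumptions for illustration, not jCodeMunch's actual implementation:

```python
import math

# Illustrative drift check: pin embeddings for canary phrases at index
# time, re-embed later, and flag any phrase whose cosine similarity
# drops. Threshold and names are assumptions, not jCodeMunch's values.

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def check_drift(pinned: dict, fresh: dict, threshold: float = 0.99) -> list:
    """pinned/fresh: dicts mapping canary phrase -> embedding vector."""
    return [p for p in pinned if cosine(pinned[p], fresh[p]) < threshold]

pinned = {"auth flow": [0.1, 0.9, 0.2]}
same = {"auth flow": [0.1, 0.9, 0.2]}     # provider unchanged
swapped = {"auth flow": [0.9, 0.1, 0.4]}  # model silently replaced

print(check_drift(pinned, same))     # no drift flagged
print(check_drift(pinned, swapped))  # canary fires
```

If the same text no longer embeds to (nearly) the same vector, the index and the query embeddings have diverged — exactly the failure this canary surfaces.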
FAQ

Frequently Asked Questions

Common questions about jCodeMunch, jDocMunch, and MCP-based code retrieval.

How is jCodeMunch different from RepoMapper?

Short version: RepoMapper is a ranked repository "map" (great for orientation and "what matters?"), while jCodeMunch is symbol-accurate retrieval (great for "show me the exact code" with tiny token spend). They overlap, but they're optimized for different jobs.

  • RepoMapper generates a "repo map" that highlights important files/definitions and relationships using Tree-sitter parsing plus a PageRank-like importance ranking. Best for first-pass orientation: "Which files matter for this task?"
  • jCodeMunch is symbol-first, not file-first: agents search and retrieve functions/classes/methods/constants directly via byte-offset seeking in O(1) time. File reads become the exception, not the default.
  • Scaling: maps get bigger as repos grow; targeted symbol pulls stay tiny even in massive repos.
If you want a fast, ranked breadcrumb trail across a new codebase, RepoMapper is a solid compass. If you need precise, repeatable, low-token code retrieval for AI agents, jCodeMunch is the right tool.
Which MCP clients does jCodeMunch work with?

jCodeMunch works with any client that supports the Model Context Protocol (MCP), including:

  • Claude Code — Anthropic's CLI for Claude, with native MCP support
  • Google Antigravity — Google's AI coding agent
  • Cursor — AI-native code editor
  • Gemini — Google's AI assistant with MCP integration
  • Any other MCP-compatible client or custom integration
Setup typically takes under a minute: install the server, add a config entry, and restart. See the README for client-specific instructions.
How do I integrate jCodeMunch with Google Antigravity?

Antigravity uses a standard MCP config file — setup takes about a minute.

  • Install the server: pip install git+https://github.com/jgravelle/jcodemunch-mcp.git
  • In Antigravity, open the Agent pane → click the menu → MCP Servers → Manage MCP Servers
  • Click View raw config to open mcp_config.json
  • Add the entry below, save, then restart the MCP server from the Manage MCPs pane
{
  "mcpServers": {
    "jcodemunch": {
      "command": "jcodemunch-mcp",
      "env": {
        "GITHUB_TOKEN": "ghp_...",
        "ANTHROPIC_API_KEY": "sk-ant-..."
      }
    }
  }
}
Both env vars are optional. ANTHROPIC_API_KEY enables AI-generated symbol summaries; GITHUB_TOKEN raises GitHub API rate limits and unlocks private repos.
Why not just use a larger context window?

Larger context windows have diminishing returns for code retrieval:

  • Cost scales linearly: reading 214K tokens costs ~$1.08 per query regardless of window size. jCodeMunch retrieves the same answer for ~$0.0024.
  • Signal-to-noise ratio drops: dumping entire files into context means the AI wades through thousands of irrelevant lines to find the one function it needs. Larger windows amplify the noise problem; they don't solve it.
  • Latency grows: more input tokens = slower time-to-first-token. Symbol retrieval returns results in milliseconds.
  • Attention degrades: research shows LLMs struggle with retrieval accuracy as context length increases — the "lost in the middle" effect. Smaller, precise context performs better.
A 1M-token context window doesn't help if you're paying to fill it with code the AI doesn't need. jCodeMunch gives the AI exactly what it asked for.
Can I use jCodeMunch and jDocMunch together?

Absolutely — they're designed as complementary tools. A typical pairing:

  • jCodeMunch for code retrieval: functions, classes, constants, methods — any symbol in your codebase.
  • jDocMunch for documentation retrieval: README sections, API docs, specs, architecture guides.

Together, they give AI agents surgical access to both code and docs without reading entire files. When an agent needs to understand how authentication works, it can search code symbols for the implementation and search doc sections for the design rationale — all with minimal token spend.

Trio bundle pricing (all three tools) is available at all tiers. See pricing below.
How much can jCodeMunch reduce my Claude token usage?

In benchmarks measured with tiktoken cl100k_base across 15 tasks on 3 real repositories, jCodeMunch achieved a 95% average token reduction for code retrieval operations.

  • fastapi/fastapi benchmark: 214,312 tokens (reading all files) → ~480 tokens (symbol retrieval) = 99.8% reduction
  • Results vary by query type — retrieval of a single function body is near-zero cost, while broader searches return more symbols but still far less than full file reads.
  • The 95% average is a conservative, tokenizer-measured figure across diverse query types and repos of different sizes.
Full methodology and raw data: benchmarks/README.md on GitHub
Licensing

Pricing: Choose Your License

Choose a single-product license for code, docs, or data — or get all three in a Munch Trio bundle. All licenses are commercial-use licenses for the specified tier.

Builder
Solo developer
Commercial use for 1 developer.
$99
trio bundle ($147 if purchased separately)
  • jCodeMunch Builder License included
  • jDocMunch Builder License included
  • jDataMunch Builder License included
  • Code, docs, and data retrieval in one
Platform
Org-wide deployment
Unlimited use company-wide!
$2,499
entire suite ($2,997 if purchased separately)
  • jCodeMunch Platform License included
  • jDocMunch Platform License included
  • jDataMunch Platform License included
  • Full retrieval suite, org-wide, all three tools
Need enterprise terms or a custom deployment arrangement? Contact us for enterprise licensing.