AI for Coding

Image: Flickr / Wikimedia Commons / Unsplash

How to Save Tokens in Claude Code Using Graphify

Stop burning your Claude usage limits on file search. Build a knowledge graph once, query it forever.

June 28, 202620 minutes

How to Save Tokens in Claude Code Using Graphify

If you use Claude Code heavily, you have almost certainly seen the usage limit warning cut your session short. The cause is usually not the code you are writing - it is the token overhead of Claude re-reading your project from scratch every session. Graphify is an open-source Claude Code skill that pre-compiles your codebase into a queryable knowledge graph, so Claude navigates structure instead of running Glob and Grep calls. This guide covers installation, Claude Code integration, and the workflow changes that actually move the needle on token savings.

Best For

Claude Code users hitting session or daily usage limits
Developers working on large codebases with 20 or more files
Teams who want shared project context without paying the rebuild cost every session
Engineers running Claude Code on 50k+ line monorepos

Requirements

Python 3.10 or later
Claude Code installed and authenticated
uv (recommended), pipx, or pip
A codebase or project folder to index

Why Claude Code burns through your usage limits

Every Claude Code session starts blind. When you open a project, Claude has no memory of what it saw last time. To answer a basic architecture question, it runs Glob to find files, Grep to search content, and Read to open files one by one. On a mid-size Python or TypeScript project, that navigation overhead alone can cost 15,000 to 20,000 tokens before Claude has enough context to give a useful answer.

For active users, this adds up to 300,000 to 400,000 tokens per week. On Claude Pro or a Claude Code subscription, that is exactly the kind of usage pattern that triggers the claude usage limits warning that ends your session early.

The root cause is structural. Claude is not slow or imprecise. It simply has no map. Every session it rediscovers your codebase the same way a new engineer would on day one, searching around and asking clarifying questions before doing any real work.

Sources for this section

GitHub Gist - 71x token savings guide

What Graphify does instead

Source: graphifylabs.ai

Graphify is an open-source Claude Code skill (61,000 GitHub stars, backed by Y Combinator S26) that processes your project once, builds a persistent knowledge graph, and injects that graph as context before Claude ever touches a raw file. Instead of search calls, Claude reads graph nodes and edges to understand your project's structure.

The graph is built in three passes. First, tree-sitter AST processing extracts code structure locally - no API call required, no file contents leave your machine. Second, video and audio files are transcribed locally with faster-whisper. Third, docs, PDFs, and images go through an LLM extraction pass that pulls concepts and relationships into the graph.

The benchmark most widely cited is 71.5x fewer tokens per query on a 52-file corpus. Token reduction scales with project size - a 6-file project fits in Claude's context window anyway, so the graph adds structural clarity rather than compression. On a 50,000-line codebase, the savings are significant.

Sources for this section

Graphify GitHub README Medium - Graphify + Claude Code: How I Cut Token Usage by 71x

Install Graphify

Graphify is distributed as a Python package on PyPI. One naming detail to get right: the PyPI package is graphifyy (double y). Other packages named graphify on PyPI are not affiliated with this project. The CLI command you run afterward is still graphify (single y). Using uv is recommended because it manages PATH automatically and isolates the package in its own environment, avoiding the ModuleNotFoundError that plain pip can produce on Mac and Windows.

Sources for this section

Graphify Install Docs

Install via uv (recommended on Mac and Linux):

uv tool install graphifyy

Install via pipx (alternative):

pipx install graphifyy

Install via plain pip (requires manual PATH setup on Mac and Linux):

pip install graphifyy

Once installed, register the skill globally with your AI assistant:

graphify install

Quick install by operating system:

macOS: brew install [email protected] uv
Windows: winget install astral-sh.uv
Ubuntu/Debian: sudo apt install python3.12 pipx or curl -LsSf https://astral.sh/uv/install.sh | sh

Connect Graphify to Claude Code

graphify install registers the skill, but graphify claude install is what wires up the always-on behavior inside Claude Code. This command writes a PreToolUse hook to your Claude Code settings.json that fires automatically before any Glob or Grep call, intercepting file search and replacing it with a graph query. Run this from inside your project directory after building a graph (covered in the next section).

Run this inside your project directory to install the always-on Claude Code hook:

graphify claude install

The hook written to settings.json looks like this:

{ "hooks": { "PreToolUse": [ { "matcher": "Glob|Grep", "hooks": [ { "type": "command", "command": "graphify hook-context" } ] } ] } }

Every time Claude Code is about to run a Glob or Grep call, this hook fires first. It queries the graph and returns a structured JSON payload with relevant nodes, edges, and community context. Claude reads the graph result instead of running the search.

Project-scoped install (writes to current repo instead of user profile): graphify install --project
Uninstall from all platforms at once: graphify uninstall
Remove graphify-out/ folder along with uninstall: graphify uninstall --purge
Uninstall only from Claude Code: graphify claude uninstall

Build your first knowledge graph

Once Graphify is installed and connected to Claude Code, open Claude Code in your project directory and run the main command. On first run, expect 60 to 90 seconds on a medium-size project. Subsequent incremental builds only reprocess changed files and are significantly faster. Code files are processed locally with no API call - only docs, PDFs, and images require an LLM extraction pass.

Inside Claude Code, in your project directory:

/graphify .

Three output files are created under graphify-out/:

graph.html - interactive visualization you can open in any browser (click nodes, filter, search)
GRAPH_REPORT.md - highlights including god nodes (most-connected concepts), surprising connections, and suggested questions Claude is uniquely positioned to answer
graph.json - the full graph in JSON format, queryable without re-reading your files

On very large graphs (over 5,000 nodes), skip the HTML to avoid browser slowdowns:

/graphify . --no-viz

At the start of future sessions, refresh only changed files instead of rebuilding:

/graphify . --update

Query the graph instead of grepping raw files

Once the graph is built, Graphify changes how Claude Code navigates your project. The always-on PreToolUse hook handles the most common case: whenever Claude is about to run Glob or Grep, it reads the graph first and returns structured context. That covers most exploratory and orientation questions automatically.

For architectural questions - how two systems connect, what depends on what, why a specific design decision was made - you can call graph queries directly. These go deeper than the summary in GRAPH_REPORT.md, traversing the raw graph.json hop by hop and surfacing edge-level detail like relation type and confidence score.

Ask Claude Code about a connection between two parts of your codebase:

/graphify query "what connects the auth module to the database?"

Trace the exact path between two named components:

/graphify path "UserService" "DatabasePool"

Get a plain-English explanation of a specific component and its relationships:

/graphify explain "RateLimiter"

You can also generate a readable architecture page with Mermaid call-flow diagrams:

graphify export callflow-html

Think of it this way: the always-on hook gives Claude a map. The /graphify query, path, and explain commands let Claude navigate that map precisely when you need specific answers.

Where the savings compound the most

The token reduction is most significant on large projects. A 50,000-line monorepo where Claude currently runs multiple Glob and Grep cycles per question is the ideal use case. On smaller projects (under 20 files), Graphify adds structural clarity but the compression benefit is minimal.

Savings also compound in team setups. The graphify-out/ folder is designed to be committed to git. One developer runs /graphify . and commits the output. Everyone who pulls starts with the graph already built and does not pay the rebuild cost. A post-commit hook keeps the graph current automatically.

For CI pipelines or headless extraction without an active IDE session, Graphify supports a claude-cli backend that routes through your Claude Code subscription rather than a separate Anthropic API key:

Sources for this section

Graphify Team Setup Documentation

graphify extract ./src --backend claude-cli

This routes through your existing Claude Code subscription, not a separate Anthropic API key. It is the most cost-efficient path for CI extraction on teams that already use Claude Code.

Install the commit hook to auto-rebuild the graph (AST only, no API cost) after each commit:

graphify hook install

When Graphify is not the right call

On small codebases where everything fits in Claude's context window anyway, Graphify can increase token usage rather than reduce it. Each /graphify query invocation carries its own token overhead. When Claude Code runs 3 to 4 query invocations in a single workflow, that adds up to 8,000 or more tokens - sometimes more than a direct file read would have cost. This is documented in the Graphify GitHub issue tracker.

The clear signal that Graphify is helping: Claude Code is running 4 or more Glob or Grep calls to answer a single question. That is when the graph query is cheaper than the search chain. If your project is small enough that Claude answers questions in 1 to 2 file reads, the graph adds overhead rather than removing it.

A rough threshold: Graphify is worth setting up on projects with 20 or more files, or where you repeatedly hit the claude code usage limit mid-session due to navigation overhead. Below that, the structural clarity is useful but the compression benefit is negligible.

Sources for this section

Graphify Issue #580 - Token efficiency discussion Graphify Issue #985 - Token usage vs Claude Code agent

Session habits that stack the savings

Run /graphify --update at the start of each session to sync changed files without a full rebuild
Commit graphify-out/ to git so teammates start with the graph already built
Use /graphify query before asking broad architectural questions instead of letting Claude search raw files
Add --no-viz on large projects (5,000+ nodes) to skip HTML generation and speed up the build
Install the commit hook with graphify hook install to auto-rebuild the graph after each commit with no API cost
If you see Claude running multiple Grep calls in a row, interrupt and call /graphify query first
Reserve your Claude usage limits for actual code changes - let graph queries handle codebase exploration
Run graphify reflect at the end of sessions to aggregate Q&A history into a LESSONS.md file for future context

Sources

Graphify GitHub Repository Medium - Graphify + Claude Code: How I Cut Token Usage by 71x GitHub Gist - Cut Claude Code token usage by 71x step-by-step guide Graphify Issue #580 - Token efficiency on small codebases Graphify Issue #985 - Token usage vs Claude Code agent

Brian Weerasinghe

AI & Technology Researcher

Brian Weerasinghe is the founder and editor of AI Eating The World, where he covers artificial intelligence, tech companies, layoffs, startups, and the future of work. His reporting focuses on how AI is transforming businesses, products, and the global workforce. He writes about major developments across the AI industry, from enterprise adoption and funding trends to the real-world impact of automation and emerging technologies.

How to Save Tokens in Claude Code Using Graphify

Best For

Requirements

Why Claude Code burns through your usage limits

What Graphify does instead

Install Graphify

Connect Graphify to Claude Code

Build your first knowledge graph

Query the graph instead of grepping raw files

Where the savings compound the most

When Graphify is not the right call

Session habits that stack the savings

Brian Weerasinghe

Related in AI for Coding

Figma MCP Server Setup: Claude, Codex, Cursor, Copilot, and Antigravity

How to Get Claude Code Free Using OpenRouter's Free API

How to Get Started with Claude Code: A Beginner's Complete Guide

Best For

Requirements

Why Claude Code burns through your usage limits

What Graphify does instead

Install Graphify

Connect Graphify to Claude Code

Build your first knowledge graph

Query the graph instead of grepping raw files

Where the savings compound the most

When Graphify is not the right call

Session habits that stack the savings

Brian Weerasinghe

Related in AI for Coding

Figma MCP Server Setup: Claude, Codex, Cursor, Copilot, and Antigravity

How to Get Claude Code Free Using OpenRouter's Free API

How to Get Started with Claude Code: A Beginner's Complete Guide

The AI brief for people adapting to changes in work