headroom

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

113 articles
How CodeCompressor Handles AST-Aware Compression in Headroom

Discover how CodeCompressor uses AST-aware compression in Headroom. It parses code into abstract syntax trees, scores symbols, and prunes definitions while keeping the remaining code syntactically valid.

deep-dive
Jun 10, 2026
How to Use the Headroom LiteLLM Callback for Automatic Compression

Learn how to use the Headroom LiteLLM callback to automatically compress your requests. Integrate seamlessly for zero-configuration compression with every LiteLLM call.

how-to-guide
Jun 10, 2026
How to Configure OpenTelemetry Tracing with Headroom: Complete Setup Guide

Learn how to configure OpenTelemetry tracing with Headroom using this complete setup guide. Follow the step-by-step instructions to export spans to Langfuse.

how-to-guide
Jun 10, 2026
How Streaming Works with CCR Response Handling in Headroom

Understand streaming CCR response handling in Headroom. Discover how it buffers chunks, retrieves cached content, and injects it into the stream.

how-to-guide
Jun 10, 2026
How to Integrate Headroom with the Agno Framework: Complete Implementation Guide

Integrate Headroom with the Agno framework using HeadroomAgnoModel. Optimize context directly in Agno's model pipeline for enhanced AI agents.

how-to-guide
Jun 10, 2026
How to Configure the Compression Policy for Different Content Types in Headroom

Learn to configure Headroom's compression policy for varied content types. Control compression aggressiveness and optimize performance with the ContentRouter and specialized compressors.

how-to-guide
Jun 10, 2026
How Cross-Agent Memory Works with SharedContext in Headroom

Discover how cross-agent memory works with SharedContext in Headroom. This feature efficiently shares large data, cuts token usage by 80%, and maintains content for retrieval.

how-to-guide
Jun 10, 2026
How to Debug Compression Issues with Headroom Performance: 7 Diagnostic Methods

Troubleshoot compression performance problems in Headroom. Use 7 diagnostic methods to find and fix issues in policy, logging, and routing.

how-to-guide
Jun 10, 2026
How IntelligentContextManager Handles Message-Level Compression in Headroom

Discover how IntelligentContextManager compresses messages using a rolling-window strategy and ratio-based thresholds to manage token budgets within the Headroom repository.

deep-dive
Jun 10, 2026
TransformPipeline Architecture in Headroom: How to Customize the LLM Compression Pipeline

Explore the Headroom TransformPipeline architecture for LLM compression. Customize this deterministic sequence of composable transforms to reduce token usage and preserve information effectively.

architecture
Jun 10, 2026
How to Configure CCR Retrieval for Your Specific Use Case

Learn how to configure CCR retrieval for your use case by adjusting flags in CCRConfig and ProxyConfig. Control tool injection and response handling efficiently.

how-to-guide
Jun 10, 2026
How to Use the Headroom Wrap Command with Claude Code: Complete Guide

Master the headroom wrap command with Claude Code. Learn how to intercept API calls and run the claude binary locally. Get the complete guide now.

how-to-guide
Jun 10, 2026
