headroom
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Discover how CodeCompressor uses AST-aware compression in Headroom. It parses code into abstract syntax trees, scores symbols, and prunes definitions without losing validity.
How to Use the Headroom LiteLLM Callback for Automatic Compression
Learn how to use the Headroom LiteLLM callback to compress your requests automatically. Once integrated, it applies zero-configuration compression to every LiteLLM call.
How to Configure OpenTelemetry Tracing with Headroom: Complete Setup Guide
Learn how to configure OpenTelemetry tracing with Headroom using this complete setup guide. Export spans to Langfuse by following the step-by-step instructions.
How Streaming Works with CCR Response Handling in Headroom
Understand streaming CCR response handling in Headroom. Discover how it buffers chunks, retrieves cache content, and injects it seamlessly for efficient stream management.
How to Integrate Headroom with the Agno Framework: Complete Implementation Guide
Integrate Headroom with the Agno framework using HeadroomAgnoModel. Optimize context directly in Agno's model pipeline for enhanced AI agents.
How to Configure the Compression Policy for Different Content Types in Headroom
Learn to configure Headroom's compression policy for varied content types. Control compression aggressiveness and optimize performance with the ContentRouter and specialized compressors.
How Cross-Agent Memory Works with SharedContext in Headroom
Discover how cross-agent memory works with SharedContext in Headroom. This feature efficiently shares large data, cuts token usage by 80%, and maintains content for retrieval.
How to Debug Compression Issues with Headroom Performance: 7 Diagnostic Methods
Troubleshoot compression performance problems in Headroom. Use 7 diagnostic methods to find and fix issues fast. Debug policy, logging, and routing for better performance.
How IntelligentContextManager Handles Message-Level Compression in Headroom
Discover how IntelligentContextManager compresses messages using a rolling-window strategy and ratio-based thresholds to manage token budgets within the Headroom repository.
TransformPipeline Architecture in Headroom: How to Customize the LLM Compression Pipeline
Explore the Headroom TransformPipeline architecture for LLM compression. Customize this deterministic sequence of composable transforms to reduce token usage and preserve information effectively.
How to Configure CCR Retrieval for Your Specific Use Case
Learn how to configure CCR retrieval for your use case by adjusting flags in CCRConfig and ProxyConfig. Control tool injection and response handling efficiently.
How to Use the Headroom Wrap Command with Claude Code: Complete Guide
Master the headroom wrap command with Claude Code. Learn how to intercept API calls and run the claude binary locally. Get the complete guide now.