How Transformers Handles Model Hub Caching and Offline Loading

Question

Learn how Hugging Face Transformers handles model hub caching and offline loading. Discover seamless local cache checks and efficient downloads or error handling for offline use.

Accepted Answer

When you call , the library first checks the local cache at ; if the file exists it returns immediately, otherwise it downloads from the Hub unless offline mode is active, in which case it raises a clear error or falls back to cached files. The Hugging Face Transformers library implements a robust two-tier system for model hub caching and offline loading that ensures reproducible, air-gapped deployments. Every call to , , or ultimately routes through , where the library decides whether to read from disk or fetch from the network. Cache Architecture and Directory Layout The cache root is determined by environment variables and constants defined in the codebase. By default, files live in , controlled by and overridable via or . In (lines 87‑93), the library resolves the cache directory: The on-disk layout mirrors the Hub repository structure: This design allows (hub.py lines 100‑108) to locate files using only the repo id, filename, and revision hash. The Caching Pipeline Every file fetch follows a strict resolution order implemented in ( , lines 10‑33 and 47‑108): 1. Local directory check – If points to a local path, it is used directly (lines 70‑86). 2. Cache lookup – checks whether the exact revision already exists in (lines 94‑110). 3. Offline guard – (line 361‑364) forces when or is set, skipping any network call. 4. Network download – If allowed, (single file) or (full repo) writes directly into the cache folder (lines 18‑28). 5. Return resolved path – The final list of local paths is returned to the caller. Error handling (lines 121‑128) produces descriptive messages when a file is missing in offline mode, telling users exactly which file is required and how to pre-download it. Offline Mode Implementation Offline behavior is controlled by two environment variables checked in : - – Modern, preferred switch. - – Legacy alias for backward compatibility. Both accept truthy values ( , , ). When active, the library automatically sets in (hub.py lines 361‑364), ensuring that is never invoked. You can also trigger local-only behavior per-call without setting environment variables: Practical Usage Examples Loading a model with automatic caching Enabling global offline mode Inspecting cache contents programmatically Using cached file directly Key Source Files | File | Role | |------|------| | | Central implementation of , , , and offline-mode guards. | | | Uses to protect weight-loading logic in . | | | Applies offline guards when fetching tokenizer files. | | | Implements offline checks for image and feature-extraction assets. | | | Shows offline guards for video-related processing files. | | | Test suite verifying offline behavior and environment-variable handling. | Summary - The Transformers library stores all downloaded artifacts in (configurable via ), using a structured layout that maps repo IDs to snapshot folders. - Offline mode is triggered by or , forcing and preventing any HTTP requests. - The function in orchestrates the resolution order: local path → cache lookup → (optional) download → error handling. - Users can force cache-only behavior per-call with , or inspect cache contents using . Frequently Asked Questions How do I completely disable network access in Transformers? Set the environment variable (or the legacy ) before importing the library. This forces every call to read only from the local cache at and raises a clear error if files are missing. Where does Transformers store downloaded models? By default, files live in . You can change this by setting the environment variable; the library appends to that path. The internal layout uses to keep different versions isolated. What happens if a file is missing while in offline mode? The library raises an with a descriptive message indicating exactly which file is required and suggesting that you download it while online first. This error originates in the exception handling block of in (lines 121‑128). Can I use a specific revision or commit hash when loading offline? Yes. When you specify in , the cache lookup in searches for a snapshot folder matching that exact hash. As long as you previously downloaded that revision while online, offline loading will succeed.

File	Role
`src/transformers/utils/hub.py`	Central implementation of `cached_files`, `cached_file`, `try_to_load_from_cache`, and offline-mode guards.
`src/transformers/modeling_utils.py`	Uses `is_offline_mode` to protect weight-loading logic in `from_pretrained`.
`src/transformers/tokenization_utils_tokenizers.py`	Applies offline guards when fetching tokenizer files.
`src/transformers/processing_utils.py`	Implements offline checks for image and feature-extraction assets.
`src/transformers/video_processing_utils.py`	Shows offline guards for video-related processing files.
`tests/utils/test_offline.py`	Test suite verifying offline behavior and environment-variable handling.

How Transformers Handles Model Hub Caching and Offline Loading

Cache Architecture and Directory Layout

The Caching Pipeline

Offline Mode Implementation

Practical Usage Examples

Loading a model with automatic caching

Enabling global offline mode

Inspecting cache contents programmatically

Using cached_file directly

Key Source Files

Summary

Frequently Asked Questions

How do I completely disable network access in Transformers?

Where does Transformers store downloaded models?

What happens if a file is missing while in offline mode?

Can I use a specific revision or commit hash when loading offline?

Have a question about this repo?