llm-cache
llm-cache is currently a thin scaffold, not an active cache engine. It exists to reserve the cache-policy namespace and enable the module on a location, but it does not yet implement cache mode selection, TTLs, scope rules, bypass controls, lookup, replay, or storage.
When to use this module
- You want the
llm_cachenamespace present in config while the stack’s cache boundary is still being defined. - You are documenting or testing module composition and want explicit location-level enablement.
- You are preparing for future cache-policy work that will likely execute on
llm-proxy’s live request/response path.
nginx.conf synthesis
Minimal enablement:
location /v1 {
llm_proxy;
llm_proxy_route openai openai_upstream;
llm_proxy_default_provider openai;
llm_cache;
proxy_pass https://$llm_provider_upstream;
}
Directive reference
| Directive | Contexts | Default | Description |
|---|---|---|---|
llm_cache | location | — | Enable the cache module scaffold for this location. |
Current behavior
- The module only stores an enable flag in location config.
- No cache lookup, replay, or storage path exists yet.
- No cache-specific nginx variables are exported.
- No cache eligibility, TTL, scope, bypass, negative-cache, exact-match, or coalescing directives exist in the current implementation.
- Runtime cache execution, when implemented, is expected to live with
llm-proxyrather than in a separate intercepting module.
Behavior notes
- This page intentionally documents the implemented surface only. Earlier draft docs described future cache slices; those are not active module behavior today.
- Semantic caching is not the current implementation target.
- Future cache work should be treated as policy owned by
llm-cacheand live request execution owned byllm-proxy.