AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Intermediate

Semantic Caching for LLMs: Cut Your Token Bill in Python

Build a semantic cache that reuses answers for similar prompts and slashes LLM API costs.

10 min read · Kodetra Technologies

Today

Intermediate

Reuse a huge codebase prefix across every Fable 5 call and pay ~90% less.

8 min read · Kodetra Technologies

13d ago

Intermediate

Use DeepSeek V4 Pro's auto KV cache to run huge-context jobs for cents.

8 min read · Kodetra Technologies

17d ago

Intermediate

Use Opus 4.8 role:system messages mid-conversation to update agent rules without invalidating the cache.

10 min read · Kodetra Technologies

May 29

Intermediate

Point the Anthropic SDK at Qwen 3.7 Max with one base-URL change: 1M context, thinking, caching.

8 min read · Kodetra Technologies

May 27

Advanced

Stop cache stampedes with locking, single-flight, and probabilistic early expiry.

11 min read · Kodetra Technologies

May 22