AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Fable 5 Prompt Caching: Slash 1M-Token Codebase Costs

Intermediate

Reuse a huge codebase prefix across every Fable 5 call and pay ~90% less.

8 min read·Kodetra Technologies

Yesterday

Claude Programmatic Tool Calling: Cut Agent Token Costs

Intermediate

Let Claude write code that calls your tools in a loop — 20–40% fewer tokens, same accuracy.

10 min read·Kodetra Technologies

Yesterday

Kimi K2.7 Code: Build a Multi-File Refactor Agent

Intermediate

Drive Moonshot's open-weight coding model through a real tool-calling loop in Python.

9 min read·Kodetra Technologies

2d ago

DiffusionGemma in Python: Generate Text 4x Faster

Machine Learning

Intermediate

Run Google's open diffusion LLM with Transformers and learn why it decodes text in parallel.

9 min read·Kodetra Technologies

2d ago

Fable 5 Effort: Cut Thinking Token Costs in Python

Intermediate

Claude Fable 5 always thinks. Use effort, display and max_tokens to control reasoning cost.

9 min read·Kodetra Technologies

2d ago

Claude Agent SDK: Lock Down Tools With Permission Hooks

Intermediate

Stop runaway tool calls and agent spawning using canUseTool, PreToolUse hooks and deny rules.

9 min read·Kodetra Technologies

5d ago

Handle Fable 5 Refusals With Fallbacks in Python

Intermediate

Catch Claude Fable 5's stop_reason refusal and auto-retry on Opus 4.8 without breaking production.

10 min read·Kodetra Technologies

6d ago

Gemma 4 Tool Calling: Build a Local AI Agent

Intermediate

Run Google's open Gemma 4 locally with Ollama and wire up real function calling for an agent.

10 min read·Kodetra Technologies

7d ago

Claude Opus 4.8 Fast Mode: 2.5x Faster Output in Python

Intermediate

Use speed:"fast" on Claude Opus 4.8 for up to 2.5x faster output, with a safe rate-limit fallback.

9 min read·Kodetra Technologies

7d ago

Claude Fable 5 in Python: Build a Self-Checking Agent

Intermediate

Build a tool-using agent on Anthropic's Claude Fable 5 that plans, acts, and verifies its own work.

8 min read·Kodetra Technologies

7d ago

Build an Apple-Style Multi-Model AI Router in Python

Intermediate

WWDC let iPhone users pick ChatGPT, Gemini, or Claude. Build the same model router in Python.

9 min read·Kodetra Technologies

8d ago

Claude Memory Tool: Persistent Agent State in Python

Intermediate

Build agents that remember across sessions with Claude's /memories tool — full Python tutorial.

10 min read·Kodetra Technologies

14d ago
