Guides

AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Fable 5 Prompt Caching: Slash 1M-Token Codebase Costs

Tutorials

Intermediate

Fable 5 Prompt Caching: Slash 1M-Token Codebase Costs

Reuse a huge codebase prefix across every Fable 5 call and pay ~90% less.

8 min read·Kodetra Technologies

Today

Claude Programmatic Tool Calling: Cut Agent Token Costs

Tutorials

Intermediate

Claude Programmatic Tool Calling: Cut Agent Token Costs

Let Claude write code that calls your tools in a loop — 20–40% fewer tokens, same accuracy.

10 min read·Kodetra Technologies

Today

Fable 5 Effort: Cut Thinking Token Costs in Python

Tutorials

Intermediate

Fable 5 Effort: Cut Thinking Token Costs in Python

Claude Fable 5 always thinks. Use effort, display and max_tokens to control reasoning cost.

9 min read·Kodetra Technologies

Yesterday

DeepSeek V4 Pro: Cheap 1M-Token Context in Python

Tutorials

Intermediate

DeepSeek V4 Pro: Cheap 1M-Token Context in Python

Use DeepSeek V4 Pro's auto KV cache to run huge-context jobs for cents.

8 min read·Kodetra Technologies

3d ago

MiniMax M3: Master 1M-Token Long Context With MSA

Tutorials

Intermediate

MiniMax M3: Master 1M-Token Long Context With MSA

MiniMax M3 hands-on: MSA sparse attention plus real 1M-token long context, with runnable Python.

10 min read·Kodetra Technologies

4d ago

Claude Opus 4.8 Effort Levels: A Hands-On Python Guide

Tutorials

Intermediate

Claude Opus 4.8 Effort Levels: A Hands-On Python Guide

Tune token spend on Opus 4.8 with the effort parameter. Runnable Python, real I/O, real numbers.

7 min read·Kodetra Technologies

15d ago

More guides like this?

New AI guides for builders, in your inbox. Free.

DSPy GEPA: Auto-Optimize AI Agent Prompts

Tutorials

Intermediate

DSPy GEPA: Auto-Optimize AI Agent Prompts

Use DSPy GEPA to auto-evolve prompts with reflection and beat hand-tuned baselines.

4 min read·Kodetra Technologies

24d ago

How to Secure an MCP Server Against Tool Poisoning

Security

Advanced

How to Secure an MCP Server Against Tool Poisoning

Harden MCP servers: kill tool poisoning, validate tokens, sandbox tools

9 min read·Kodetra Technologies

27d ago

GPT-5.4 Tool Search: Load Only What You Need

Tutorials

Beginner

GPT-5.4 Tool Search: Load Only What You Need

Use OpenAI's tool search to dynamically load tools at runtime, cutting token usage by 47% in large tool ecosystems.

4 min read·Kodetra Technologies

Apr 14