AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Build an LLM Spend Governor: Budget Caps in Python

Intermediate

A runnable Python governor that caps LLM spend per user and auto-downgrades models.

10 min read·Kodetra Technologies

Yesterday

Code Mode for MCP: Let Claude Write Code to Call Tools

Intermediate

Cut MCP agent context by up to 99% by exposing tools as a code API that the model calls from code it writes.

9 min read·Kodetra Technologies

4d ago

Stateless MCP in Python: The 2026-07-28 Handle Pattern

Intermediate

Port your MCP server to the stateless 2026-07-28 spec using the explicit-handle pattern.

10 min read·Kodetra Technologies

5d ago

Progressive Skill Loading: 40+ Agent Skills, No Bloat

Intermediate

Build a skill-manifest registry so an AI agent wields dozens of skills without context bloat.

9 min read·Kodetra Technologies

5d ago

Loop Engineering: From Prompts to Verified Agent Loops

Intermediate

Build a plan-act-verify agent loop with an external check, retry budget, and clear stop rules.

9 min read·Kodetra Technologies

13d ago

Fable 5 Prompt Caching: Slash 1M-Token Codebase Costs

Intermediate

Reuse a huge codebase prefix across every Fable 5 call and pay ~90% less.

8 min read·Kodetra Technologies

13d ago

DeepSeek V4 Pro: Cheap 1M-Token Context in Python

Intermediate

Use DeepSeek V4 Pro's auto KV cache to run huge-context jobs for cents.

8 min read·Kodetra Technologies

17d ago

MiniMax M3: Master 1M-Token Long Context With MSA

Intermediate

MiniMax M3 hands-on: MSA sparse attention plus real 1M-token long context, with runnable Python.

10 min read·Kodetra Technologies

17d ago

Qwen 3.7 Max: Drop-In Anthropic SDK Swap in Python

Intermediate

Point the Anthropic SDK at Qwen 3.7 Max with one base-URL change: 1M context, thinking, caching.

8 min read·Kodetra Technologies

May 27

Connect Your App to Gemini Spark with MCP

Intermediate

Build a standard MCP server in Python that plugs into Gemini Spark and Claude Desktop.

9 min read·Kodetra Technologies

May 26

LangGraph Subagents: Stop AI Agent Context Bloat

Intermediate

Use LangGraph v0.4 subagents to isolate tool noise and keep main agent context clean.

4 min read·Kodetra Technologies

May 13
