Guides

AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Semantic Caching for LLMs: Cut Your Token Bill in Python

Intermediate

Semantic Caching for LLMs: Cut Your Token Bill in Python

Build a semantic cache that reuses answers for similar prompts and slashes LLM API costs.

10 min read·Kodetra Technologies

Today

GPT-5.6 Sol: Max Effort and Ultra Subagents in Python

Intermediate

GPT-5.6 Sol: Max Effort and Ultra Subagents in Python

Use GPT-5.6 Sol's new max reasoning effort and ultra subagents via the Responses API.

13 min read·Kodetra Technologies

Today

Stream Gemini Thinking: Build a Show-Your-Work CLI

Intermediate

Stream Gemini Thinking: Build a Show-Your-Work CLI

Stream Gemini's thought summaries live, control reasoning effort, and track thinking-token cost.

8 min read·Kodetra Technologies

3d ago

Gemini Thought Summaries: Audit Deep Think Reasoning

Intermediate

Gemini Thought Summaries: Audit Deep Think Reasoning

Surface, stream, and log Gemini 2.5 Pro Deep Think's reasoning chain with thought summaries.

10 min read·Kodetra Technologies

4d ago

Code Mode for MCP: Let Claude Write Code to Call Tools

Intermediate

Code Mode for MCP: Let Claude Write Code to Call Tools

Cut MCP agent context up to 99% by exposing tools as a code API the model calls in code.

9 min read·Kodetra Technologies

4d ago

Claude Advisor Tool: Pair Haiku With Opus in Python

Intermediate

Claude Advisor Tool: Pair Haiku With Opus in Python

Let a cheap executor model consult a stronger advisor mid-task in one Messages API call.

11 min read·Kodetra Technologies

7d ago

Grok Imagine 1.5: Animate a Photo via the xAI API

Intermediate

Grok Imagine 1.5: Animate a Photo via the xAI API

Turn a still image into a 720p video with native audio using xAI's Grok Imagine 1.5 in Python.

8 min read·Kodetra Technologies

10d ago

Nano Banana 2: Migrate Gemini Image Code by June 25

Intermediate

Nano Banana 2: Migrate Gemini Image Code by June 25

Gemini's image preview models die June 25. Swap to the Nano Banana 2 GA IDs with verified Python.

9 min read·Kodetra Technologies

10d ago

Loop Engineering: From Prompts to Verified Agent Loops

Intermediate

Loop Engineering: From Prompts to Verified Agent Loops

Build a plan-act-verify agent loop with an external check, retry budget, and clear stop rules.

9 min read·Kodetra Technologies

13d ago

Gemini Embedding 2: Multimodal RAG Over Your Images

Intermediate

Gemini Embedding 2: Multimodal RAG Over Your Images

Index and search images and text together with Gemini Embedding 2 File Search, no OCR.

9 min read·Kodetra Technologies

13d ago

Fable 5 Prompt Caching: Slash 1M-Token Codebase Costs

Intermediate

Fable 5 Prompt Caching: Slash 1M-Token Codebase Costs

Reuse a huge codebase prefix across every Fable 5 call and pay ~90% less.

8 min read·Kodetra Technologies

13d ago

Claude Programmatic Tool Calling: Cut Agent Token Costs

Intermediate

Claude Programmatic Tool Calling: Cut Agent Token Costs

Let Claude write code that calls your tools in a loop — 20–40% fewer tokens, same accuracy.

10 min read·Kodetra Technologies

14d ago

Page 1 of 4