Guides

AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Opus 5 Effort Router: Auto-Pick Cost vs Capability

Intermediate

Opus 5 Effort Router: Auto-Pick Cost vs Capability

Route each task to the right Claude Opus 5 effort level and cut your token bill.

7 min read·Kodetra Technologies

Today

Kimi K3 Dynamic Tool Loading: Inject Tools On Demand

Intermediate

Kimi K3 Dynamic Tool Loading: Inject Tools On Demand

Load tools just-in-time in Kimi K3's agent loop to shrink prompts and sharpen tool choice.

8 min read·Kodetra Technologies

7d ago

Port Kimi K2 Agents to K3: The Reasoning-History Trap

Intermediate

Port Kimi K2 Agents to K3: The Reasoning-History Trap

kimi-k3 is not a drop-in swap. Map the params right and dodge the trap that breaks tool loops.

9 min read·Kodetra Technologies

10d ago

Muse Spark 1.1: Drop-In Agent, Self-Managed Context

Intermediate

Muse Spark 1.1: Drop-In Agent, Self-Managed Context

Point the OpenAI SDK at Meta's agent model, add tools, let it self-manage a 1M-token context.

10 min read·Kodetra Technologies

10d ago

GPT-5.6 Persisted Reasoning: Reuse Thinking Across Turns

Intermediate

GPT-5.6 Persisted Reasoning: Reuse Thinking Across Turns

Use reasoning.context to reuse GPT-5.6's chain of thought across turns and cut redundant tokens.

9 min read·Kodetra Technologies

11d ago

Clone GPT-Live: Full-Duplex Voice Agent That Delegates

Advanced

Clone GPT-Live: Full-Duplex Voice Agent That Delegates

Reproduce GPT-Live's full-duplex voice and background delegation using the GA Realtime API.

11 min read·Kodetra Technologies

12d ago

Prompt Cache Retention: Cut OpenAI Bills Up to 90%

Intermediate

Prompt Cache Retention: Cut OpenAI Bills Up to 90%

Structure prompts, set prompt_cache_retention, and read cached_tokens to slash GPT-5.6 input costs.

8 min read·Kodetra Technologies

16d ago

GPT-5.6 Orchestrates Tools in a V8 Sandbox (Python)

Advanced

GPT-5.6 Orchestrates Tools in a V8 Sandbox (Python)

Use GPT-5.6's Responses API so the model writes JavaScript to run your tools in one call.

10 min read·Kodetra Technologies

17d ago

Live X Search Agent: Real-Time Trend Tracking With Grok

Intermediate

Live X Search Agent: Real-Time Trend Tracking With Grok

Use Grok 4.5's server-side X Search on the xAI API to build a cited, real-time trend agent.

10 min read·Kodetra Technologies

17d ago

Grok 4.5 Tool Loops: Tune Effort and Cache to Cut Cost

Intermediate

Grok 4.5 Tool Loops: Tune Effort and Cache to Cut Cost

Build an agentic Grok 4.5 tool loop in Python: route reasoning_effort and cache to slash cost.

7 min read·Kodetra Technologies

17d ago

Read an LLM's Mind: Probe Hidden Thoughts Like J-Space

Machine Learning

Advanced

Read an LLM's Mind: Probe Hidden Thoughts Like J-Space

Train a linear probe on hidden activations and steer output, the method behind Anthropic's J-Lens.

9 min read·Kodetra Technologies

20d ago

Context Compaction: Run Claude Agents Past 200K Tokens

Intermediate

Context Compaction: Run Claude Agents Past 200K Tokens

Use Anthropic's compact-2026-01-12 beta so long agentic loops survive past the 200K context window.

11 min read·Kodetra Technologies

21d ago

Page 1 of 6