AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Kimi K3 Forced Retrieval: Ground Answers in Python

Intermediate

Use Kimi K3's tool_choice=required with strict JSON so agents fetch data before they answer.

9 min read·Kodetra Technologies

Today

Opus 5 Effort Router: Auto-Pick Cost vs Capability

Intermediate

Route each task to the right Claude Opus 5 effort level and cut your token bill.

7 min read·Kodetra Technologies

Today

Gemini 3.6 Flash: Cut Agent Tokens With thinking_level

Intermediate

Build a token-thrifty tool-calling agent on Gemini 3.6 Flash using the new thinking_level control.

10 min read·Kodetra Technologies

5d ago

Kimi K3 Partial Mode: Prefill Replies to Force Output

Intermediate

Prefill Kimi K3's assistant turn to lock output shape, force clean JSON, and steer tone.

9 min read·Kodetra Technologies

6d ago

Kimi K3 Reasoning Effort: Stream Thoughts, Cut Cost

Intermediate

Tune Kimi K3's low/high/max reasoning effort and stream reasoning_content to control token cost.

9 min read·Kodetra Technologies

6d ago

Kimi K3 Dynamic Tool Loading: Inject Tools On Demand

Intermediate

Load tools just-in-time in Kimi K3's agent loop to shrink prompts and sharpen tool choice.

8 min read·Kodetra Technologies

7d ago

Kimi K3 Vision: Extract Structured Data From PDFs in Python

Intermediate

Use Kimi K3's native vision and JSON schema output to turn messy PDF pages into clean, typed data.

9 min read·Kodetra Technologies

9d ago

Port Kimi K2 Agents to K3: The Reasoning-History Trap

Intermediate

kimi-k3 is not a drop-in swap. Map the params right and dodge the trap that breaks tool loops.

9 min read·Kodetra Technologies

10d ago

Kimi K3: Query a Million-Token Repo for Pennies

Intermediate

Point the OpenAI SDK at Moonshot's 2.8T K3, load a whole repo, and cut cost with caching.

8 min read·Kodetra Technologies

10d ago

Muse Spark 1.1: Drop-In Agent, Self-Managed Context

Intermediate

Point the OpenAI SDK at Meta's agent model, add tools, let it self-manage a 1M-token context.

10 min read·Kodetra Technologies

10d ago

GPT-5.6 Persisted Reasoning: Reuse Thinking Across Turns

Intermediate

Use reasoning.context to reuse GPT-5.6's chain of thought across turns and cut redundant tokens.

9 min read·Kodetra Technologies

11d ago

Run Bonsai 27B on Your Phone: 1-Bit Local AI

Machine Learning

Intermediate

Run the first 27B-class model on a phone: MLX, llama.cpp, tool calls, and the memory math.

10 min read·Kodetra Technologies

11d ago
