AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Opus 5 Effort Router: Auto-Pick Cost vs Capability

Intermediate

Route each task to the right Claude Opus 5 effort level and cut your token bill.

7 min read·Kodetra Technologies

Today

Kimi K3 Partial Mode: Prefill Replies to Force Output

Intermediate

Prefill Kimi K3's assistant turn to lock output shape, force clean JSON, and steer tone.

9 min read·Kodetra Technologies

6d ago

Kimi K3 Reasoning Effort: Stream Thoughts, Cut Cost

Intermediate

Tune Kimi K3's low/high/max reasoning effort and stream reasoning_content to control token cost.

9 min read·Kodetra Technologies

7d ago

Kimi K3 Dynamic Tool Loading: Inject Tools On Demand

Intermediate

Load tools just-in-time in Kimi K3's agent loop to shrink prompts and sharpen tool choice.

8 min read·Kodetra Technologies

7d ago

Kimi K3 Vision: Extract Structured Data From PDFs in Python

Intermediate

Use Kimi K3's native vision and JSON schema output to turn messy PDF pages into clean, typed data.

9 min read·Kodetra Technologies

10d ago

Port Kimi K2 Agents to K3: The Reasoning-History Trap

Intermediate

kimi-k3 is not a drop-in swap. Map the params right and dodge the trap that breaks tool loops.

9 min read·Kodetra Technologies

10d ago

Muse Spark 1.1: Drop-In Agent, Self-Managed Context

Intermediate

Point the OpenAI SDK at Meta's agent model, add tools, let it self-manage a 1M-token context.

10 min read·Kodetra Technologies

11d ago

Run Bonsai 27B on Your Phone: 1-Bit Local AI

Machine Learning

Intermediate

Run the first 27B-class model on a phone: MLX, llama.cpp, tool calls, and the memory math.

10 min read·Kodetra Technologies

11d ago

Verifiers v1: Train Agents Past the Context Window

Machine Learning

Advanced

Ship a taskset, swap any harness, and turn compacted rollouts into real RL training samples.

13 min read·Kodetra Technologies

13d ago

Prompt Cache Retention: Cut OpenAI Bills Up to 90%

Intermediate

Structure prompts, set prompt_cache_retention, and read cached_tokens to slash GPT-5.6 input costs.

8 min read·Kodetra Technologies

17d ago

SWE-Together: Grade Coding Agents on Multi-Turn Sessions

Machine Learning

Intermediate

Build a SWE-Together-style multi-turn coding-agent eval with an LLM user simulator in Python.

10 min read·Kodetra Technologies

20d ago

Read an LLM's Mind: Probe Hidden Thoughts Like J-Space

Machine Learning

Advanced

Train a linear probe on hidden activations and steer output, the method behind Anthropic's J-Lens.

9 min read·Kodetra Technologies

20d ago
