Guides

AI Guides for Builders

How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.

Mercury 2 dLLM: Reasoning at 1000 Tokens Per Second

Intermediate

Mercury 2 dLLM: Reasoning at 1000 Tokens Per Second

Build real-time agents on the first reasoning diffusion LLM: OpenAI-compatible, 1000 tok/s.

8 min read·Kodetra Technologies

11d ago

Build a Deployment Simulation Eval to Catch Model Drift

Machine Learning

Intermediate

Build a Deployment Simulation Eval to Catch Model Drift

Replay real conversations through a candidate model to predict misbehavior before you ship.

11 min read·Kodetra Technologies

12d ago

Kimi K2.7 Code: Build a Multi-File Refactor Agent

Intermediate

Kimi K2.7 Code: Build a Multi-File Refactor Agent

Drive Moonshot's open-weight coding model through a real tool-calling loop in Python.

9 min read·Kodetra Technologies

14d ago

MiniMax M3: Master 1M-Token Long Context With MSA

Intermediate

MiniMax M3: Master 1M-Token Long Context With MSA

MiniMax M3 hands-on: MSA sparse attention plus real 1M-token long context, with runnable Python.

10 min read·Kodetra Technologies

17d ago

Gemma 4 Tool Calling: Build a Local AI Agent

Intermediate

Gemma 4 Tool Calling: Build a Local AI Agent

Run Google's open Gemma 4 locally with Ollama and wire up real function calling for an agent.

10 min read·Kodetra Technologies

19d ago

MiniMax M3 Tool Calling: Build an Agentic Loop in Python

Intermediate

MiniMax M3 Tool Calling: Build an Agentic Loop in Python

Wire MiniMax M3's OpenAI-compatible API into a real tool-calling agent loop.

8 min read·Kodetra Technologies

21d ago

Dreaming V3 Explained: Build Sleep-Time Memory in Python

Intermediate

Dreaming V3 Explained: Build Sleep-Time Memory in Python

Recreate ChatGPT's new Dreaming V3 memory: a background job that learns and forgets.

10 min read·Kodetra Technologies

24d ago

Claude Opus 4.8 Effort Levels: A Hands-On Python Guide

Intermediate

Claude Opus 4.8 Effort Levels: A Hands-On Python Guide

Tune token spend on Opus 4.8 with the effort parameter. Runnable Python, real I/O, real numbers.

7 min read·Kodetra Technologies

29d ago

Gemini 3.5 Flash Function Calling: Build a Tool-Using Agent

Intermediate

Gemini 3.5 Flash Function Calling: Build a Tool-Using Agent

Wire Gemini 3.5 Flash to your own Python functions and run a real multi-step agent loop.

8 min read·Kodetra Technologies

May 25

Build a ReAct Agent from Scratch in Python (No Framework)

Intermediate

Build a ReAct Agent from Scratch in Python (No Framework)

Reason + Act loop, tool routing, retries — implement a real agent in 200 lines of Python.

9 min read·Kodetra Technologies

May 2

Server-Sent Events in Go: A Production Deep Dive

Advanced

Server-Sent Events in Go: A Production Deep Dive

Build a robust SSE service in Go with backpressure, reconnects, fan-out, and graceful shutdown.

12 min read·Kodetra Technologies

Apr 27

Cursor 2.0 Parallel Agents: Ship Features 8x Faster

Beginner

Cursor 2.0 Parallel Agents: Ship Features 8x Faster

Run up to 8 AI agents in parallel in Cursor 2.0 to finish features in a fraction of the time.

4 min read·Kodetra Technologies

Apr 23