Tutorials Run a Local LLM on Android with llama.cpp + Vulkan
Compile llama.cpp with Vulkan in Termux and run a quantized LLM on your Android GPU, no root.
How-to content for builders, indie hackers, and AI engineers. Less theory, more shipped code.
Tutorials Build a multi-step tool-calling agent on Moonshot's open-weight Kimi K2.6 model.
Tutorials Recreate ChatGPT's new Dreaming V3 memory: a background job that learns and forgets.
Tutorials Deploy Microsoft's new reasoning model and build a tool-calling triage agent.
Tutorials Collapse agent tool-call loops into one sandboxed Python program and cut latency in half.
Tutorials Build agents that remember across sessions with Claude's /memories tool — full Python tutorial.
Tutorials Deploy any agent framework to Microsoft Foundry's managed sandbox runtime (Build 2026 launch).
Tutorials Call Microsoft's June 2 coding model via OpenRouter for cheap, fast refactors.
Tutorials Install grok-build-0.1, run plan mode, stream JSON in CI, and call the API from Python.
Tutorials Build a research-write-review multi-agent pipeline using Microsoft Agent Framework 1.0 in Python.
Tutorials Generate AI video in Python with Veo 3.1 — the model powering Google's Omni Flash launch.
Tutorials Use Claude Code dynamic workflows to fan out and cross-check critical work.