AI Infrastructure.
What it actually costs to run AI in production: GPUs, hosting trade-offs, and the situations where rolling your own model is worth the operational tax. Written for operators making real purchase decisions, not for vendor decks.
Anthropic just shipped Claude Fable 5. What it means for a small business.
Anthropic's most powerful public model is here, and the benchmarks are real: a 50-million-line code migration in a day, state-of-the-art vision and coding. But it costs $10/$50 per million tokens. Here's the honest read for a small business, where the frontier model is rarely the one you should be paying for.
Apple put Google's Gemini inside Siri. What WWDC 2026 means if you run a business.
The headline from Apple's WWDC 2026 keynote isn't a feature; it's a partnership. The new Siri is powered by Google's Gemini. The shift is smaller than the hype, but it matters in one direction. Here's what to take from it, and what to ignore.
Computex 2026 was all about 'AI PCs.' Should your small business buy one?
Taipei spent a week selling on-device AI: NVIDIA's RTX Spark, Intel's Arc G3 and Nova Lake tease, AMD's new X3D chips, a wall of 'AI laptops.' Here's the read for an owner deciding whether to refresh hardware in 2026, and why the memory shortage matters more than the AI badge.
NVIDIA's two 'Sparks': the DGX Spark and the new RTX Spark, explained
NVIDIA now has two 'Spark' machines, and people keep mixing them up. The DGX Spark is a shipping $4,000 AI-developer box; the RTX Spark, announced at Computex 2026, is a Windows-on-Arm consumer platform landing this fall. Here's what each one is, how they compare to an RTX 5090, Mac Studio, and Strix Halo, and whether either makes you more productive.
Google I/O 2026, translated for small business owners
Google's I/O 2026 keynote was three hours of AI. Most of it won't touch a small business this week. But three things will: dramatically cheaper capable models, AI Mode search crossing a billion users, and agentic shopping. Here's the signal, and what to do about it.
PHIPA-compliant AI for Ontario clinics: a build checklist
What it actually takes to deploy AI inside an Ontario healthcare practice without violating PHIPA. A practical checklist for clinic operators and the developers they hire.
The real total cost of running your own LLM in 2026
GPU sticker prices are only one line on the invoice. A breakdown of every cost center for self-hosted inference, with current 2026 numbers, so you can decide whether the math works for your use case.
AI coding assistants, one year in
A studio that ships code every day takes an honest accounting of what AI coding tools changed, what they didn't, and what it means if you're paying for them, or paying a shop that uses them.
The actual break-even point for running LLMs yourself
Cloud APIs are cheap at small volume and expensive at large volume. Here's the rough math on where self-hosting starts to pay off, what it actually costs, and the common mistakes.
What it actually costs to run AI on your own hardware
A plain-English breakdown of the hardware, GPU, memory, and cloud costs involved in running AI inference, for operators who want a number, not a whitepaper.
Where AI development sits in 2026
An honest read on where AI development is in 2026: what's actually in production, what's still vaporware, and what it means for the kind of businesses we build software for.
When local AI deployment is the right answer
There are specific situations where sending data to a cloud AI provider isn't viable. Here's when local deployment is worth the extra work and when it isn't.
Have something that needs shipping?
One call. Thirty minutes. You leave with an honest read on scope, timeline, and price, whether we're the right fit or not.