The Blog

Nova 2 Lite Leading Video Analysis Benchmarks

24th March 2026 · 1 min read

Nova 2 Lite leads overall on one of the most comprehensive video analysis benchmarks across frontier multimodal models. The benchmark evaluates real-world, media-aligned tasks such as video tagging, summarization, and...

OpenClaw: Event-Driven Architecture for Agents

18th March 2026 · 1 min read

Yesterday a friend asked me what I thought about OpenClaw. My TL;DR was that it's an event-driven architecture that runs agents in a loop with flexible persistence. Then I drew...

Tmux + Claude Code Agent Teams

15th March 2026 · 1 min read

Orchestrating agent swarms is cool, but there's a lot of overhead and debugging involved. The ability to chat with your lead agents in real time while they continue to work...

KV Cache, FlashAttention, and Attention Variants

11th March 2026 · 1 min read

Let's talk KV cache: why it exists, why it's usually disabled during training, how it connects to FlashAttention, and how related attention variants like MHA, MQA, and GQA fit into...

Bigger Context Windows Don't Mean Infinite Memory

7th March 2026 · 1 min read

Bigger context windows don't mean infinite memory. Here's a TL;DR on AI context and a few practical tips for working within context limits.