1 min read
4.3 · Co-locating Online and Offline Work

Series stub — full post TBD. This page exists so the series shape is reviewable.

Planned focus: Serving is provisioned for peak and idle most of the day; backfill it with batch without hurting latency.


Part of “Inside AI Infrastructure: The Compute Layer.” Opinions are my own; public, documented concepts only.