Series stub — full post TBD. This page exists so the series shape is reviewable.
Planned focus: Sharing one GPU concurrently (MPS) or by turns (time-slicing) — higher utilization, weaker isolation.
Part of “Inside AI Infrastructure: The Compute Layer.” Opinions are my own; public, documented concepts only.