We’ve celebrated an extraordinary breakthrough while largely postponing the harder question: whether the architecture we’re scaling can sustain the use cases promised for it.
Artificial intelligence has been bottlenecked less by raw compute than by how quickly models can move data in and out of memory. A new generation of memory-centric designs is starting to change that, ...