chapter seven
7 The Pivot to Reasoning
This chapter covers
- Variational Lossy Autoencoders
- Relation Networks on CLEVR and bAbI
- Message Passing Neural Networks on QM9
- Relational Memory Core for sequential reasoning across benchmarks
- Paper versus Living Doubts
Rather than scaling to GPT-5 with trillions of parameters, OpenAI invested in techniques such as “test-time compute,” which involves models “thinking” more during inference.[1] This pivot reflected a broader realization across the field that scaling pretraining alone was yielding diminishing returns, particularly on tasks requiring reasoning. In fact, even the loudest champions of scale began discussing its limits. In 2024, Ilya said that “the 2010s were the age of scaling; now we’re back in the age of wonder and discovery once again.” Sutskever added, “Scaling the right thing matters more now than ever.”[2]