Appendix A. References and further reading

 

A.1 Chapter 1

A.1.1 References

The announcement article of OpenAI's o1 model, which is regarded as the first LLM-based reasoning model:

DeepSeek-R1 is the first open-source reasoning model that was accompanied by a comprehensive technical report, which was the first to show that reasoning emerges from reinforcement learning with verifiable rewards (a topic covered in more detail in chapter 5):

OpenAI CEO’s comment on the reasoning ("chain-of-thought") capabilities of future models:

A research paper by AI researchers at Apple finding that reasoning models are sophisticated (but very capable) pattern matchers:

An in-depth book and guide on implementing and training large language models step-by-step:

A.1.2 Further Reading

An introduction to how DeepSeek-R1 works, providing insights into the foundations of reasoning in LLMs:

A.2 Chapter 2

A.2.1 References

A.2.2 Further Reading