A. Géron, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd ed. Sebastopol, CA: O’Reilly Media, 2019.
S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach, 4th ed. Upper Saddle River, NJ: Pearson, 2020.
K. P. Murphy, Machine Learning: A Probabilistic Perspective. Cambridge, MA: MIT Press, 2012.
Chapter 5
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention Is All You Need,” in Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, Dec. 2017, pp. 5998–6008. [Online]. Available: https://arxiv.org/pdf/1706.03762.
V. Sanh et al., “Multitask Prompted Training Enables Zero-Shot Task Generalization,” in Proceedings of the 10th International Conference on Learning Representations (ICLR 2022), Virtual Event, Apr. 2022. [Online]. Available: https://arxiv.org/pdf/2110.08207.