Yifan's Learning Journey
625

read Build a Large Language Model (From Scratch)

  • 1. Understanding large language models
  • 2. Working with text data
  • 3. Coding attention mechanisms
  • 4. Implementing a GPT model from scratch to generate text
  • 5. Pretraining on unlabeled data
  • 6. Fine-tuning for classification
  • 7. Fine-tuning to follow instructions