What are LLMs doing, really?

Lately, I’ve been diving into the science? (it’s mostly linear algebra) behind large language models. It started as a random side quest, but then I found an article by Technically that explains the basics surprisingly well. I’m attaching it as a PDF file here.

Credit: Technically