Writing an LLM from scratch, part 8 -- trainable self-attention
Archives
Categories
Blogroll
This is the eighth post in my trek through Sebastian Raschka's book
"Build a Large Language Model (from Scratch)".
I'm blogging about bits that grab my interest, and things I had to rack my
brains over, as a way
to get things straight in my own head -- and perhaps to help anyone else that
is working through it too. It's been almost a month since my
last update -- and
if you were suspecting that I was
blogging about blogging and spending time
getting LaTeX working on...
Read more at gilesthomas.com