I'm starting a new low-level YouTube series called "Handcrafted Transformers." It introduces how large language models like ChatGPT are built by coding them from scratch. What makes the series unique is that we work out the neural net weights by hand (using math), rather than training them with GPUs and calculus. https://youtu.be/PvutWdJnwJ8
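
To give a feel for what "weights by hand" can mean, here is a toy sketch. It is not taken from the video and it is not a transformer: it is a tiny two-layer ReLU network that computes XOR, with every weight picked by reasoning instead of a training loop. The names `xor_net`, `W1`, `B1`, and `W2`, and the specific weight values, are my own assumptions for illustration.

```python
# Toy illustration (an assumption, not code from the series): a 2-layer ReLU
# network whose weights are chosen by reasoning, with no training or gradients.

def relu(z):
    return max(0.0, z)

# Hand-picked hidden-layer weights and biases (hypothetical values).
W1 = [[1.0, 1.0],   # h1 = relu(x1 + x2)
      [1.0, 1.0]]   # h2 = relu(x1 + x2 - 1), fires only when both inputs are 1
B1 = [0.0, -1.0]

# Hand-picked output weights: y = h1 - 2 * h2
W2 = [1.0, -2.0]

def xor_net(x1, x2):
    # Hidden layer: weighted sum plus bias, then ReLU.
    hidden = [relu(w[0] * x1 + w[1] * x2 + b) for w, b in zip(W1, B1)]
    # Output layer: plain weighted sum of the hidden activations.
    return sum(w * h for w, h in zip(W2, hidden))

for a in (0, 1):
    for b in (0, 1):
        print(a, b, xor_net(a, b))
```

The reasoning behind the weights: `h1` counts how many inputs are on, `h2` switches on only when both are, and subtracting `2 * h2` cancels the (1, 1) case so the output is 1 exactly when one input is on.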