gpt2-from-scratch Building GPT2 from scratch to understand how it works Code was implemented based on Andej Karpathy's video