Let’s reproduce NanoGPT with JAX! (Part 1)
7 min read
CAT
August 4, 2024
Louis Wang
Part 1: Build 124M GPT2 with JAX. Part 2: Optimize the training speed in Single...