CAT
November 27, 2024
Pere Martra This article explores a structured pruning technique for state-of-the-art models, that uses a GLU architecture,...