Run Llama 2 70B on Your GPU with ExLlamaV2
September 29, 2023
Benjamin Marie
Finding the optimal mixed-precision quantization for your hardware
Continue reading on Towards Data Science