The Kaitchup – AI on a Budget
Run Llama 2 70B on Your GPU with ExLlamaV2
Benjamin Marie
Sep 27, 2023
Finding the optimal mixed-precision quantization for your hardware