The Kaitchup – AI on a Budget
Subscribe
Sign in
Share this post
The Kaitchup – AI on a Budget
Simple, Fast, and Memory-Efficient Inference for Mistral 7B with Activation-Aware Quantization (AWQ)
Copy link
Facebook
Email
Notes
More
Simple, Fast, and Memory-Efficient Inference…
Benjamin Marie
Nov 21, 2023
7
Share this post
The Kaitchup – AI on a Budget
Simple, Fast, and Memory-Efficient Inference for Mistral 7B with Activation-Aware Quantization (AWQ)
Copy link
Facebook
Email
Notes
More
Using AWQ models with Hugging Face Transformers
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
Simple, Fast, and Memory-Efficient Inference…
Share this post
Using AWQ models with Hugging Face Transformers