Quantizing LLMs with PyTorch and Hugging Face
Optimize Memory and Speed for Large Language Models with Advanced Quantization Techniques
5.00 (4 reviews)

739 students
2 hours of content
Last update: Nov 2024
Regular price: $13.99
What you will learn
Gain an intuitive understanding of linear quantization
Learn different linear quantization techniques
Learn at a high level how 2-bit and 4-bit quantization work
Learn how to quantize LLMs from Hugging Face
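
For readers who want a feel for the first topic, linear quantization can be sketched in a few lines of plain Python. This is an illustrative sketch and is not taken from the course materials: the function names `linear_quantize` and `linear_dequantize`, the sample weights, and the choice of an asymmetric (scale plus zero-point) scheme over signed integers are all assumptions made for the example. A real workflow would more likely operate on PyTorch tensors.

```python
def linear_quantize(values, num_bits=8):
    """Asymmetric linear quantization of floats to signed integers.

    Hypothetical helper for illustration; returns (quantized, scale, zero_point).
    """
    q_min = -(2 ** (num_bits - 1))
    q_max = 2 ** (num_bits - 1) - 1
    r_min, r_max = min(values), max(values)
    if r_min == r_max:
        raise ValueError("values must span a non-zero range")

    # Map the real range [r_min, r_max] onto the integer range [q_min, q_max].
    scale = (r_max - r_min) / (q_max - q_min)
    zero_point = int(round(q_min - r_min / scale))
    zero_point = max(q_min, min(q_max, zero_point))

    quantized = [
        max(q_min, min(q_max, int(round(v / scale + zero_point))))
        for v in values
    ]
    return quantized, scale, zero_point


def linear_dequantize(quantized, scale, zero_point):
    """Recover approximate real values from the integer representation."""
    return [scale * (q - zero_point) for q in quantized]


# Made-up weights, used only to compare 8-bit and 4-bit precision.
weights = [-1.2, -0.3, 0.0, 0.45, 2.1]

q8, s8, z8 = linear_quantize(weights, num_bits=8)
r8 = linear_dequantize(q8, s8, z8)
err8 = max(abs(a - b) for a, b in zip(weights, r8))

q4, s4, z4 = linear_quantize(weights, num_bits=4)
r4 = linear_dequantize(q4, s4, z4)
err4 = max(abs(a - b) for a, b in zip(weights, r4))

print("8-bit:", q8, "max error:", err8)
print("4-bit:", q4, "max error:", err4)
```

With fewer bits the step size (`scale`) grows, so the reconstruction error grows with it. That trade-off between memory and precision is what lower-bit schemes have to manage.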
Udemy ID: 6287745
Course created date: 14/11/2024
Course indexed date: 18/11/2024
Course submitted by: Bot