Hacker News
Gemma 3 QAT (Quantization-Aware Training): 3x less memory
(huggingface.co)
5 points by philschmidxxx 10 months ago | 2 comments
bigdict 10 months ago
Amazing, I've been wishing for this! Do you have any estimates of how much accuracy is lost by naive quantization, and how much of that QAT recovers, relative to the original bf16 model?
bigdict 10 months ago
Thank you so much for continuing to support Gemma 3 with these updates.