worth reading the original paper alongside the blog post. I think the ppaper has details the blog post glosses over, particularly around the calibration-free quantization approach and how they handle outlier channels.
Interestingly: the research sits on arXiv for a year, nobody talks about it