
https://github.com/ggml-org/llama.cpp/blob/master/tools/quantize/README.md