--- license: llama3.1 --- Sources: https://www.reddit.com/r/LocalLLaMA/comments/1ebnkds/llamacpp_android_users_now_benefit_from_faster/ https://github.com/ggerganov/llama.cpp/blob/master/docs/build.md * For Q4_0_4_4 quantization type build, add the GGML_NO_LLAMAFILE=1 flag. For example, use make GGML_NO_LLAMAFILE=1.