llama.cpp cli supports it but llama-cpp-python don't and we need it
llama.cpp cli supports it but llama-cpp-python don't and we need it