gradio llama-cpp-python scikit-learn faiss-cpu huggingface-hub