output = llm(
"Once upon a time,",
max_tokens=512,
echo=True
)
print(output)ggml_bakllava-1
This repo contains GGUF files to inference BakLLaVA-1 with llama.cpp end-to-end without any extra dependency.
Note: The mmproj-model-f16.gguf file structure is experimental and may change. Always use the latest code in llama.cpp.
- Downloads last month
- 472
Hardware compatibility
Log In to add your hardware
16-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="mys/ggml_bakllava-1", filename="", )