Log in
Log into community
Find answers from the community
View all posts
Related posts
Was this helpful?
π
π
π
Powered by
Hall
Inactive
Updated 2 years ago
0
Follow
Quantized LLama2
Quantized LLama2
Inactive
0
Follow
At a glance
C
Cool
2 years ago
Β·
Hello Experts,
How to use llama_index with quantized llama2 models?
E
L
5 comments
Share
Open in Discord
E
Emanuel Ferreira
2 years ago
https://gpt-index.readthedocs.io/en/latest/examples/llm/llama_2_llama_cpp.html
E
Emanuel Ferreira
2 years ago
Almost the same thing, but then you would use a Hugging Face quantized model
https://huggingface.co/TheBloke/Llama-2-7B-GGML
E
Emanuel Ferreira
2 years ago
maybe @Logan M can validate this
L
Logan M
2 years ago
Yea, you'll want to use llama cpp for ggml or gguf files
https://gpt-index.readthedocs.io/en/stable/examples/llm/llama_2_llama_cpp.html
Huggingface also supports normal quantization using bitsandbytes or gptq
E
Emanuel Ferreira
2 years ago
Quantized LLama2
Add a reply
Sign up and join the conversation on Discord
Join on Discord