Hi folks - first time poster; I did a solid search before posting and haven't been able to find a lead on a solution. I'm getting a proper response with Llama2 but an empty response with Vicuna and Claude, using identical LlamaCPP parameters. More details in the thread:
This is my LlamaCPP object:

self.llm = LlamaCPP(
    model_url=self.model_url,
    model_path=None,
    temperature=0.1,
    max_new_tokens=256,
    context_window=3900,
    generate_kwargs={},
    model_kwargs={"n_gpu_layers": 1},
    messages_to_prompt=messages_to_prompt,
    completion_to_prompt=completion_to_prompt,
    verbose=False,
)
I'm guessing it has to do with model_kwargs, the context window, or something along those lines, but honestly I have no idea where to start flipping switches
hmm, I think

  1. you probably need to change the messages_to_prompt and completion_to_prompt functions so that they format things properly for these models (the ones built in to llama-index are only for the llama2 format -- not sure about those other two models); rough sketch further down
  2. maybe change the global tokenizer to match the new models -- usually blank responses happen because the inputs got too big (the tokenizer is used to count tokens)
e.g. for llama2 I might do something like
Plain Text
from llama_index import set_global_tokenizer
from transformers import AutoTokenizer

set_global_tokenizer(
    AutoTokenizer.from_pretrained("NousResearch/Llama-2-7b-chat-hf").encode
)
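and for the first point, Vicuna-style models roughly expect a short system preamble followed by "USER: ... ASSISTANT: ..." turns -- something like the sketch below might work (the exact separators vary between Vicuna versions, so double-check the model card you're using; the vicuna_* function names here are just placeholders I made up):
Plain Text
SYSTEM_PROMPT = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's questions."
)

def vicuna_messages_to_prompt(messages):
    # messages is a sequence of ChatMessage objects; role should compare
    # equal to "system" / "user" / "assistant"
    prompt = SYSTEM_PROMPT + "\n\n"
    for message in messages:
        if message.role == "system":
            prompt = message.content + "\n\n"
        elif message.role == "user":
            prompt += f"USER: {message.content}\n"
        elif message.role == "assistant":
            prompt += f"ASSISTANT: {message.content}\n"
    # leave the prompt open for the assistant's next turn
    prompt += "ASSISTANT: "
    return prompt

def vicuna_completion_to_prompt(completion):
    return f"{SYSTEM_PROMPT}\n\nUSER: {completion}\nASSISTANT: "
then pass those in as messages_to_prompt / completion_to_prompt when building the LlamaCPP object, and swap the global tokenizer the same way as above but pointed at the matching model (e.g. AutoTokenizer.from_pretrained("lmsys/vicuna-7b-v1.5").encode)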
awesome, this gives me direction. I'll get spelunking and keep you posted πŸ˜‰