A community member is experiencing hallucination issues when using the 4-bit quantized Llama 2 70B model and is seeking advice on how to prompt it better or fine-tune it. Other community members suggest wrapping prompts in the [INST] tags and the BOS/EOS tokens, since the llama2-chat format is quite strict. They provide a sample format (sketched below) and mention that the llama_index library ships utility functions that apply this formatting, which may be helpful when implementing a custom LLM class (see the second sketch below).
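
For reference, here is a minimal sketch of the strict llama2-chat turn format. The special token strings ([INST], <<SYS>>, <s>, </s>) come from Meta's reference implementation; the `build_prompt` helper itself is a hypothetical illustration, not part of any library:

```python
# Special strings from the Llama 2 chat format: BOS/EOS wrap each turn,
# [INST] ... [/INST] wraps the user message, and the <<SYS>> block holds
# the system prompt inside the first instruction.
BOS, EOS = "<s>", "</s>"
B_INST, E_INST = "[INST]", "[/INST]"
B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>\n\n"

def build_prompt(system_prompt: str, user_message: str) -> str:
    """Format a single-turn llama2-chat prompt (hypothetical helper)."""
    # Note: depending on your stack, the tokenizer may add <s>/</s> as
    # special tokens automatically, in which case they should not also
    # appear as literal text in the prompt string.
    return f"{BOS}{B_INST} {B_SYS}{system_prompt}{E_SYS}{user_message} {E_INST}"

prompt = build_prompt(
    "You are a helpful assistant. Answer only from the given context.",
    "Summarize the quarterly report.",
)
print(prompt)
```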
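
And here is a sketch of the llama_index utility functions mentioned in the thread, assuming the pre-0.10 package layout: `messages_to_prompt` and `completion_to_prompt` from `llama_index.llms.llama_utils` apply the [INST]/<<SYS>>/BOS/EOS formatting for you. They are shown here passed to `LlamaCPP`, but the same helpers can be called from a custom LLM class's `complete` method; the model path is a hypothetical local file:

```python
from llama_index.llms import LlamaCPP
from llama_index.llms.llama_utils import (
    messages_to_prompt,
    completion_to_prompt,
)

llm = LlamaCPP(
    model_path="./llama-2-70b-chat.Q4_K_M.gguf",  # hypothetical local path
    temperature=0.1,
    max_new_tokens=256,
    context_window=4096,
    # These helpers wrap every request in the strict llama2-chat format,
    # which is often enough to curb hallucination with the chat models.
    messages_to_prompt=messages_to_prompt,
    completion_to_prompt=completion_to_prompt,
    verbose=True,
)

print(llm.complete("What causes hallucination in LLMs?").text)
```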