Find answers from the community

Updated 6 months ago

Load in 8bit

At a glance

hey all - is there a way to load huggingface models (local) in 8bit? i don't see the param in HuggingFaceLLMPredictor (it's a param in the transformers AutoModelForCausalLM)

1 comment

LLogan M

You can pass this as a part of the model_kwargs, or you can load the model yourself and pass that in too if that's easier

https://gpt-index.readthedocs.io/en/latest/reference/llm_predictor.html

Add a reply