Llama

I'm sure I'm not the only one asking this, but is there support for a local install of Alpaca for LlamaIndex? I hate spending money each time I run a query.
14 comments
LlamaIndex supports any LLM, but it's up to you to write the code that passes the text to the model and returns the newly generated tokens.

Here's a small example with Google's FLAN and Hugging Face:

https://github.com/jerryjliu/llama_index/issues/544
I think I need to add this to the docs lol 😆
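To make the "write the code yourself" part concrete: LlamaIndex essentially needs a callable that maps a prompt string to a completion string, and any local model can sit behind that. The sketch below is illustrative, not LlamaIndex's actual API (the class name `LocalLLM` is made up, and the real integration in the linked issue uses LlamaIndex's own classes); the FLAN-T5 calls are shown in comments so the sketch runs without downloading weights.

```python
# Sketch of wrapping a local model behind a prompt -> completion interface.
# LocalLLM and generate_fn are illustrative names, not LlamaIndex API;
# see the linked GitHub issue for a full working FLAN example.

class LocalLLM:
    def __init__(self, generate_fn):
        # generate_fn: str prompt -> str completion, backed by any local model.
        self.generate_fn = generate_fn

    def complete(self, prompt: str) -> str:
        return self.generate_fn(prompt)

# With Hugging Face's FLAN-T5, the callable would look like this
# (commented out so the sketch runs without a model download):
# from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
# tok = AutoTokenizer.from_pretrained("google/flan-t5-base")
# model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")
# def flan_generate(prompt):
#     ids = tok(prompt, return_tensors="pt").input_ids
#     out = model.generate(ids, max_new_tokens=64)
#     return tok.decode(out[0], skip_special_tokens=True)

# Stub model so the wrapper can be exercised without a GPU:
llm = LocalLLM(lambda p: f"echo: {p}")
print(llm.complete("hello"))  # -> echo: hello
```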
I was about to ask the same thing!
One more vote to add it to the docs then lol
But also, is there a recommended local embedding model for generating indexes?
Or do we have to use "text-embedding-ada-002"?
It will probably take some experimenting. Sentence transformers is a good start: https://huggingface.co/sentence-transformers/all-mpnet-base-v2

Some docs on customizing embeddings here: https://gpt-index.readthedocs.io/en/latest/how_to/embeddings.html#custom-embeddings
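Whatever embedding model you pick, the index ultimately just ranks documents by vector similarity to the query. A minimal sketch of that retrieval step, with toy vectors standing in for real embeddings (the sentence-transformers call that would produce real ones is commented out so the sketch runs without the model download):

```python
import numpy as np

# The real embedding step would be (commented out to avoid the download):
# from sentence_transformers import SentenceTransformer
# model = SentenceTransformer("sentence-transformers/all-mpnet-base-v2")
# doc_vecs = model.encode(docs)          # shape (n_docs, 768)
# query_vec = model.encode([query])[0]   # shape (768,)

def top_k(query_vec, doc_vecs, k=2):
    """Rank documents by cosine similarity to the query, best first."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    sims = d @ q
    return np.argsort(-sims)[:k]

# Toy 2-d vectors standing in for real embeddings:
docs = np.array([[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]])
query = np.array([0.9, 0.1])
print(top_k(query, docs))  # indices of the nearest documents, best first
```

Swapping ada-002 for a local model only changes how the vectors are produced; this ranking step stays the same.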
thanks, that's defo something for the docs! I see Alpaca 30B was released on Hugging Face recently; with some training, it would be my go-to for this stuff
it should run nicely in Int4 on a consumer GPU
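Rough arithmetic behind the "Int4 on a consumer GPU" claim (weights only; the KV cache and activations add overhead on top, and the 24 GB card figure below is my assumption, not from the thread):

```python
# Back-of-envelope weight memory for a 30B-parameter model at
# different precisions. Weights only; runtime overhead not included.
def weight_gib(n_params: float, bits: int) -> float:
    return n_params * bits / 8 / 2**30

n = 30e9
for bits in (16, 8, 4):
    print(f"int{bits}: {weight_gib(n, bits):.1f} GiB")
# 4-bit weights of a 30B model come to ~14 GiB, so a 24 GB consumer
# card can hold them, while fp16 (~56 GiB) cannot.
```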
Does it make sense to move from text-embedding-ada-002 to other models, like those proposed by langchain? Do they perform better/faster?
What is langchain proposing? I haven't seen that 👀

Personally, ada is pretty good: the input size is huge (8,191 tokens), and it's super cheap.
Lots of companies (in Europe) are not comfortable sending their data to 3rd parties
having a fully local system has a lot of value