Embedding models are specifically designed to take text and create a numerical representation of it (i.e. a vector).
Then when you query something, the query text is embedded the same way, cosine similarity is used to find the most similar nodes, and those nodes are sent to the LLM as context to answer the query.
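Roughly, the retrieval step looks like this. This is just a minimal sketch, the model name and example texts are placeholders:

```python
# Minimal sketch of embedding + cosine-similarity retrieval.
from sentence_transformers import SentenceTransformer
import numpy as np

# Any local embedding model works; this one is just an example.
model = SentenceTransformer("all-MiniLM-L6-v2")

# Embed the "nodes" (chunks of your documents) once, up front.
nodes = [
    "The capital of France is Paris.",
    "Embedding models map text to vectors.",
    "LLMs answer questions using provided context.",
]
node_vecs = model.encode(nodes, normalize_embeddings=True)

# Embed the query the same way, then rank nodes by cosine similarity.
query_vec = model.encode(["How do embedding models work?"], normalize_embeddings=True)[0]
scores = node_vecs @ query_vec  # dot product == cosine similarity since vectors are normalized
top_k = np.argsort(scores)[::-1][:2]

# The top-scoring nodes become the context that gets sent to the LLM.
context = "\n".join(nodes[i] for i in top_k)
print(context)
```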
In the link I sent, it downloads and runs a local embedding model from Hugging Face.
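If you're using LlamaIndex (I'm assuming that from the "nodes" terminology), loading a local Hugging Face embedding model is just a couple of lines; the model name here is only an example:

```python
# Sketch: use a locally downloaded Hugging Face model as the embedding model.
from llama_index.core import Settings
from llama_index.embeddings.huggingface import HuggingFaceEmbedding

# Downloads the model from the Hugging Face hub and runs it locally.
embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")

# Make it the default embedding model for indexing and querying.
Settings.embed_model = embed_model

# Quick check: embed a piece of text and look at the vector.
vec = embed_model.get_text_embedding("Hello world")
print(len(vec), vec[:5])
```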