Find answers from the community

Updated 10 months ago

Is there some way by which I can use Euclidean distance for similarity rather than cosine for retrieval using LlamaIndex?

At a glance

The community member is asking if there is a way to use Euclidean distance for similarity rather than cosine similarity when using the LlamaIndex library. In the comments, another community member suggests that the choice of distance metric depends on the embeddings model being used, and provides an example of using Euclidean distance with the Pinecone vector store. Another community member adds that if using a local embeddings model, the embedding class can be subclassed to change the similarity mode.

Useful resources
Is there some way by which I can use Euclidean distance for similarity rather than cosine for retrieval using LlamaIndex?
2 comments
Which embeddings model are you using? OpenAI, for example, recommends using cosine similarity: https://help.openai.com/en/articles/8984345-which-distance-function-should-i-use
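One reason the metric often matters less than expected: for unit-normalized embeddings (as OpenAI's are), ranking by highest cosine similarity and ranking by lowest Euclidean distance are equivalent, because ||u - v||² = 2 - 2·cos(u, v) when ||u|| = ||v|| = 1. A small self-contained sketch with toy vectors (all names and values below are illustrative, not from any library):

```python
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def euclidean_distance(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def normalize(v):
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

# Toy "embeddings": a query and two candidates, unit-normalized
docs = {
    "a": normalize([1.1, 2.0, 2.9]),   # close to the query
    "b": normalize([3.0, -1.0, 0.5]),  # far from the query
}
query = normalize([1.0, 2.0, 3.0])

# Rank by highest cosine similarity vs. lowest Euclidean distance
by_cosine = sorted(docs, key=lambda d: -cosine_similarity(query, docs[d]))
by_euclid = sorted(docs, key=lambda d: euclidean_distance(query, docs[d]))
assert by_cosine == by_euclid  # identical rankings on unit vectors
```

So with normalized embeddings, switching the metric changes the scores but not the retrieval order.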

But there are ways to use Euclidean distance if you want to, for example with Pinecone:

Python
import pinecone

# Older pinecone-client API; newer client versions use a Pinecone() instance
indexes = pinecone.list_indexes()
if "quickstart-index" not in indexes:
    # 1536 dimensions matches OpenAI's text-embedding-ada-002
    pinecone.create_index(
        "quickstart-index", dimension=1536, metric="euclidean", pod_type="p1"
    )

https://docs.llamaindex.ai/en/stable/examples/vector_stores/existing_data/pinecone_existing_data/?h=pinecone
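The suggestion above about subclassing a local embedding class to change the similarity mode can be illustrated generically. This is a minimal sketch of the override pattern only; the class and method names are assumptions for illustration, not LlamaIndex's actual API:

```python
import math

class ToyEmbedding:
    """Stand-in for an embedding base class (illustrative name, not a real API)."""

    def similarity(self, a, b):
        # Default behavior: cosine similarity, higher is more similar
        dot = sum(x * y for x, y in zip(a, b))
        na = math.sqrt(sum(x * x for x in a))
        nb = math.sqrt(sum(x * x for x in b))
        return dot / (na * nb)

class EuclideanEmbedding(ToyEmbedding):
    """Subclass that swaps in Euclidean distance for the similarity score."""

    def similarity(self, a, b):
        dist = math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
        # Negate so "higher is better" still holds, matching the base contract
        return -dist

emb = EuclideanEmbedding()
print(emb.similarity([0.0, 0.0], [3.0, 4.0]))  # -5.0
```

The key point is keeping the score orientation consistent: if the retriever sorts descending by similarity, a distance must be negated (or otherwise inverted) before being returned.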