i am really confused about how to get

At a glance

The community member is confused about how to get more control over retrieval from a Milvus 2.4 vector store. They tried using the hybrid search with RRFRanker but encountered an error connecting to the Milvus server. They are also confused because most of the documentation is about the query engine, which works differently, and uses nodes instead of vector stores.

The community member has found that they can use node postprocessors in the chat engine, which they want to test. They would prefer to have more control over the retrieval function, but they are unsure if the node postprocessors are enough or if they really need that retrieval control. They are confused because there are many different ways to do things, and they wouldn't be surprised if the postprocessors are the same as the retrieval methods.

In the comments, another community member is also confused about what the original poster needs and asks what they mean by "control". They mention that hybrid retrieval was only recently added to Milvus, so the original poster should double-check their Milvus version. They also explain that node postprocessors are applied after retrieval and that the original poster can have as many as they want, and even create a custom one.

The original poster responds that they have read a lot about retrieval but never got

Useful resources

BBanaanBakje

i am really confused about how to get more control with retrieval from a vector store (milvus 2.4).
i have tried the hybrid search with RRFRanker (https://docs.llamaindex.ai/en/stable/examples/vector_stores/MilvusHybridIndexDemo/), but i get error:

Plain Text

pymilvus.exceptions.MilvusException: <MilvusException: (code=2, message=Fail connecting to server on localhost:19530, illegal connection params or server unavailable)>

also since i am using index.as_chat_engine i am very confused since 99% of the docs is about query engine which works slightly different and of those 90% use nodes instead of vector stores, so im kinda stuck.

i have found that i can use node postprocessors in chat engine, which i still have to test:

Plain Text

chat_engine = index.as_chat_engine(
    node_postprocessors=[
        SimilarityPostprocessor(similarity_cutoff=0.7)
    ],
    chat_mode="context",
    memory=memory,
    system_prompt=(
        "You are an expert Q&A system that is trusted around the world. Answer questions based on the given context."
    ),
)

what i would like to do is either get the hybrid method working on milvus, but i would actually prefer to have more control about the retrieval function, im not sure if node postprocessor is enough for me yet and if i really need that retrieval control, but i would like to play around with it since it seems to be available.

i could be wrong, because im kinda confused since there are so many different ways to do things, so i wouldnt be surprised if postprocessors would be the same

TLDR: i need a little guidance on retrieval for milvus vector store index.as_chat_engine

2 comments

LLogan M

I guess I'm also confused at what you need 😅 what do you mean by control?

Hybrid retrieval was only recently added to milvus, so double check your milvus version

pip install -U llama-index-vector-stores-milvus pymilvus

Node postprocessors are applied after retrieval, and you can have as many as you want, and even sub-class and create a custom one

BBanaanBakje

i read a lot about retrieval but never got it working other than the "standard" way: chat_engine.chat(question), i was curious if changing some settings would improve my RAG implementation

since node processing happens after retrieval, im not sure if it has the same effect as the retrieval methods before.
if it is ill try that out more first

edit: i was confused because there are so many different ways of retrieval, but none worked for my chat engines + vector store combination vs the query engine + nodes, the docs dont seem to favor me in that regard 😄

im pretty sure milvus 2.4 is installed and working, double checked it

Attachment

Add a reply

Find answers from the community

i am really confused about how to get