Like, using the new ingestion pipeline?
it's running live here if you want to see
oh hmm, maybe it's just del uploadedfile
?
assuming you don't need it after running that function
as you see i have it running rm -r 'dir/*'
but my document ingestion process is still holding the document in the buffer cache
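yeah, once your function is done with it you can drop the reference explicitly. a minimal sketch (uploadedfile is just the variable name from above; the gc call is optional):

import gc

del uploadedfile  # drop the last python reference to the file contents
gc.collect()      # nudge the collector to free that memory now; note the OS page cache is a separate thing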
do you have any examples of saving the chat to the chromadb?
i have documents working but it doesn't want to save the chat lol
chromadb is only for vectors, can't really save chats in there 🤔
need something like redis and just pickle the chat history into there
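a minimal sketch of that with redis-py (the key name and chat_history are placeholders):

import pickle
import redis

r = redis.Redis(host="localhost", port=6379)

# serialize the whole chat history under one key
r.set("chat_history:session_1", pickle.dumps(chat_history))

# later, load it back
chat_history = pickle.loads(r.get("chat_history:session_1"))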
does redis convert it to a vector?
Redis has support for vectors too (we have RedisVectorStore), but it's also just a general purpose key-val store (like a big dictionary lol)
is there a way to convert the chat to a vector?
i'm trying to use chroma db as long term memory
Oh that's easy enough
from llama_index.embeddings import OpenAIEmbedding
embed_model = OpenAIEmbedding()
vectors = embed_model.get_text_embedding_batch([str(message) for message in chat_history])
or you can create nodes with the chat message strings
and throw those into a VectorStoreIndex
probably more what you wanted lol
nodes = [TextNode(text=str(message)) for message in chat_history]
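put together, roughly this (a sketch using the old-style llama_index imports; chat_history is your list of messages):

from llama_index import VectorStoreIndex
from llama_index.schema import TextNode

# one node per chat message
nodes = [TextNode(text=str(message)) for message in chat_history]

# embed + index them; query later via index.as_retriever() or index.as_query_engine()
index = VectorStoreIndex(nodes)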
can that be used with hugging face instruct embeddings?
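yep, you can swap in a local hugging face model. a sketch, assuming llama_index's HuggingFaceEmbedding wrapper and the nodes from above:

from llama_index import ServiceContext, VectorStoreIndex
from llama_index.embeddings import HuggingFaceEmbedding

# local embedding model instead of OpenAI
embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en")
service_context = ServiceContext.from_defaults(embed_model=embed_model)
index = VectorStoreIndex(nodes, service_context=service_context)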
this is my defined function:
import pandas as pd
from langchain.embeddings import HuggingFaceBgeEmbeddings
from langchain.vectorstores import Chroma

def ingest_conversation(msg, dataframe=False):
    # Chroma.from_texts expects a list of strings
    docs = [msg] if isinstance(msg, str) else msg
    model_name = "BAAI/bge-small-en"
    model_kwargs = {"device": "cpu"}
    encode_kwargs = {"normalize_embeddings": True}
    embedding = HuggingFaceBgeEmbeddings(model_name=model_name, model_kwargs=model_kwargs, encode_kwargs=encode_kwargs)
    adk = Chroma.from_texts(docs, embedding, collection_name="Conversation", persist_directory="./memory")
    if dataframe:
        # the Chroma store itself isn't DataFrame-able; dump its stored records first
        return pd.DataFrame(adk.get())
    return adk
oh and the buffer issue was easily fixed with:
I came up with a "hacky" way to get the chat into chromadb lol. i had python write the chat to a .txt, then sent that .txt through the document ingestion and voilà... lol
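roughly this (the filename is just a placeholder, and ingest_conversation is the function from above):

# dump the chat to a text file, then reuse the document pipeline on it
with open("chat_history.txt", "w") as f:
    for message in chat_history:
        f.write(str(message) + "\n")

with open("chat_history.txt") as f:
    ingest_conversation(f.read())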
now to build the relevance comparison search... any pointers?
Can you explain that a bit more?
the relevance search against the chromadb? i don't know if I can lol... i'm still reading and learning what it is lol
so if it retrieves similar data and includes it with the prompt before sending to the llm, is that what everyone calls a "RAG" system?
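yep, that's RAG: retrieve the most relevant chunks, stuff them into the prompt. a minimal sketch against your persisted store (the query and prompt wording are made up, embedding is the model from your function):

from langchain.vectorstores import Chroma

store = Chroma(collection_name="Conversation", embedding_function=embedding, persist_directory="./memory")

user_question = "what did we talk about yesterday?"  # example query
hits = store.similarity_search(user_question, k=3)   # top-3 most similar chunks

context = "\n".join(doc.page_content for doc in hits)
prompt = f"Use this context:\n{context}\n\nQuestion: {user_question}"
# then send `prompt` to the llm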
ok i built an embedding server and I was wondering if you might know. i'm sending the text file from my loader to a get endpoint, and with that i need to pass a couple args. any idea where to put the args? or are there any nifty llama_index embeddings i can load that handle that?
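not sure about a built-in llama_index wrapper for a custom GET server, but with a GET request the args normally go in the query string. with plain requests (the url and param names here are made up):

import requests

# query-string args ride along via params=
resp = requests.get(
    "http://localhost:8000/embed",
    params={"text": document_text, "model": "bge-small", "normalize": "true"},
)
vector = resp.json()

heads up though: long documents can blow past URL length limits on GET, so embedding servers usually take the text in a POST body instead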