kevingoed

Hey everyone. Is there a way to call the

Hey everyone. Is there a way to call the LLM without an index, I want it to be like a basic call to OpenAI without the knowledge in the index. OR would you recommend doing that with the OpenAI API/SDK?

1 comment

kkevingoed

Vectors

I'm using chat engine to query a vector store. If no vectors exist in the vector store I dont get an error, which is a problem. It will simply send the query to the LLM with an empty context. How do I avoid this?

18 comments

kkevingoed

Docstore

What exactly is the doc store for btw?

1 comment

kkevingoed

Hey everyone 🙂 I was wondering what

Hey everyone 🙂 I was wondering what features of Llama Index would be best to use if I would want to build a chat bot that can ultimately generate a JSON after multiple back and forth questions (without and with knowledge from vector store). Would Agents make sense for that, or would you just chain a few prompts together. Is Llama Index even necessary in that scenario?

1 comment

kkevingoed

Hi there, we're now using PGVectorStore

Hi there, we're now using PGVectorStore and we're seeing that there's a lot of open connections to the postgres db. Do we need to close the connections somehow, has anyone every run into this issue?

8 comments

kkevingoed

Namespace

What's the recommended way to separate different "namespaces" with PGVectorStore. In PineCone we used different indexes for different documents to limit the replies to a particular document/scope. I was wondering what the equivalent would in PGVectoreStore. Do you recommend setting up a new table for each "context"/"namespace"?

3 comments

kkevingoed

Keywords

Quick question, I just started playing around with KeywordExtractor and specified keywords=5 as a param. But yet in my postgres DB i find up to 50 keywords per document chunk. Is this normal?

3 comments

kkevingoed

Quick question: What's the difference

Quick question: What's the difference between loading the index with DocumentSummaryIndex.from_documents and using load_index_from_storage

2 comments

kkevingoed

Also has someone else experienced that

Also has someone else experienced that gpt-3.5-turbo-1106 produces significantly worse results as the old gpt-3.5-turbo-16k-0613

7 comments

kkevingoed

Hey there 🙂 got another questions for y

Hey there 🙂 got another questions for y'all:

Plain Text

    PyMuPDFReader = download_loader("PDFReader")
    loader = PyMuPDFReader()

    documents = loader.load_data(file=Path("./test-doc2.pdf"))
    # Create and store Summray index
    storage_context = StorageContext.from_defaults()

    index = DocumentSummaryIndex.from_documents(
        documents,
        service_context=service_context,
        storage_context=storage_context,
        show_progress=True,
    )
    query_engine = index.as_query_engine()
    result = query_engine.query("Write an extensive summary of this context for me?")
    print(result)

How can I make sure that the summary that it writes is at longer than 20 sentences. Or how do I make sure it uses the full 4096 tokens for the response?

4 comments

kkevingoed

Question: What is the best way to

Question: What is the best way to summarize single documents? Would that be with a VectorIndex and then just prompting it "summarize this document for me"?

21 comments

kkevingoed

I noticed that my `service_context` does

I noticed that my service_context does NOT update unless I restart the entire application, has anyone run into this issue before?

Here's my code:

Plain Text

    set_global_service_context(service_context)
    vector_store = PineconeVectorStore(
        pinecone_index=PINECONE_INDEX, namespace=namespace
    )
    storage_context = StorageContext.from_defaults(
        docstore=DOCUMENT_STORE,
        index_store=INDEX_STORE,
        vector_store=vector_store,
    )
    print(service_context)
    return VectorStoreIndex.from_vector_store(
        vector_store=vector_store,
        storage_context=storage_context,
        service_context=service_context,
    )

45 comments

kkevingoed

I believe Llama Index depends on `tree

I believe Llama Index depends on tree-sitter-languages and Tree Sitter doesnt have a build for Mac: https://github.com/grantjenks/py-tree-sitter-languages/issues/20

13 comments

Find answers from the community

Hey everyone. Is there a way to call the

Vectors

Docstore

Hey everyone 🙂 I was wondering what

Hi there, we're now using PGVectorStore

Namespace

Keywords

Quick question: What's the difference

Also has someone else experienced that

Hey there 🙂 got another questions for y

Question: What is the best way to

I noticed that my `service_context` does

I believe Llama Index depends on `tree