how can I use streaming response from chat engine in fastapi

Please stand by @Tech explorer as I search the knowledge base. I normally take about 20 seconds or so. :wave:
----------------------
To use a streaming response from a chat engine in FastAPI, you first need to create a chat engine instance from your LlamaIndex index. Here's an example of how you can do this:

Plain Text
chat_engine = index.as_chat_engine()


Then, you can use the stream_chat method of the chat engine to get a streaming response:

Plain Text
streaming_response = chat_engine.stream_chat("Tell me a joke.")


The stream_chat method returns a streaming response object whose response_gen is a generator you can iterate over to get the response tokens:

Plain Text
for token in streaming_response.response_gen:
    print(token, end="")


In FastAPI, you can use the StreamingResponse class to send a streaming response to the client. Here's an example of how you can do this:

Plain Text
from fastapi import FastAPI
from fastapi.responses import StreamingResponse

app = FastAPI()

@app.get("/stream_chat")
def stream_chat():
    # assumes `index` has been built elsewhere in your application
    chat_engine = index.as_chat_engine()
    streaming_response = chat_engine.stream_chat("Tell me a joke.")
    # response_gen yields the response token by token
    return StreamingResponse(streaming_response.response_gen, media_type="text/plain")


In this example, when you access the /stream_chat endpoint, FastAPI will send a streaming response to the client with the chat engine's response.

Please note that this is a simplified example and you might need to adjust it according to your specific needs and setup.
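
If you want to quickly check the stream from a client, you can read the response incrementally. Here's a minimal sketch, assuming the server above is running locally on port 8000 (the URL is illustrative):

Plain Text
import httpx

# Read the streamed tokens from the endpoint defined above
with httpx.stream("GET", "http://localhost:8000/stream_chat") as response:
    for chunk in response.iter_text():
        print(chunk, end="", flush=True)
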
For more information, you can refer to the following sources:
  • LlamaIndex Chat Engine Usage Pattern
  • LlamaIndex API Reference for Chat Engines
  • FastAPI Documentation for Streaming Response
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai my chat engine with mode=context is taking too long for a simple hello message. How can I avoid this and make it reply with just "hello"?
Let me check the knowledge base for you. One moment! :books:
----------------------
In "context" mode, the chat engine runs a retrieval step for every user message and inserts the retrieved text into the prompt. So even a simple message like "Hello" pays for the retrieval call plus a larger prompt for the LLM to process.

One way to reduce the response time is to keep the prompt small. For example, you can set a smaller token limit on the chat engine's memory buffer, which caps how much chat history is carried along with each request (to also retrieve less context per message, see the sketch after the example below). Here's an example:

Plain Text
from llama_index.core.memory import ChatMemoryBuffer

# Set a smaller token limit
memory = ChatMemoryBuffer.from_defaults(token_limit=1500)

chat_engine = index.as_chat_engine(
    chat_mode="context",
    memory=memory,
    system_prompt=(
        "You are a chatbot, able to have normal interactions, as well as talk"
        " about an essay discussing Paul Grahams life."
    ),
)


In this example, the chat engine's memory buffer is configured with a token limit of 1500, which means at most roughly 1500 tokens of conversation history are kept in the prompt.

Please note that setting a smaller token limit can also affect the quality of the chat engine's responses, as it will have less context to work with. You might need to experiment with different token limits to find a balance between response time and response quality.
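
If retrieval itself is the slow part, you can also fetch fewer chunks per message. Here's a minimal sketch; the extra keyword arguments are passed through to the underlying retriever, so treat similarity_top_k here as an assumption to verify against your LlamaIndex version:

Plain Text
# Retrieve fewer chunks per message to keep the prompt small and fast
chat_engine = index.as_chat_engine(
    chat_mode="context",
    similarity_top_k=2,
    memory=memory,
)
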

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai but for general messages like "hello" it should not search the context, it should just reply back with hello
Let me check the knowledge base for you. One moment! :books:
----------------------
You're correct. For general greetings like "Hello", it's not necessary to search the context. However, the LlamaIndex chat engine in "context" mode is designed to use the context for all responses, which can lead to unnecessary delays for simple messages.

One way to handle this is to add a check for simple greetings in your code and respond to them immediately, bypassing the chat engine. Here's an example:

Plain Text
def handle_message(message):
    # Check if the message is a simple greeting
    if message.lower() == "hello":
        return "Hello!"

    # If not, use the chat engine to generate a response
    else:
        response = chat_engine.chat(message)
        return str(response)  # chat() returns a response object, so convert it to text


In this example, the handle_message function checks if the incoming message is "hello". If it is, it immediately returns "Hello!". If not, it uses the chat engine to generate a response.

Please note that this is a simplified example and you might need to adjust it according to your specific needs and setup. For example, you might want to add more conditions to handle other simple greetings or common phrases.
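
For instance, here is a slightly extended sketch; the greeting list and the matching rule are illustrative assumptions, not a built-in LlamaIndex feature:

Plain Text
GREETINGS = {"hello", "hi", "hey", "good morning", "good evening"}

def handle_message(message):
    # Answer common greetings immediately, without touching the index
    if message.lower().strip(" !.?") in GREETINGS:
        return "Hello! How can I help you?"
    # Everything else goes through the context chat engine
    return str(chat_engine.chat(message))
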

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai I'm using a document summary index with a llamacpp LLM and defined Settings.llm = llm, but it's still throwing an error about the OpenAI API key
Let me check the knowledge base for you. One moment! :books:
----------------------
The error you're encountering is usually because LlamaIndex is still falling back to a default OpenAI model somewhere in your code, even though you've set Settings.llm. The most common cause is the embedding model: a VectorStoreIndex, or a Document Summary Index (which embeds the generated summaries by default), will use the default OpenAI embedding model unless you also set Settings.embed_model.

Here's an example of how you can set up a SummaryIndex with a local LLM:

Plain Text
from llama_index.llms.llama_cpp import LlamaCPP
from llama_index.core import Settings
from llama_index.core import SummaryIndex

# define LLM (model_path is an example; point it at your local GGUF file)
llm = LlamaCPP(model_path="./models/llama-2-7b-chat.gguf")
Settings.llm = llm

# Define Summary Index
summary_index = SummaryIndex(nodes, use_async=False)


In this example, the SummaryIndex is created with the use_async parameter set to False, so it uses the LLM defined in Settings.llm (in this case, llama.cpp) and runs its LLM calls synchronously.

Since the embedding model also defaults to OpenAI, you should define a local embedding model in your settings as well. One commonly used local option is a HuggingFace embedding model (the model name below is just an example):

Plain Text
from llama_index.embeddings.huggingface import HuggingFaceEmbedding

# define a local embedding model (requires the llama-index-embeddings-huggingface package)
embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")
Settings.embed_model = embed_model


In this example, a local HuggingFace embedding model is defined and set in Settings.embed_model, so no OpenAI API key is needed for embeddings.

Please note that these are simplified examples and you might need to adjust them according to your specific needs and setup.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai I'm passing a response synthesizer to the document summary index
Let me check the knowledge base for you. One moment! :books:
----------------------
If you're passing a response synthesizer to a Document Summary Index in LlamaIndex, you're likely doing so when creating a query engine. Here's an example of how you can do this:

Plain Text
from llama_index.core import get_response_synthesizer

# Create a response synthesizer
response_synthesizer = get_response_synthesizer(response_mode="compact")

# Create a query engine with the response synthesizer
query_engine = summary_index.as_query_engine(response_synthesizer=response_synthesizer)


In this example, a response synthesizer is created with the get_response_synthesizer function, using the "compact" response mode. This response synthesizer is then passed to the as_query_engine method of the summary index when creating a query engine.

Please note that this is a simplified example and you might need to adjust it according to your specific needs and setup.
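
If you're working with a Document Summary Index, a response synthesizer can also be passed when the index itself is built, since it is used to generate each document's summary. A short sketch (documents here stands for your loaded document list, and a summarization-oriented mode such as "tree_summarize" is typical for this step):

Plain Text
from llama_index.core import DocumentSummaryIndex, get_response_synthesizer

# Synthesizer used to build the per-document summaries
summary_synthesizer = get_response_synthesizer(response_mode="tree_summarize", use_async=False)

doc_summary_index = DocumentSummaryIndex.from_documents(
    documents,
    response_synthesizer=summary_synthesizer,
    show_progress=True,
)
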

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai I'm using a sentence splitter, a response synthesizer and a document summary index with a llamacpp LLM, but it's giving an OpenAI API key error for the embedding
Let me check the knowledge base for you. One moment! :books:
----------------------
The OpenAI API key error for the embedding means LlamaIndex is still using its default embedding model, which is OpenAI's. Setting Settings.llm only replaces the LLM: the Document Summary Index also embeds the generated summaries by default, and that step uses Settings.embed_model.

To fix this, define a local embedding model (for example a HuggingFace embedding model) and set it in Settings.embed_model, as shown in the earlier answer.

Note that use_async only controls whether the summary LLM calls are made concurrently; it does not switch models by itself, so the embedding model still needs to be configured either way.

Here's an example of how you can set up a Document Summary Index with a local LLM and a local embedding model:

Plain Text
from llama_index.llms.llama_cpp import LlamaCPP
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.core import Settings
from llama_index.core import DocumentSummaryIndex
from llama_index.core.node_parser import SentenceSplitter
from llama_index.core import get_response_synthesizer

# define LLM (model_path is an example; point it at your local GGUF file)
llm = LlamaCPP(model_path="./models/llama-2-7b-chat.gguf")
Settings.llm = llm

# define a local embedding model so nothing falls back to OpenAI
Settings.embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")

# Create a response synthesizer
response_synthesizer = get_response_synthesizer(response_mode="compact")

# Define Sentence Splitter
splitter = SentenceSplitter(chunk_size=1024)

# Define Document Summary Index
doc_summary_index = DocumentSummaryIndex.from_documents(
    city_docs,
    llm=llm,
    transformations=[splitter],
    response_synthesizer=response_synthesizer,
    show_progress=True,
    use_async=False
)


In this example, the DocumentSummaryIndex is created with use_async=False: it uses the LLM defined in Settings.llm (llama.cpp) to generate the summaries synchronously and the local embedding model in Settings.embed_model to embed them, so no OpenAI API key is required.

Please note that these are simplified examples and you might need to adjust them according to your specific needs and setup.
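
If you don't need embedding-based retrieval over the summaries at all, you may also be able to skip the embedding step entirely when building the index. This is a sketch assuming the embed_summaries flag behaves this way in your LlamaIndex version:

Plain Text
# Skip embedding the generated summaries (LLM-based retrieval only)
doc_summary_index = DocumentSummaryIndex.from_documents(
    city_docs,
    llm=llm,
    transformations=[splitter],
    response_synthesizer=response_synthesizer,
    embed_summaries=False,
)
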

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai how can I make the document summary index faster by using multiple cores
Let me check the knowledge base for you. One moment! :books:
----------------------
One of the main knobs for speeding up Document Summary Index creation is the use_async=True parameter. It makes the per-document summary LLM calls run concurrently instead of one after another. Note that this is concurrency rather than true multi-core parallelism: it helps most when the LLM backend can serve several requests at once (for example a remote API), whereas a single local llama.cpp model processes requests one at a time, so the gain there is limited.

Here's an example of how you can do this:

Plain Text
from llama_index.core import DocumentSummaryIndex
from llama_index.core.node_parser import SentenceSplitter
from llama_index.core import get_response_synthesizer
from llama_index.llms.llama_cpp import LlamaCPP
from llama_index.core import Settings

# define LLM (model_path is an example; point it at your local GGUF file)
llm = LlamaCPP(model_path="./models/llama-2-7b-chat.gguf")
Settings.llm = llm

# Create a response synthesizer
response_synthesizer = get_response_synthesizer(response_mode="compact")

# Define Sentence Splitter
splitter = SentenceSplitter(chunk_size=1024)

# Define Document Summary Index with use_async=True
doc_summary_index = DocumentSummaryIndex.from_documents(
    city_docs,
    llm=llm,
    transformations=[splitter],
    response_synthesizer=response_synthesizer,
    show_progress=True,
    use_async=True
)


In this example, the DocumentSummaryIndex is created with use_async=True, so the summary generation calls are issued concurrently, which can be noticeably faster when the LLM backend can handle concurrent requests.

Please note that use_async does not by itself require an OpenAI API key; it simply needs an LLM (and embedding model) whose asynchronous calls can genuinely run concurrently. With a single local llama.cpp model the requests are still processed one at a time, so the speedup may be small, in which case use_async=False is fine.
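
Also, if you're running this in a notebook or another environment that already has an event loop, you may need nest_asyncio before using use_async=True:

Plain Text
import nest_asyncio

# Allow nested event loops (commonly needed in Jupyter when use_async=True)
nest_asyncio.apply()
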

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai when I'm building the document summary index, the final summary is not picking up the dates in my text
Let me check the knowledge base for you. One moment! :books:
----------------------
The Document Summary Index in LlamaIndex generates summaries based on the text in your documents. If the final summary is not including dates from your text, it is usually because the large language model (LLM) generating the summaries does not consider those dates important enough to keep.

You can't fully control what the LLM includes or excludes, but you can steer it by adjusting the parameters of the Document Summary Index or the response synthesizer.

For example, you can try using a different response mode when creating the response synthesizer. Here's an example:

Plain Text
from llama_index.core import get_response_synthesizer

# Create a response synthesizer with a different response mode
response_synthesizer = get_response_synthesizer(response_mode="tree_summarize")


In this example, the response synthesizer is created with the "tree_summarize" response mode. This mode generates a tree-structured summary, which might include more details from your text.
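
Another lever is the summary prompt itself. The Document Summary Index builds each summary from a summary query, which you can override to call out what must be preserved. The wording below is an illustrative assumption, and documents and response_synthesizer come from your existing setup:

Plain Text
doc_summary_index = DocumentSummaryIndex.from_documents(
    documents,
    response_synthesizer=response_synthesizer,
    summary_query=(
        "Summarize the provided text, making sure to preserve all dates, "
        "numbers, and named entities mentioned in it."
    ),
)
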

Please note that this is a simplified example and you might need to adjust it according to your specific needs and setup.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai what are the different summarization modes we have apart from tree_summarize
Let me check the knowledge base for you. One moment! :books:
----------------------
In LlamaIndex, there are several response modes that you can use with a response synthesizer:

  1. "refine": iterates over the retrieved text chunks one at a time, refining the answer with each chunk. It gives detailed answers but makes one LLM call per chunk.
  2. "compact": like "refine", but first packs as many chunks as possible into each LLM call, so fewer calls are needed. This is the usual default.
  3. "tree_summarize": recursively summarizes the chunks, combining the partial summaries in a tree until a single final answer remains. It is well suited to summarization tasks.
  4. "simple_summarize": truncates the combined chunks to fit a single LLM call and answers in one shot. It is fast but may drop details.
  5. Other modes such as "accumulate", "compact_accumulate", "no_text", and "generation" exist for more specialized use cases.
Here's an example of how you can create a response synthesizer with a different response mode:

Plain Text
from llama_index.core import get_response_synthesizer

# Create a response synthesizer with the "compact" response mode
response_synthesizer = get_response_synthesizer(response_mode="compact")


In this example, the response synthesizer is created with the "compact" response mode.

Please note that the effectiveness of each response mode can depend on your specific use case and the nature of your data.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai how can I pass the filename as the document id in the document summary index
Let me check the knowledge base for you. One moment! :books:
----------------------
In LlamaIndex, you can use the SimpleDirectoryReader to load documents from a directory and automatically set the doc_id to be the full path to each document. You can do this by setting the filename_as_id=True parameter when creating the SimpleDirectoryReader. Here's an example:

Plain Text
from llama_index.core import SimpleDirectoryReader

# Load documents from a directory and set the doc_id to be the filename
documents = SimpleDirectoryReader("./data", filename_as_id=True).load_data()


In this example, the SimpleDirectoryReader is created with the filename_as_id=True parameter, so the doc_id of each document is set to its file path.

You can then pass these documents to the DocumentSummaryIndex.from_documents method to create a Document Summary Index:

Plain Text
from llama_index.core import DocumentSummaryIndex
from llama_index.llms.llama_cpp import LlamaCPP
from llama_index.core.node_parser import SentenceSplitter
from llama_index.core import get_response_synthesizer
from llama_index.core import Settings

# define LLM (model_path is an example; point it at your local GGUF file)
llm = LlamaCPP(model_path="./models/llama-2-7b-chat.gguf")
Settings.llm = llm

# Create a response synthesizer
response_synthesizer = get_response_synthesizer(response_mode="compact")

# Define Sentence Splitter
splitter = SentenceSplitter(chunk_size=1024)

# Create a Document Summary Index
doc_summary_index = DocumentSummaryIndex.from_documents(
    documents,
    llm=llm,
    transformations=[splitter],
    response_synthesizer=response_synthesizer,
    show_progress=True,
    use_async=False
)


Please note that these are simplified examples and you might need to adjust them according to your specific needs and setup.
For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai how can I add metadata for efficient retrieval in the document summary index
Let me check the knowledge base for you. One moment! :books:
----------------------
To add metadata for more efficient retrieval, you can attach a metadata dictionary to each document before building the Document Summary Index. Every LlamaIndex Document (and the nodes derived from it) carries a metadata field, and SimpleDirectoryReader can also populate it for you via its file_metadata argument.

Here's an example of how you can add metadata to your documents:

Plain Text
# Add metadata to your documents before building the index
for doc in documents:
    doc.metadata.update({"author": "John Doe", "year": 2021})


In this example, the metadata is a dictionary that can contain any key-value pairs you want. It is carried over to the nodes created from each document and, depending on your settings, is visible to both the embedding model and the LLM, which can help retrieval and summarization.

Metadata-based filtering at query time is mainly a feature of vector store indexes. For example, with a VectorStoreIndex you can pass metadata filters to the retriever (a sketch; vector_index here stands for whatever vector index you have built, and the Document Summary Index retrievers select documents by their summaries, so filtering there is more limited):

Plain Text
from llama_index.core.vector_stores import ExactMatchFilter, MetadataFilters

# Only consider nodes whose "author" metadata equals "John Doe"
filters = MetadataFilters(filters=[ExactMatchFilter(key="author", value="John Doe")])
retriever = vector_index.as_retriever(filters=filters)
nodes = retriever.retrieve("What are the sports teams in Toronto?")


In this example, only nodes whose metadata matches the filter are considered during retrieval.

Please note that these are simplified examples and you might need to adjust them according to your specific needs and setup.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai I want metadata to be added automatically for the document summary index
Let me check the knowledge base for you. One moment! :books:
----------------------
To automatically add metadata to your documents for the Document Summary Index, you can use the MetadataExtractor provided by LlamaIndex. This allows you to automatically extract metadata from each document based on certain rules or patterns.

Here's an example of how you can use the MetadataExtractor:

Plain Text
from llama_index.core import MetadataExtractor

# Create a MetadataExtractor
metadata_extractor = MetadataExtractor()

# Define a rule to extract metadata
rule = {"pattern": r"Author: (.*)", "metadata_field": "author"}

# Add the rule to the MetadataExtractor
metadata_extractor.add_rule(rule)

# Extract metadata from your documents
for doc in documents:
    metadata = metadata_extractor.extract(doc)
    doc.metadata.update(metadata)


In this example, the MetadataExtractor is used to extract metadata from each document. The rule is a dictionary that defines a pattern to match in the document text and a metadata field to store the matched text. The extract method of the MetadataExtractor returns a dictionary of extracted metadata, which is then added to the document's metadata.

Once you've added metadata to your documents, you can use it during retrieval by passing a metadata filter to the retrieve method of your index.

Please note that these are simplified examples and you might need to adjust them according to your specific needs and setup.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai it's giving an error: no MetadataExtractor
Let me check the knowledge base for you. One moment! :books:
----------------------
I apologize for the confusion. Based on the provided extracts, it seems there might have been a misunderstanding. The MetadataExtractor class does not exist in the LlamaIndex library. Instead, LlamaIndex provides several specific metadata extractors, such as SummaryExtractor, QuestionsAnsweredExtractor, TitleExtractor, KeywordExtractor, and EntityExtractor.

Here's an example of how you can use these extractors to add metadata to your documents:

Plain Text
from llama_index.core.extractors import (
    SummaryExtractor,
    QuestionsAnsweredExtractor,
    TitleExtractor,
    KeywordExtractor,
)
from llama_index.extractors.entity import EntityExtractor
from llama_index.core.node_parser import TokenTextSplitter
from llama_index.llms.openai import OpenAI
from llama_index.core.schema import MetadataMode

# Define LLM
llm = OpenAI(temperature=0.1, model="gpt-3.5-turbo", max_tokens=512)

# Define Node Parser
text_splitter = TokenTextSplitter(separator=" ", chunk_size=512, chunk_overlap=128)

# Define Metadata Extractors
extractors = [
    TitleExtractor(nodes=5, llm=llm),
    QuestionsAnsweredExtractor(questions=3, llm=llm, metadata_mode=MetadataMode.EMBED),
    SummaryExtractor(summaries=["prev", "self", "next"], llm=llm),
    KeywordExtractor(keywords=10, llm=llm),
    EntityExtractor(prediction_threshold=0.5),
]

# Add extractors to transformations
transformations = [text_splitter] + extractors


In this example, several metadata extractors are defined and added to the transformations. They use the LLM (and, for the EntityExtractor, a local entity model) to automatically pull metadata such as titles, keywords, summaries, answered questions, and entities out of each chunk.
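
You can then pass these transformations when building your Document Summary Index so the metadata is attached as the documents are ingested (documents and response_synthesizer here come from your existing setup):

Plain Text
# Use the splitter + extractors as the ingestion transformations for the index
doc_summary_index = DocumentSummaryIndex.from_documents(
    documents,
    transformations=transformations,
    response_synthesizer=response_synthesizer,
    show_progress=True,
)
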

Please note that these are simplified examples and you might need to adjust them according to your specific needs and setup.
For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai I want the document id to be the file name, not the entire file path. Just the filename alone
Let me check the knowledge base for you. One moment! :books:
----------------------
In LlamaIndex, when you load documents with SimpleDirectoryReader and filename_as_id=True, the doc_id is set to the full path of each document. If you only want the filename, you can modify the document id after loading.

Here's an example of how you can do this:

Plain Text
from llama_index.core import SimpleDirectoryReader
from pathlib import Path

# Load documents from a directory, using the file path as the document id
documents = SimpleDirectoryReader("./data", filename_as_id=True).load_data()

# Set the document id to just the filename
for doc in documents:
    doc.id_ = Path(doc.id_).name


In this example, the documents are loaded with filename_as_id=True, and then the id of each document (id_, which is what doc_id exposes) is reduced to just the filename using the Path class from the pathlib module.

Please note that this is a simplified example and you might need to adjust it according to your specific needs and setup.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai I want to remove the extension of the filename as well
Let me check the knowledge base for you. One moment! :books:
----------------------
Sure, you can remove the file extension from the filename using the stem property of the Path object. Here's how you can do it:

Plain Text
from llama_index.core import SimpleDirectoryReader
from pathlib import Path

# Load documents from a directory, using the file path as the document id
documents = SimpleDirectoryReader("./data", filename_as_id=True).load_data()

# Set the document id to the filename without its extension
for doc in documents:
    doc.id_ = Path(doc.id_).stem


In this example, the documents are loaded with filename_as_id=True, and then the id of each document is set to its filename without the extension, using the stem property of the Path class from the pathlib module.

Please note that this is a simplified example and you might need to adjust it according to your specific needs and setup.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai but my document id is not getting updated with the file name.
Let me check the knowledge base for you. One moment! :books:
----------------------
If your doc_id is not getting updated with the filename, here are a few things you can check:

  1. Check filename_as_id: if you load the documents without filename_as_id=True, the default doc_id is a random UUID rather than the file path, so taking Path(doc.id_).stem won't give you the filename. Make sure the reader is created with filename_as_id=True (or read the filename from doc.metadata["file_name"] instead).
  2. Check the path: make sure the path you're providing to the SimpleDirectoryReader is correct and points to the directory where your files are located.
  3. Check the file loading: make sure the files are being loaded correctly by printing the documents after loading them.
  4. Check the id update: make sure you update the documents before building the index, and print the doc_id of each document afterwards to confirm.
Here's an example of how you can do these checks:

Plain Text
from llama_index.core import SimpleDirectoryReader
from pathlib import Path

# Load documents, using the file path as the document id
documents = SimpleDirectoryReader("./data", filename_as_id=True).load_data()

# Print the documents to check that they're loaded correctly
for doc in documents:
    print(doc)

# Set the document id to the filename without its extension
for doc in documents:
    doc.id_ = Path(doc.id_).stem

# Print the doc_id of each document to check that it's updated correctly
for doc in documents:
    print(doc.doc_id)


In this example, the documents and the doc_id of each document are printed to the console. This can help you identify any issues with the file loading or the doc_id updating.

Please note that this is a simplified example and you might need to adjust it according to your specific needs and setup.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------
@kapa.ai my document summarizer is missing important information. Can I add any keywords to preserve context
Let me check the knowledge base for you. One moment! :books:
----------------------
In LlamaIndex, the summarization is performed by a large language model (LLM), and you can't fully control which details it keeps. You can, however, steer it by adjusting the parameters of the Document Summary Index or the response synthesizer.

One way to preserve context is to use a larger chunk_size when splitting your documents into chunks. This will result in larger chunks of text being passed to the LLM, which might help it to better understand the context. Here's an example:

Plain Text
from llama_index.core.node_parser import SentenceSplitter

# Define a Sentence Splitter with a larger chunk size
splitter = SentenceSplitter(chunk_size=2048)


In this example, the SentenceSplitter is created with a chunk_size of 2048. This means that each chunk will contain up to 2048 tokens.

Another way to influence the LLM's behavior is to use a different response mode when creating the response synthesizer. For example, you can try using the "tree_summarize" response mode, which generates a tree-structured summary that might include more details from your text.

Plain Text
from llama_index.core import get_response_synthesizer

# Create a response synthesizer with the "tree_summarize" response mode
response_synthesizer = get_response_synthesizer(response_mode="tree_summarize")


Please note that these are simplified examples and you might need to adjust them according to your specific needs and setup.

For more information, you can refer to the LlamaIndex documentation.
----------------------
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:
----------------------