How do I add system prompts for my chatbot
You can define the system prompt inside the service context itself, along with the llm and embed_model.

Plain Text
from llama_index import ServiceContext

service_context = ServiceContext.from_defaults(system_prompt="Define System prompt here!!!",...)
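
For context, here is a rough sketch of how that service context would typically be wired into an index and query engine so the system prompt is used on every query. The directory path and prompt text are placeholders:

Plain Text
from llama_index import ServiceContext, VectorStoreIndex, SimpleDirectoryReader

# the system prompt set here is sent to the LLM on every query
service_context = ServiceContext.from_defaults(
    system_prompt="You are a helpful assistant that answers only from the given context."
)

documents = SimpleDirectoryReader("./data").load_data()  # placeholder path
index = VectorStoreIndex.from_documents(documents, service_context=service_context)

query_engine = index.as_query_engine()
response = query_engine.query("What is this document about?")
print(response)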
@WhiteFang_Jr Can I add a document ID while creating documents and then query based on the document ID?
Yes, you can add a doc ID to a document. By default each document gets a unique ID. You can modify it with:
Plain Text
from llama_index import Document

doc = Document(text="hi", doc_id="UNIQUE_ID_HERE")

# you can also change it afterwards
doc.doc_id = "Change_ID"


To query it via doc ID, I would suggest adding the unique ID to the metadata and then doing metadata extraction/filtering at query time:
https://docs.llamaindex.ai/en/stable/examples/metadata_extraction/MetadataExtractionSEC.html
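
As a rough sketch of that suggestion (the metadata key document_id is just an example name, and filter support depends on the vector store you use), you can stamp the ID into each document's metadata and then restrict the query with an exact-match metadata filter:

Plain Text
from llama_index import Document, VectorStoreIndex
from llama_index.vector_stores.types import ExactMatchFilter, MetadataFilters

doc = Document(
    text="hi",
    metadata={"document_id": "UNIQUE_ID_HERE"},  # example metadata key
)
index = VectorStoreIndex.from_documents([doc])

# only retrieve nodes whose metadata matches this document_id
filters = MetadataFilters(
    filters=[ExactMatchFilter(key="document_id", value="UNIQUE_ID_HERE")]
)
query_engine = index.as_query_engine(filters=filters)
response = query_engine.query("What does this document say?")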
I am already following the same approach:

Plain Text
documents = []
for doc in docs:
    if "content_text" in doc:
        doc_content = doc["content_text"]
        doc_id = doc["document_id"]
        print(doc_id)
        documents.append(
            Document(
                text=doc_content,
                metadata={"file_name": doc_id, "date": str(datetime.now())},
                doc_id=doc_id,
                description=f"Useful to answer question related to file name or id {doc_id}",
            )
        )
But I am not getting correct answers
A design choice question: why did you guys choose to put system prompts in the service context? Can the system message be changed at each LLM call?
Yes @OceanLi
IMO, almost all the new LLMs have a system_prompt, which is a separate instruction that sits alongside your text and any other instructions you may or may not add to what is sent to the LLM.
For instance, in the case of GPT-3.5:
  • System prompt
  • User text (+/-) any other instructions you want to add [done via prompt templates in LlamaIndex]
So I think putting it in the service_context makes it easier for the user to add/edit, and it is easily available everywhere.
I can be wrong though 😅

Plus, to answer your question: for each LLM call made via the .query() method, the system prompt will remain the same unless you change it and then call .query() again.
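
If you really need a different system message per call, one option (a sketch that bypasses the query engine entirely, model name is just an example) is to pass chat messages to the LLM directly, swapping the system message on each call:

Plain Text
from llama_index.llms import ChatMessage, OpenAI

llm = OpenAI(model="gpt-3.5-turbo")

# the system message can be different on every chat() call
messages = [
    ChatMessage(role="system", content="Answer in one short sentence."),
    ChatMessage(role="user", content="What is a vector index?"),
]
response = llm.chat(messages)
print(response)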
Yeah the system prompt will be made dynamic
Can you help me with streaming responses?
I am getting an error:

Plain Text
for text in streaming_response.response_gen:
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'Response' object has no attribute 'response_gen'
This error occurs because the response you have is not a streaming response.
Yeah, but I have set streaming to True:
Plain Text
response_synthesizer = get_response_synthesizer(
    streaming=True,
    response_mode="tree_summarize",
)
But the query method where the actual response is generated does not support streaming:
Plain Text
def custom_query(self, query_str: str):
    nodes = self.retriever.retrieve(query_str)

    context_str = "\n\n".join([n.node.get_content() for n in nodes])
    response = self.llm.complete(
        qa_prompt.format(context_str=context_str, query_str=query_str)
    )
    return str(response)
Any ideas on how I should go about this?
I think you'll need to add the streaming_query method as well in your custom class
You can check this file to see what kind of response you are getting: https://github.com/run-llama/llama_index/blob/b8e565f24d2312d3c6ca8120c1b557e14c77e259/llama_index/response_synthesizers/base.py#L99
I guess this will help you to debug further
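
A quick way to confirm what you are actually getting back (import path assumed from the legacy llama_index layout, and query_engine assumed to be your engine from above):

Plain Text
from llama_index.response.schema import StreamingResponse

response = query_engine.query("test question")
print(type(response))

# only a StreamingResponse exposes .response_gen; a plain Response does not
if isinstance(response, StreamingResponse):
    for text in response.response_gen:
        print(text, end="")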
Plain Text
class RAGStringQueryEngine(CustomQueryEngine):
    """RAG String Query Engine."""

    retriever: BaseRetriever
    response_synthesizer: BaseSynthesizer
    llm: OpenAI
    qa_prompt: PromptTemplate

    def custom_query(self, query_str: str):
        nodes = self.retriever.retrieve(query_str)
        context_str = "\n\n".join([n.node.get_content() for n in nodes])
        query_str = self.qa_prompt.format(context_str=context_str, query_str=query_str)
        return self.response_synthesizer.synthesize(query_str, nodes)

this tweak worked
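
For completeness, here is a rough sketch of how the tweaked engine can be wired up and streamed from. The retriever and qa_prompt are assumed to be defined as in the snippets above, and the model name is just an example:

Plain Text
synth = get_response_synthesizer(
    streaming=True,
    response_mode="tree_summarize",
)

query_engine = RAGStringQueryEngine(
    retriever=retriever,
    response_synthesizer=synth,
    llm=OpenAI(model="gpt-3.5-turbo"),
    qa_prompt=qa_prompt,
)

# with streaming=True the synthesizer returns a StreamingResponse,
# so response_gen is available for token-by-token output
streaming_response = query_engine.query("What does the document say?")
for text in streaming_response.response_gen:
    print(text, end="")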