Settings.context_window
but accessing the actual contents inside the window, which permanently stores the system prompt and the query-response exchanges up to the token limit.Settings.embed_model = HuggingFaceEmbedding(model_name = "BAAI/bge-small-en-v1.5") Settings.llm = Ollama(model = "mistral", request_timeout = 180.0) if index is None: chat_engine = SimpleChatEngine.from_defaults( memory = chat_memory, system_prompt = template ) else: chat_engine = index.as_chat_engine( chat_mode = "condense_plus_context", memory = chat_memory, similarity_top_k = 3, system_prompt = template, verbose = True )
system_prompt
is a permanent portion of the context window, it doesn't show up in the network traffic, so I'm looking for another method to directly access the context window contents.[START] [sys_prompt] [query_1] [response_1] [query_2] [response_2] ... [END]
[START] [sys_prompt] [query_31] [response_31] [query_32] [response_32] ... [END]