VectorStoreIndex
allows to pass in the show_progress
flag which shows the nice tqdm-like bar when generating the chunks.refresh_index
method. Is there a way to show progress there? Would especially be interesting to see how many chunks/what kinds of chunks are being modified/refreshed.storage_context = StorageContext.from_defaults(persist_dir=storage_dir) index: VectorStoreIndex = load_index_from_storage(storage_context=storage_context) index.refresh_ref_docs(documents)
similiarty_top_k
to the value of 100 but only get back 20-22 nodes. Why is that? I'm using the AgentRunner
and a basic knowledge tooldef get_file_documents(config: FileLoaderConfig): parser = llama_parse_parser() files_info = fetch_file_list() logger.info( f"List of files ready for download. Number of files to download: {len(files_info)}" ) if config.use_llama_parse: file_paths = [] for file_info in files_info: resource_url = file_info["resourceURL"] file_name = file_info["fileName"] file_path = os.path.join(config.data_dir, file_name) if not os.path.exists(file_path): download_file(resource_url, file_path) logger.info( f"Successfully downloaded file: {file_name} and saved it on the server." ) file_paths.append(file_path) documents = [] for file_number, file_path in enumerate(file_paths, 1): file_name = os.path.basename(file_path) json_representation = parser.get_json_result(file_path) document = parser.load_data( file_path=file_path, extra_info={ "file_name": file_name, "file_number": file_number, "pages": json_representation[0]["pages"] } ) documents.append( document )
AgentRunner.from_llm
has a bug. In the is_function_calling_model
you are not testing for finetuned models. Which leads to the ReActAgent
being automatically used, even if your finetuned model is an OpenAI
one. @Logan Mchat_engine
function. I just refactored the code to use a query engine instead of an Agent-based approach for the finetuning_events. That should work, but still weird that it fails because of such a reason. I think there needs to be some fixing to be done to the ChatCompletionMessageToolCall
class?VectorStoreIndex.from_documents
gives me an API for this, but unfortunately i don't see it.