Hello. I don't manage to obtain parallel call while summarizing documents:
import nest_asyncio
nest_asyncio.apply()
...
service_context = ServiceContext.from_defaults(llm=llm, chunk_size=1024)
response_synthesizer = get_response_synthesizer(
use_async=True,
response_mode="tree_summarize",
summary_template=PromptTemplate(custom_tmpl)
)
doc_summary_index = DocumentSummaryIndex.from_documents(
tables,
service_context=service_context,
response_synthesizer=response_synthesizer,
show_progress=True,
use_async=True,
)
Logs show that each element in
tables
is summarized one by one, waiting for the previous to be completed, no parallelism at all. Am I missing something? I didn't find a doc specifying how to parameterize the level of parallelism (how many call to an API in parallel for example?)