How can we execute batch inference optimally with query_pipeline? The documentation covers asynchronous, parallel, and multi-root execution, but it says little about batch inference specifically. Would asynchronous execution be the most effective approach?
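
For context, here is a minimal, library-agnostic sketch of what I mean by asynchronous batch inference: fan a list of queries out with `asyncio.gather`, capped by a semaphore. `run_query` is a hypothetical stand-in for a pipeline's async run method, not the actual query_pipeline API:

```python
import asyncio

async def run_query(query: str) -> str:
    # Hypothetical stand-in for an async pipeline call
    # (e.g. an awaitable equivalent of query_pipeline's run).
    await asyncio.sleep(0)  # simulate non-blocking I/O
    return f"answer:{query}"

async def batch_infer(queries, max_concurrency=8):
    # Bound concurrency so the backend isn't overwhelmed.
    sem = asyncio.Semaphore(max_concurrency)

    async def bounded(q):
        async with sem:
            return await run_query(q)

    # Results come back in the same order as the input queries.
    return await asyncio.gather(*(bounded(q) for q in queries))

results = asyncio.run(batch_infer(["q1", "q2", "q3"]))
print(results)
```

Is this roughly the pattern the async support is intended for, or is there a dedicated batch entry point I'm missing?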