Yeah, it would have to be some custom implementation to accept multiple inputs. But even then, I think you'd only get use out of it by running the LLM object directly, since nothing else in the framework will know to take advantage of that
I think the cost will be the same though, no? Unless by batch you mean that new 24-hour turnaround batch thing
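Rough sketch of what I mean by a custom implementation, in case it helps. Everything here is made up for illustration: `BatchLLM`, `complete_batch`, and `fake_complete` are hypothetical names, not part of any framework, and `complete_fn` stands in for whatever single-prompt call your LLM object exposes. It just fans the prompts out over threads, so you'd have to call it directly yourself:

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


class BatchLLM:
    """Hypothetical wrapper that accepts multiple inputs; not a framework class."""

    def __init__(self, complete_fn: Callable[[str], str], max_workers: int = 4):
        # complete_fn is an assumption: any function that takes one prompt
        # and returns one completion string.
        self.complete_fn = complete_fn
        self.max_workers = max_workers

    def complete(self, prompt: str) -> str:
        # Single-input path, the only one the rest of a framework would call.
        return self.complete_fn(prompt)

    def complete_batch(self, prompts: List[str]) -> List[str]:
        # Send each prompt as its own request, concurrently.
        # pool.map returns results in the same order as the inputs.
        with ThreadPoolExecutor(max_workers=self.max_workers) as pool:
            return list(pool.map(self.complete_fn, prompts))


def fake_complete(prompt: str) -> str:
    # Stand-in for a real API call so the sketch runs offline.
    return prompt.upper()


llm = BatchLLM(fake_complete)
results = llm.complete_batch(["hello", "world"])
print(results)
```

Each prompt is still its own request here, so this only saves wall-clock time, not money.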