Find answers from the community

Updated 3 weeks ago

Llm

Which llm is fastest has least response time when deployed as a chatbot
@Logan M @WhiteFang_Jr
W
s
P
3 comments
You could try using llms from grok or cerebras platform. They claim to provide fastest text generation.
depends of the model param and the infra, expect slower token/second on high parameter model. Grok infra claim to be one of the fastest in the field
I don’t think it is opensource. Also can you suggest how can I add human feedback loop to train the model to give better response
@WhiteFang_Jr @saika @Logan M
Add a reply
Sign up and join the conversation on Discord