max_retries
parameter is passed to openai's OpenAI
client ("native" SDK) as well as it is used by llm_retry_decorator
which wraps methods such as chat
. This leads to retrying 3x3 times when we think we only retry 3 times. Is that on purpose or am I missing something?llm_retry_dectorator
vs OpenAI's logic, I don't know. Haven't explored OpenAI's logic for retries