Hi, when calling a simple code to predict, if there is unsufficient quota on the OpenAI, the code tries to hit the point multiple times what causes 429 error too many requests. How can I prevent these hittings and just get one exception about non-suffitient funds? Thanks
llm = OpenAI(temperature=0, model=model_name, api_key=ai_key,
callback_manager=callback_manager)
response = llm.predict(Prompt(prompt))