
Hi everyone. I'm just getting started with this, and I've got it up and running, however I'm having an issue with the "create and refine" process. Sometimes I get a response like "Return the original answer. The new context is not relevant..." I've looked through the documentation but was unable to find any way to capture the intermediate responses that are being generated.
is this using ChatGPT? or text-davinci-003?
if the former, this is a known issue with the ChatGPT API. sometimes it struggles to follow more complex instructions
I'm using the chat gpt API. I'm able to see the intermediate prompts and responses by setting the logging level to debug, however it's quite messy, as it includes a lot of extra information. I'd like to add a verbose option so that the output would be something like:
Initial response: ...
Refine context: (snippet, not full text)
Refined response: Return the original answer. The provided context does not relate to ...
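(For reference, the debug-level output mentioned above can be turned on with the standard library's logging module; this is just the usual Python setup, not anything gpt_index-specific:)

```python
import logging

# Enable DEBUG logging so the intermediate create-and-refine
# prompts/responses are printed to the console. force=True (Python 3.8+)
# replaces any handlers that were already configured.
logging.basicConfig(level=logging.DEBUG, force=True)
```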
@Martok good point. will take a look at adding!
I achieved what I was originally trying to do by using a custom StringIO logger to grab the debug output and then filtering each line. To make it work, I had to modify /gpt_index/response/utils.py and call .strip() on response_text to remove the excess newlines that sometimes get added in front of the response somewhere along the line.
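(A sketch of that StringIO approach, using only the standard logging module; the `logging.debug(...)` call stands in for the library's real debug output, and the marker string follows the debug lines shown in this thread:)

```python
import io
import logging

# Capture all DEBUG output in an in-memory buffer instead of the console.
log_buffer = io.StringIO()
handler = logging.StreamHandler(log_buffer)
handler.setLevel(logging.DEBUG)

logger = logging.getLogger()  # root logger, which gpt_index logs to
logger.setLevel(logging.DEBUG)
logger.addHandler(handler)

# ... run index.query(...) here; the stand-in below simulates its output ...
logging.debug("> Initial response: 42")

# Keep only the lines carrying intermediate responses.
interesting = [
    line.strip()
    for line in log_buffer.getvalue().splitlines()
    if "Initial response:" in line or "Refined response:" in line
]

logger.removeHandler(handler)
```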
I'm still getting some newlines thrown in there sometimes, like this:
DEBUG:root:> Refined response:

Refined answer: ....
Instead of modifying utils.py, adding this after line 125 in /gpt_index/indices/response/builder.py seems to work better.
response = response.strip()
However, I'm unsure of how to handle newlines if they appear in the middle of the generated responses, which does occasionally happen.
I see....to clarify, is this purely for logging purposes? or are you trying to strip newlines from the response text for another purpose?
Well, using the debug output was just a relatively easy way to get the intermediate responses, and removing newlines is only needed because my code reads the log output line by line, looking for "Initial response:" and "Refined response:". Ideally, I think any intermediate responses should be stored in the Response object, since the final answer is not always helpful after going through the refine process.
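(The line-by-line scan described above might look something like this sketch; `extract_responses` is a hypothetical helper, and the marker strings match the debug lines quoted earlier in the thread. It tolerates the stray blank lines that sometimes push the answer text onto a later line:)

```python
def extract_responses(log_text: str) -> list[tuple[str, str]]:
    """Collect (marker, text) pairs from captured debug output."""
    markers = ("Initial response:", "Refined response:", "Refined answer:")
    results = []
    pending = None  # marker whose answer text was pushed to a later line
    for line in log_text.splitlines():
        line = line.strip()
        for marker in markers:
            if marker in line:
                text = line.split(marker, 1)[1].strip()
                if text:
                    results.append((marker, text))
                    pending = None
                else:
                    pending = marker  # answer will appear on a later line
                break
        else:
            if pending and line:
                results.append((pending, line))
                pending = None
    return results

sample = "DEBUG:root:> Refined response:\n\nRefined answer: all good"
print(extract_responses(sample))
```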
this is something we'll look into adding very soon!
@Martok sorry for the delay. this is a very initial attempt towards solving this: https://github.com/jerryjliu/llama_index/pull/745. just do from gpt_index.logger import LlamaLogger and add it during the index.query call. Currently only works during create/refine response synthesis (which is what you wanted to see)