Answer metadata

Hey guys!

I have a CSV with content and some metadata, such as a link to the content and its title.

I already wrote code that lets me ask questions and get responses. But I have one simple question: how can I display not only the answers but also the metadata of the answers, like the title and link?

17 comments

Logan M

That metadata is a column in your csv?

Try checking the source nodes from the response object

response.source_nodes gets a list of every node used to create the response
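
A minimal sketch of reading metadata back out of the source nodes, assuming each source node (or a wrapped `.node` on it, depending on the library release) carries an `extra_info` dict. The helper name `source_metadata`, the `keys` parameter, and the `fake_response` stand-in are hypothetical and only illustrate the shape; a real response comes from querying the index.

```python
from types import SimpleNamespace


def source_metadata(response, keys=("title", "link")):
    """Collect only the selected metadata keys from each source node."""
    rows = []
    for source in response.source_nodes:
        # Assumption: some releases wrap the node as `source.node`,
        # others expose the fields on the source node itself.
        node = getattr(source, "node", source)
        extra_info = getattr(node, "extra_info", None) or {}
        rows.append({key: extra_info.get(key) for key in keys})
    return rows


# Stand-in for a real query response, used only to show the expected shape.
fake_response = SimpleNamespace(
    source_nodes=[
        SimpleNamespace(
            extra_info={
                "title": "Intro",
                "link": "https://example.com/intro",
                "content": "...",
            }
        ),
        SimpleNamespace(extra_info=None),  # a node that has no metadata
    ]
)

for row in source_metadata(fake_response):
    print(row)
```

Passing a different `keys` tuple is one way to show only some of the columns, provided those columns were stored as metadata when the documents were created.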

korzhov_dm

@Logan M how can I control the columns? For example, I have 3 columns (content, link, and title) and want to display only 2 of them.

Attachment

korzhov_dm

Yeah, and also this output doesn't provide info about my metadata at all:(

Logan M

How did you load the csv?

korzhov_dm

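Here is the code: @Logan M

# Imports assume the legacy llama_index API used in this thread
from pathlib import Path

from langchain.llms import OpenAI
from llama_index import GPTSimpleVectorIndex, LLMPredictor, PromptHelper, ServiceContext, download_loader

# Fetch the CSV loader and read the file into documents
SimpleCSVReader = download_loader("SimpleCSVReader")
loader = SimpleCSVReader()
documents = loader.load_data(file=Path('output_file.csv'))

# Prompt sizing: context window, overlap between chunks, and output budget
max_input_size = 4096
max_chunk_overlap = 20
num_output = 512
prompt_helper = PromptHelper(max_input_size, num_output, max_chunk_overlap)

# Build the index with a deterministic (temperature=0) LLM
llm_predictor = LLMPredictor(llm=OpenAI(temperature=0, model_name="gpt-4", max_tokens=num_output))
service_context = ServiceContext.from_defaults(llm_predictor=llm_predictor, prompt_helper=prompt_helper, chunk_size_limit=2500)
index = GPTSimpleVectorIndex.from_documents(documents, service_context=service_context)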

Logan M

That csv loader will create a document object for each row in the csv, just by concatenating everything

You might also be interested in the "PagedCSVReader", which will do something similar but include the column names

If there's extra info you want to store, you can use the extra_info field of each document

document.extra_info = {"Some key": "Some val", ...}

korzhov_dm

Oh, got it. I will try, thank you:)

korzhov_dm

@Logan M What if I load the data with langchain (which keeps track of metadata) and then use Document.from_langchain_format(langchain_document) to convert it to a GPT Index document? Will that work?

Logan M

Yea, that will set the extra_info field for you. Then that info should show up in the source nodes

korzhov_dm

@Logan M but for some reason I got this:

ValueError: nodes must be a list of Node objects.

Logan M

What line of code throws that error? Did you still use the from_documents method to create the index?

korzhov_dm

I fixed this:) @Logan M

korzhov_dm

Thank you so much:)

korzhov_dm

@Logan M I've tried to use PagedCSVReader, but I can't understand how to add extra_info to each document based on my columns:(

Logan M

Hmm. That's annoying.

Maybe it's best to just manually load the sheet yourself and create document objects, so it's formatted the way you want?
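
A rough sketch of that manual approach, using only the standard `csv` module: one column becomes the document text and the other columns go into `extra_info`. The `RowDocument` class is a hypothetical stand-in for the library's document object, and the column names (`content`, `title`, `link`) are assumed from the CSV described earlier in the thread.

```python
import csv
import io
from dataclasses import dataclass, field


@dataclass
class RowDocument:
    """Hypothetical stand-in for the library's Document (text plus extra_info)."""
    text: str
    extra_info: dict = field(default_factory=dict)


def load_rows(csv_file, text_column="content", metadata_columns=("title", "link")):
    """Turn each CSV row into one document: one column as text, the rest as metadata."""
    documents = []
    for row in csv.DictReader(csv_file):
        extra_info = {column: row[column] for column in metadata_columns}
        documents.append(RowDocument(text=row[text_column], extra_info=extra_info))
    return documents


# In-memory sample standing in for the real file.
sample = io.StringIO(
    "content,link,title\n"
    "First article body,https://example.com/1,First\n"
    "Second article body,https://example.com/2,Second\n"
)
docs = load_rows(sample)
print(docs[0].extra_info)
```

With a real file, the same function should work on `open(path, newline="", encoding="utf-8")`. Swapping `RowDocument` for the library's own document class, with its `extra_info` field set the same way, should let the index carry the title and link through to the source nodes.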

korzhov_dm

I solved it:)

I literally rewrote the whole langchain CSV loader class))) @Logan M

Logan M

Hahaha nice!
