Find answers from the community

n
ngle
Offline, last seen 6 months ago
Joined September 25, 2024
Hi all, I built a vector from csv (about 20 rows) with each row as a Document, when I query such as "How many rows..." it only responds 2 rows, it depends on the similarity_top_k variable , I try with 3, and it returns 3. Can I get the exact row number of the csv file?

documents = []
with open(filename) as file_obj:
reader_obj = csv.reader(file_obj)
heading = next(file_obj)
header = list(heading.strip().split(','))

for row in reader_obj:
record = {}
for i, value in enumerate(row):
record[header[i]] = ' '.join(value.split())

doc_id = row[0]
content_from_csv = json.dumps(record)
documents.append(Document(text=content_from_csv, doc_id=doc_id))
return documents
3 comments
n
E
W