
susa
Joined September 25, 2024

Pycairo

While trying to use the ConfluenceReader loader via
ConfluenceReader = download_loader('ConfluenceReader')

I get an error while the loader's dependencies are being pip-installed. Specifically, the pycairo package fails to build with this error:
Command '['pkg-config', '--print-errors', '--exists', 'cairo >= 1.15.10']' died with <Signals.SIGSEGV: 11>.
      [end of output]
  
  note: This error originates from a subprocess, and is likely not a problem with pip.
  ERROR: Failed building wheel for pycairo

Has anyone else run into a similar issue and found a fix?
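To narrow this down, it may help to run the same pkg-config query outside of pip and see whether pkg-config itself crashes. The sketch below is only a diagnostic, not a fix; the helper names `describe_returncode` and `check_cairo` are made up for illustration, and the query arguments are copied from the error output above.

```python
import shutil
import signal
import subprocess


def describe_returncode(returncode: int) -> str:
    """Turn a subprocess return code into a readable status."""
    if returncode == 0:
        return "ok"
    if returncode < 0:
        # On POSIX, a negative code means the process was killed by a signal.
        try:
            name = signal.Signals(-returncode).name
        except ValueError:
            name = f"signal {-returncode}"
        return f"killed by {name}"
    return f"exited with status {returncode}"


def check_cairo(tool: str = "pkg-config") -> str:
    """Run the pkg-config query that appears in the pycairo build error."""
    if shutil.which(tool) is None:
        return f"{tool} not found on PATH"
    result = subprocess.run(
        [tool, "--print-errors", "--exists", "cairo >= 1.15.10"],
        capture_output=True,
        text=True,
    )
    return describe_returncode(result.returncode)


if __name__ == "__main__":
    print(check_cairo())
```

If this also reports a signal, the problem is probably in the pkg-config installation or in the cairo `.pc` files rather than in pip or the loader.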
1 comment

Refresh

Does anyone have an idea why this could be happening?
  1. I inserted 85 documents into a Weaviate class using the vector store index.
  2. I deleted one of the documents using its doc_id.
  3. I called index.refresh() with the same list of 85 documents.
I expected to have 85 documents in my DB again, but that is not the case: there are still only 84.
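One possible explanation, stated as an assumption rather than something confirmed in this thread: refresh may decide what to re-insert by comparing each document's content hash with a hash recorded at insert time. If the delete removed the entries from the vector store but left that hash record behind, refresh would see an unchanged document and skip it. The toy model below reproduces the steps above; `ToyIndex` and its `delete_hash` flag are hypothetical and are not LlamaIndex's implementation.

```python
import hashlib


class ToyIndex:
    """Toy stand-in for an index: a vector store plus a record of doc hashes."""

    def __init__(self):
        self.vector_store = {}  # doc_id -> text (what the database holds)
        self.doc_hashes = {}    # doc_id -> content hash (bookkeeping only)

    @staticmethod
    def _hash(text):
        return hashlib.sha256(text.encode("utf-8")).hexdigest()

    def insert(self, doc_id, text):
        self.vector_store[doc_id] = text
        self.doc_hashes[doc_id] = self._hash(text)

    def delete(self, doc_id, delete_hash=False):
        # By default only the stored data goes away; the hash record stays.
        self.vector_store.pop(doc_id, None)
        if delete_hash:
            self.doc_hashes.pop(doc_id, None)

    def refresh(self, docs):
        """Re-insert only documents whose recorded hash is missing or different."""
        refreshed = []
        for doc_id, text in docs.items():
            changed = self.doc_hashes.get(doc_id) != self._hash(text)
            if changed:
                self.insert(doc_id, text)
            refreshed.append(changed)
        return refreshed


docs = {f"doc-{i}": f"text {i}" for i in range(85)}

index = ToyIndex()
for doc_id, text in docs.items():
    index.insert(doc_id, text)

# Steps 2 and 3 from the post: delete one document, then refresh with all 85.
index.delete("doc-7")
stale_result = index.refresh(docs)
count_with_stale_hash = len(index.vector_store)

# Same steps again, but this time the hash record is removed as well.
index.delete("doc-7", delete_hash=True)
fresh_result = index.refresh(docs)
count_after_full_delete = len(index.vector_store)
```

In this model the first refresh changes nothing because every recorded hash still matches, while the second one restores the missing document. If your LlamaIndex version offers a way to remove the document from the docstore when deleting (for example a `delete_from_docstore` option), that may be worth trying.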
3 comments
Hi, I'm trying to use LlamaIndex for insertions and updates to a Weaviate database, but I'm having trouble understanding a few things.
When creating the Document object, I initialize it with document_object = Document(text="my_text", doc_id="my_doc_id", extra_info=extra_info). I then use index = GPTVectorStoreIndex.from_documents(all_docs, storage_context=storage_context) to initially insert the documents into the database.
However, I'm noticing that the doc_id I pass in is stored in a ref_doc_id property in the Weaviate class, while the doc_id property stored in the class is auto-generated. This is a problem because I can't keep track of which chunks of the document I have inserted. It also means that I can't prevent duplicate inserts. Is there any way to override the doc_id that is generated?
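The behaviour described suggests that each stored object is a chunk of the document, with ref_doc_id pointing back to the source document. One way to keep chunks trackable and make repeated inserts idempotent is to derive each chunk's ID from the parent doc_id and the chunk's position. The sketch below shows the idea in plain Python; `CHUNK_NAMESPACE`, `chunk_text`, `chunk_id`, `build_chunks`, `upsert` and the in-memory `store` are all hypothetical, and whether LlamaIndex lets you supply chunk IDs like this depends on the version.

```python
import uuid

# Hypothetical namespace; any fixed UUID works as long as it never changes.
CHUNK_NAMESPACE = uuid.UUID("12345678-1234-5678-1234-567812345678")


def chunk_text(text, size):
    """Split text into fixed-size character chunks (a deliberately simple splitter)."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def chunk_id(doc_id, position):
    """Derive a stable UUID from the parent doc_id and the chunk position."""
    return str(uuid.uuid5(CHUNK_NAMESPACE, f"{doc_id}:{position}"))


def build_chunks(doc_id, text, size=20):
    """Build chunk records that carry both their own id and the parent doc_id."""
    return [
        {"id": chunk_id(doc_id, i), "ref_doc_id": doc_id, "text": piece}
        for i, piece in enumerate(chunk_text(text, size))
    ]


store = {}  # in-memory stand-in for the database, keyed by chunk id


def upsert(chunks):
    """Insert chunks keyed by id; re-inserting the same chunk overwrites it."""
    for chunk in chunks:
        store[chunk["id"]] = chunk


chunks = build_chunks("my_doc_id", "my_text " * 10)
upsert(chunks)
upsert(build_chunks("my_doc_id", "my_text " * 10))  # duplicate insert, no new rows

# All chunks of one document can be found again through ref_doc_id.
chunks_for_doc = [c for c in store.values() if c["ref_doc_id"] == "my_doc_id"]
```

Because the IDs are deterministic, inserting the same document twice overwrites the existing chunks instead of adding new ones, and filtering on ref_doc_id recovers every chunk that belongs to a document.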
5 comments