Sorry, I'm abit confused about usage of
IngestionPipeline
along with self-created Documents:
embed_model = HuggingFaceEmbedding(model_name="WhereIsAI/UAE-Large-V1")
documents = [Document(text="...", metadata={...}), Document(text=",,,", metadata={...})]
pipeline=[
TokenTextSplitter(
chunk_size=512,
chunk_overlap=20,
separator=" "
),
embed_model
]
pipeline.run(documents=documents)
As of my understanding, the
pipeline.run
must use the
embed_model
to create embeddings for the documents and autopopulate them?
With the code above, I can see that my embeddings are empty in my qdrant local.
This is from qdrantUI:
{"id_": "04fd0f75-0641-4ff9-96df-da768e99922c", "embedding": null,