I'm having an issue getting the ingestion pipeline getting to work with Weaviate + Redis Cache (so that I can ingest new documents later).
I had to add index=VectorStoreIndex(nodes, storage_context = storage_context) to get it to load into weaviate but now it seems that the Cache is not taken into account. Without the index line it seemed like it was processing data but never loading.
When using Chroma I was able to get the ingestion pipeline + cache to work (everything the same minus the index = line.
pipeline = IngestionPipeline(
transformations=[
SimpleNodeParser(),
# SentenceSplitter(chunk_size=512, chunk_overlap=20),
# TitleExtractor(nodes=5),
# SummaryExtractor(summaries=["prev", "self"]),
# KeywordExtractor(keywords=10),
# OpenAIEmbedding(),
],
vector_store=vector_store,
cache=ingest_cache,
)
nodes = pipeline.run(documents=documents, storage_context=storage_context)
index = VectorStoreIndex(nodes, storage_context = storage_context)