hey @unbittable, just curious about the use case you have for saving to string
I'm basically serializing indices into a database
it's turned out to involve jumping through a lot of hoops
llama does some cool things, but it feels a bit like it's designed for either scripting or heavyweight data management rather than web applications -- designed to recalculate everything each time through
ah gotcha, our plan is to directly support saving to the database (you just specify the connection)
I've got an ORM it's got to play nice with
saving/loading directly with the DB connection would create issues with transactions and other data munging that should be happening in the same operation
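(To make the transaction concern concrete, here's a minimal sketch -- schema and names are hypothetical, and it uses sqlite3 directly rather than an ORM. The point is that a save-to-string API lets the serialized index ride in the same transaction as the rest of the writes:)

```python
import json
import sqlite3

# Hypothetical schema: documents plus their serialized index, written together.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE documents (id INTEGER PRIMARY KEY, title TEXT)")
conn.execute("CREATE TABLE indices (doc_id INTEGER, payload TEXT)")

def save_document(conn, doc_id, title, index_payload):
    """Write the document row and its serialized index atomically.

    `index_payload` stands in for whatever serialize-to-string produces;
    if either insert fails, both roll back together. A library that writes
    straight to its own connection couldn't join this transaction.
    """
    with conn:  # one transaction for both writes
        conn.execute("INSERT INTO documents VALUES (?, ?)", (doc_id, title))
        conn.execute(
            "INSERT INTO indices VALUES (?, ?)",
            (doc_id, json.dumps(index_payload)),
        )

save_document(conn, 1, "report.pdf", {"nodes": ["..."]})
row = conn.execute("SELECT payload FROM indices WHERE doc_id = 1").fetchone()
print(json.loads(row[0]))
```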
gotcha, are you mostly building an index over a few documents at a time? And then saving the serialized string, like you were doing previously?
yeah, I'm going to have a large and expanding library of documents and the user will select a few to perform an operation on. I'll hydrate the index for each, compose a graph, and then run some queries / prompts against that.
Llama is awesome because it's making this possible
but it's also really difficult to set some things up (ex: the defaults for service context are a little painful, especially to override for automated testing without hitting the OpenAI APIs)
gotcha, this is valuable feedback
ya totally, testing is a pain right now. We are setting up a better way to mock it
It's looking like I might have to drop down a level of abstraction and only store the embeddings, then build the index from those each time. It'll mean some extra processing, but it'll let me skip saving the indices to strings.
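(A minimal sketch of that fallback, with pure-Python cosine similarity standing in for the real vector index -- row shapes and names are hypothetical:)

```python
import math

# Hypothetical stored rows: (doc_id, embedding) persisted in the DB
# instead of a serialized index; the index is rebuilt from these on load.
stored = [
    ("a", [1.0, 0.0]),
    ("b", [0.0, 1.0]),
    ("c", [0.7, 0.7]),
]

def cosine(u, v):
    dot = sum(x * y for x, y in zip(u, v))
    return dot / (math.hypot(*u) * math.hypot(*v))

def build_index(rows):
    """The 'index' here is just the rows in memory; rebuilding is a cheap
    pass over stored embeddings rather than deserializing index state."""
    return list(rows)

def query(index, vector, top_k=1):
    ranked = sorted(index, key=lambda r: cosine(r[1], vector), reverse=True)
    return [doc_id for doc_id, _ in ranked[:top_k]]

index = build_index(stored)
print(query(index, [1.0, 0.1]))  # closest stored embedding wins
```

The trade-off is exactly the one described above: a bit of recomputation on every load in exchange for never depending on the library's serialization format.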
A little frustrating when I already wrote working code against 0.5.27, so I have to decide what to do about that.
but hey, a moving target is better than an unmaintained one
yea I totally get that, thanks for the patience haha
thanks for all your work supporting us crotchety, demanding developers!
It's not hard to add serialize-to-string back, let me take a look
the way I'm doing it right now is a bit hacky because I can't figure out exactly how the (now-proliferating) classes get assembled into the VectorStoreIndex, so I'm just punching through them to get directly to the KVstore
(and also all that dependency injection gets a little tiresome when there are so many layers)
ya as you said, there are different use cases that we want to support: 1) data management/persistent data/etc with hundreds or thousands of documents, 2) adhoc/app workflows dealing with a few documents at a time
so we get stretched a bit in both directions.
Do you anticipate continued API volatility at this rate? I know it's a hazard of pre-1.0 software, but having some idea of what's coming up would help
def not as drastic as the previous change haha
but ya trying to balance moving forward and not being too volatile
Yes, saving to string to store in database is definitely needed.