Has anyone tried putting there

At a glance

Has anyone tried putting there application on the cloud? I've got a python application for creating SimpleVectorIndex from user documents and I was thinking I could just spin up a new instance everytime a user uploads content? Not sure on what the best option would be for a poor Uni student 😅😂

26 comments

HHAL 9000

Why would you spin up a new instance every time they upload a document. Wouldn't that be more expensive 😂? or I missing the context. if you could give more context I could try to help.

MMatthew 🦘🫕

I was thinking maybe a AWS lambda function?

https://docs.aws.amazon.com/lambda/latest/dg/lambda-python.html

I currently have a Vuejs + FastApi & Postgres application

I was wondering if would be better to host the Vuejs on CloudFlare (Very cheap) and move my postgres db to Supabase (Cheap) 🙂

So I was wondering if hosting the python somewhere else might be a viable option

Happy to hear any suggestions though 😊

HHAL 9000

I see now. Like you said. I think you could have lambda function creates the simple vector index from uploaded user content and then you could save the index to temporary json file file and then store it in a S3 bucket.

MMatthew 🦘🫕

Yeah exactly 😊

So I would only need to do the following if I'm correct:

Store uploaded file from Vuejs
Create Index
Store output
Sent output to my Supabase db

HHAL 9000

but you would need another lambda function to create the simplevectorindex from json file.

MMatthew 🦘🫕

Alrighty I'll give it a go 🙂 thanks @HAL 9000

Happy to hear any other suggestions people might have

HHAL 9000

because the json file could only be recreated by using LlamaIndex package again:

Plain Text

GPTSimpleVectorIndex.load_from_disk("index_file.json")

HHAL 9000

I actually forgot you could also save it as string

HHAL 9000

Plain Text

index = GPTSimpleVectorIndex(documents=[Document(...), ...])
index_str_as_json_str = index.save_to_string()

GPTSimpleVectorIndex.load_from_string(index_str_as_json_str)

MMatthew 🦘🫕

Perfect that's what I was thinking

HHAL 9000

and I just realized what I wrote now lmao 😂 edited it.

MMatthew 🦘🫕

This is the calculation using AWS lambda alone, assuming I don't get >10K requests It's free 🙂 if I get more that's a good problem to solve hahah

Attachment

MMatthew 🦘🫕

I could also store my embeddings Supabase I think, I'll need to do further research 🙂

https://supabase.com/blog/openai-embeddings-postgres-vector

MMatthew 🦘🫕

@jerryjliu0 is it possible to output the OpenAi output Embeddings? 🙂

jjerryjliu0

@Matthew 🦘🫕 are you looking for embeddings across all document chunks, or only the document chunks that are returned with the response?

jjerryjliu0

response.source_nodes contains source document chunks, each SourceNode should have embeddings

MMatthew 🦘🫕

I think just per document response for now, and I'll just store that and everything else from the response in the Db 🙂

Thanks for you help and for building such a great tool! 👌👌

HHAL 9000

I didnt know supabase supports storing embeddings. Good to know

MMatthew 🦘🫕

I think they only added it within the last month, from what I read on Hackernews it isn't the most efficient compared to pinecone or Quadrant but it'll do the job for me so I can get a beta version of my app out 🙂

HHAL 9000

Does it only stores it or does it also do embedding similarity like pinecone?

MMatthew 🦘🫕

I think it has similarity as well 🙂

MMatthew 🦘🫕

Attachment

bbbornsztein

Sounds like you already have some good options, but just putting a plug in for Fly.io. It's what I'm using for Agent-HQ and it's been really nice. Here's tutorial:

https://ahmadrosid.com/blog/deploy-fastapi-flyio

They also have Machines (https://fly.io/docs/machines/guides-examples/functions-with-machines/) which you can use to run functions as a service (e.g. run one off machines with user code)

MMatthew 🦘🫕

Oh awesome suggestion, I haven't used Fly.io before but that seems like a great option, thank you!

MMatthew 🦘🫕

Okay for an update, When I tried to upload the layer for lambda python packages I wasn't able to fit everything below the max size, so I'm going to try your recommendation @bbornsztein 🙂

MMatthew 🦘🫕

Sorry for the slow update, I was able to use LLamaIndex on the cloud using ECS, I mainly followed this tutorial:
https://www.eliasbrange.dev/posts/deploy-fastapi-on-aws-part-2-fargate-alb/

And I had no major issues! At the current time it's going to cost $5 usd per month in AWS fees but I'll keep you updated as I increase my processing 🙂

Add a reply

Find answers from the community

Has anyone tried putting there