LlamaIndex

Log inLog into community

Find answers from the community

Updated 2 years ago

I have no idea what I am doing But I am

I have no idea what I am doing But I am

At a glance

·

I have no idea what I am doing; But I am trying to upload a file from nodejs to python fastapi, and create a qdrant index of it and place it in qdrant. And query the index and return an answer.

But it only returns -1 for the query.

I am using docker compose, and I figure I have loads of errors and it cant connect to qdrant or something like that. If anyone wants to take a glance.

Plain Text

import qdrant_client
client = qdrant_client.QdrantClient(
    host="qdrant"  # qdrant is the name of the docker container
)

@app.post("/upload")
async def upload_file(file: UploadFile = File(...)):
    uniqueid = uuid.uuid4()
    os.makedirs(f"files/{uniqueid}/", exist_ok=True)
    file_location = f"files/{uniqueid}/{file.filename}"
    with open(file_location, "wb+") as file_object:
        file_object.write(file.file.read())
    documents = SimpleDirectoryReader(f"files/{uniqueid}/").load_data()
    print(documents)
    index = GPTQdrantIndex(documents, collection_name=uniqueid, client=client)
    response = index.query(
        "Jeg har lyst til å dra på ferie, kan jeg bruke firmahyttene?")
    shutil.rmtree(f"files/{uniqueid}/")
    return {"info": f"file '{file.filename}' successully indexed in Qdrant", "data":  response}

Plain Text

version: "3.8"
networks:
  app-tier:
    driver: bridge
services:
  fastapi:
    build: ./fastapi
    expose:
      - "5000"
    ports:
      - "5000:5000"  
    environment:
      - QDRANT_HOST=qdrant
    depends_on:
      - qdrant  
    networks:
      - app-tier   
    volumes:
      - ./fastapi:/app:Z
  sveltekit:
    build: ./sveltekit
    ports:
      - 3000:3000    
    networks:
      - app-tier    
    depends_on:
      - fastapi
    volumes:
      - ./sveltekit:/app:Z
    environment:
      - VITE_BACKEND_URL=http://localhost:5000
  qdrant:
    image: qdrant/qdrant:v0.10.1
    #    mem_limit: 450m
    ports:
      - 6333:6333
    volumes:
      - ./data/qdrant_storage:/qdrant/storage

    networks:
      - app-tier

Attachments

d

L

.

49 comments

It just returns;

Attachment

You might need to cast the response as a string in the return? Not sure with FastAPI

You might also want to check if the client object has some sort of connection Check function that you can log the result of

Think there might be some async issues or something, doesn't work making a simple vector index either, so think I'll try pulling generation out in its own function.

I wonder if fastAPI is timing out the function? It's a little weird it used the 204 response code (then again, not 100% sure how fast api decides the code lol)

If I remove the index part it's really fast, and documents work. In the instant I introduce index, it halts, gotta get the kids to bed and grind further 🙂

Haha nice, narrowed it down! 💪

@Logan M Oki, it looks like it worked when I created a new async function for the ingestion part, weird 😄 no idea how Python works, all new to me ^^

Haha I'm a little unfamiliar with pythons async stuff too, very weird. Glad it works! 😀

So this, worked. Oh well, lets get qdrant working ^^

Attachment

Yeah, everything worked out smoothly with qdrant after that. It also looked like the answer was alot better from qdrant than from simplevector, how about that

Ahhh nice! I'm guess qdrant might be using some different techniques for similarity matching? 👀

Its werid, i get a slightly different answer each upload. Even tho temp i 0 - so there is some magic going on there

Have to do some testing

Atleast its working ^^

And the 204 response was stupid, that was not from fastapi, but me trying to return bogus data in sveltekit 😄

I have a server side action that posts to fastapi, and it returns the backend data to front-end. And I forgot to do "await return.json()" and just did return 🙂

Ohhhh that makes sense! Good catch!

next issue, how do I query with vectors, thats a big WTF with the QdrantReader 😄

Query with vectors?

@Logan M https://llamahub.ai/l/qdrant

To get the index from the qdrant db, query vectors

Oohhh! See this example for help with that:https://github.com/jerryjliu/llama_index/blob/main/examples/vector_indices/QdrantIndexDemo.ipynb

Yeah, I just thought I could get the data straight from the qdrant db and not have to save and load from file. Because this would just be the same as doing simple vector?

You can save and load still, but it's saving much less info.

When you load, just pass it an empty array instead of your documents, along with the qdrant info

oh

Then it makes sense 😄 tnx for info!

💪

First query after index is good, when I reload the index, I get some weird answers, like major hallucinations.

Hallucinations with the same query?

Maybe check response.source_nodes to make sure it's actually using the qdrant index

@Logan M Yeah, thans for that!

Doesn't look quite right 😄

Attachment

Uh oh haha 😅 that looks like a text encoding issue somehow

My guess is qdrant is encoding with something other than utf8? 🤔

I found the issue - it was my lumpy fingers. I cleaned the filename and it was name-pdf so it didnt understand it was a pdf and parsed it as a binary file 😄

But atleast I am here now! So now I can get working ^^

Tnx for putting me in the right direction ^^

Attachment

my cleaned vs not file 😄

That's what I get for not knowing enough python ^^

Still gives bad answers tho, maybe I have to add some gpt-4 magic to the mix

Glad it's working at least! 💪💪

Yeah, its getting there ^^

Thinking Ill release an open sournce full working stack of Gunicorn, Uvicorn, Fastapi, Qdrant, Sveltekit, Docker compose when I get it up and running ^^

Nice, do it! That would be an awesome resource.

If you want, you could add it to the starter pack (currently at 82 stars 😉)

Totally cool if not though!

Maybe better, I am bad at facilitating that stuff anyways ^^ It has more stars than my top github project ^^

Jerry has (kindly) tweeted out the repo out a few times, which helps with the getting it noticed haha

I'm sure he can give you a shout out too for contributing it 💪

Yeah, thats pretty nice 🙂

@dagthomas did you get uvicorn playing nice w/ llamaindex? I seem to be playing whackamole between the different components from llamaindex when using uvloop

I think it's pretty difficult to use uvloop. I always set the loop type to be asyncio when using fastapi

maybe someday nest_asyncio() will support uvloop :PSadge:

Add a reply

Sign up and join the conversation on Discord