Maybe double check that the pipeline is returning a string.
Usually it returns a list of dicts, and you need to do something like response = pipeline(...)[0]["generated_text"]
Hmmmm... I'll check that out
Yeah, the _call function of the custom LLM needs to return a string, otherwise you get that error
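Something like this is what I mean, just a rough sketch assuming you're wrapping a transformers text-generation pipeline in LangChain's LLM base class (the model name is only a placeholder):

```python
from typing import List, Optional

from langchain.llms.base import LLM
from transformers import pipeline

# Placeholder checkpoint; swap in whatever GPT-NeoX-style model you're actually loading.
generator = pipeline("text-generation", model="EleutherAI/pythia-160m")


class PipelineLLM(LLM):
    @property
    def _llm_type(self) -> str:
        return "custom_pipeline"

    def _call(self, prompt: str, stop: Optional[List[str]] = None) -> str:
        # The pipeline returns a list of dicts, not a string, so pull the text
        # out of the first result. Note "generated_text" includes the prompt by default.
        outputs = generator(prompt, max_new_tokens=256)
        return outputs[0]["generated_text"]
```

The key bit is that _call hands back a plain str, which is what the predictor expects.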
That worked! Now I'm getting "Larger chunk overlap (20) than chunk size (-354)". Hmm
That's a classic: once the prompt helper settings and the chunk size limit are tweaked, weird stuff can happen
I think a fix was released just yesterday that might address this.
Otherwise, you'll have to play around with the chunk size settings, I think
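Roughly what I think is happening (illustrative numbers, not the library's exact formula): the prompt helper works out the usable chunk size from the max input size after subtracting the prompt text and the tokens reserved for the output, so a too-small max input size pushes it negative:

```python
# Illustrative arithmetic for how a negative chunk size can appear.
max_input_size = 512     # what the prompt helper was configured with
prompt_overhead = 610    # tokens eaten by the query + prompt template
num_output = 256         # tokens reserved for the model's answer

chunk_size = max_input_size - prompt_overhead - num_output   # -354
chunk_overlap = 20

# chunk_overlap (20) > chunk_size (-354) -> the error you're seeing
print(chunk_size)
```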
Played around with them, which got me to the next error:
"The size of tensor a must match the size of tensor b at non-singleton dimension 3."
Searching seems to suggest it's a problem with images, but my indices should only contain text, because I'm using a directory reader to parse PDF files.
Hmmm, does the error tell you if it's the embeddings or the LLM?
Looks like it's in transformers/models/gpt_neox/modeling_gpt_neox.py, at:
torch.where(causal_mask, attn_scores, mask_value)
I was breaching the input token limit...
Need to make sure the max input size in the prompt helper is set small enough
Now just trying the right combination of values to avoid the "Larger chunk overlap than chunk size" error.
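For reference, here's roughly the shape of my setup while I experiment (a sketch against the pre-0.6 API, since that's what I'm on; PipelineLLM is the custom wrapper from earlier, the path and numbers are just what I'm trying, and the embedding setup is left out):

```python
from llama_index import (
    GPTSimpleVectorIndex,
    LLMPredictor,
    PromptHelper,
    ServiceContext,
    SimpleDirectoryReader,
)

# Keep max_input_size within the model's real context window (2048 for the
# GPT-NeoX models I'm using) and leave room for the output and the overlap.
max_input_size = 2048
num_output = 256
max_chunk_overlap = 20

prompt_helper = PromptHelper(max_input_size, num_output, max_chunk_overlap)
llm_predictor = LLMPredictor(llm=PipelineLLM())  # custom wrapper from earlier

service_context = ServiceContext.from_defaults(
    llm_predictor=llm_predictor,
    prompt_helper=prompt_helper,
    chunk_size_limit=512,
)

documents = SimpleDirectoryReader("./pdfs").load_data()
index = GPTSimpleVectorIndex.from_documents(documents, service_context=service_context)
response = index.query("What do these documents cover?")
```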
What llama_index version do you have, btw? I think 0.6.0a6 should have a fix for that
If you have that version, I guess the fix is not perfect hah
Updated, but it seems like many things have changed and my code is broken all over the place!
I'll need to work on it later, but some of the changes look like they're going to help streamline what I'm trying to do...
Oof, yeah, many changes. Sorry, I thought maybe you were already on 0.6.x lol
But the changes are good!
No worries. Yeah, I might go back to the drawing board with how I'm setting up this app; there might be some opportunity to make it better with some of the changes.