Harrison

Why do i get different chunks returned when asking the same question over and over again?

Hi

Quick question

Why do I get different chunks returned when asking the same question over and over again?

6 comments

Azure OpenAI gpt-4o-mini support unknown model

Hi

Any word on Azure OpenAI gpt-4o-mini support?

"Unknown model 'gpt-4o-mini'. Please provide a valid OpenAI model name in: gpt-4, gpt-4-32k, gpt-4-1106-preview, gpt-4-0125-preview, gpt-4-turbo-preview, gpt-4-vision-preview, gpt-4-1106-vision-preview, gpt-4-turbo-2024-04-09, gpt-4-turbo, gpt-4o, gpt-4o-2024-05-13, gpt-4-0613, gpt-4-32k-0613, gpt-4-0314, gpt-4-32k-0314, gpt-3.5-turbo, gpt-3.5-turbo-16k, gpt-3.5-turbo-0125, gpt-3.5-turbo-1106, gpt-3.5-turbo-0613, gpt-3.5-turbo-16k-0613, gpt-3.5-turbo-0301, text-davinci-003, text-davinci-002, gpt-3.5-turbo-instruct, text-ada-001, text-babbage-001, text-curie-001, ada, babbage, curie, davinci, gpt-35-turbo-16k, gpt-35-turbo, gpt-35-turbo-0125, gpt-35-turbo-1106, gpt-35-turbo-0613, gpt-35-turbo-16k-0613"

16 comments

HHarrison

Hey guys - Quick one - is there a way of

Hey guys - Quick one - is there a way of using llama index without using RAG? i.e. just for generating a response using Azure OpenAI ?

3 comments

HHarrison

List index

Hi there. I have a use case for my engine, which is essentially to store all documents as a list index, then fire use the entirety of that index, i.e. all nodes without any filtering - how best to achieve this? - Also, is there a way I can use the LLM to filter if I wanted to? use case is basically to put a chat inteface infront of a monitoring solution, so I envisage a two step process - 1) use the LLM to extract useful stuff from loads of logs 2) send the relevant logs to the LLM along with the users view of the symptoms. I also think I may have just described the base functionality of llama-index 😄

3 comments

HHarrison

Hi I ve been trying to stream into my

Hi! I've been trying to stream into my front end (Javascript) but having no joy. I can get the streaming to work to the console but can't figure out how to stream the text to the front end. Any help is much appreciated!

1 comment

HHarrison

Hi guys so I would like some guidance on

Hi guys - so I would like some guidance on how to tune my bot to give more detailed answers. The source data are different formatted word documents. At the moment it summarises "too much" and I really need to generate large volumes of text with the output. Where should I start?

7 comments

HHarrison

Quick question on the basic SummaryIndex

Quick question on the basic SummaryIndex - If I have 1000 equivalent pages of C sharp, and I build a SummaryIndex, then say "Show me this code as a mermaid diagram" - would this work? im assuming that the main challenge is token limits of the LLM?

3 comments

HHarrison

Hi - I want to get all nodes from my

Hi - I want to get all nodes from my index, just in their raw format so I can log node_id and corresponding text for later evaluation

GPTVectorStoreIndex.from_vector_store

How do I do this?

4 comments

HHarrison

Good afternoon everyone We are trying to

Good afternoon everyone! We are trying to move from OpenAI to Azure OpenAI but are hitting rate limits on embeddings straight away. We think this is because Azure OpenAI allows 240k tokens per min and OpenAI allows 1m. I understand that by lowering the batch size we could potentially reduce the amount of tokens per minute, but I am unsure whether batch size is related to time in anyway, and therefore we still may get this same problem

The only solution I can think of is to introduce a delay somehow between the execution of batches ?

6 comments

HHarrison

Choose llm

hi guys - quick question. I have two python scripts - 1 which builds the indicies and 1 which retrieves the index from storage and queries it - Its clear how to choose and LLM to build hte indicies, but I'm missing on how to specifiy a LLM to query the index? I want to try different combinations - I've probably missed something easy, buy can anyone help?

2 comments

HHarrison

Another question I used azure openapi s

Another question - I used azure openapi's capability to build a chatbot over your data to see how it compared.. Lots of thoughts I can share there ... But one neat feature was that it included the source document and position within the source document as references. I want to reverse engineer this into my engine - how do I go about doing this ?

8 comments

HHarrison

Hey guys So I ve created a few

Hey guys. So I've created a few prototypes for enterprise use using llamaindex, its brilliant ! One of my prototypes is an evaluation engine - other than what's built into llama index are there any other leading ways of scoring questions and answers ?

2 comments

Find answers from the community

Why do i get different chunks returned when asking the same question over and over again?

Azure OpenAI gpt-4o-mini support unknown model

Hey guys - Quick one - is there a way of

List index

Hi I ve been trying to stream into my

Hi guys so I would like some guidance on

Quick question on the basic SummaryIndex

Hi - I want to get all nodes from my

Good afternoon everyone We are trying to

Choose llm

Another question I used azure openapi s

Hey guys So I ve created a few