Find answers from the community

Home
Members
Harrison
H
Harrison
Offline, last seen 2 months ago
Joined September 25, 2024
Hi

Quick question

Why do I get different chunks returned when asking the same question over and over again?
6 comments
L
H
Hi

Any word on Azure OpenAI gpt-4o-mini support?

"Unknown model 'gpt-4o-mini'. Please provide a valid OpenAI model name in: gpt-4, gpt-4-32k, gpt-4-1106-preview, gpt-4-0125-preview, gpt-4-turbo-preview, gpt-4-vision-preview, gpt-4-1106-vision-preview, gpt-4-turbo-2024-04-09, gpt-4-turbo, gpt-4o, gpt-4o-2024-05-13, gpt-4-0613, gpt-4-32k-0613, gpt-4-0314, gpt-4-32k-0314, gpt-3.5-turbo, gpt-3.5-turbo-16k, gpt-3.5-turbo-0125, gpt-3.5-turbo-1106, gpt-3.5-turbo-0613, gpt-3.5-turbo-16k-0613, gpt-3.5-turbo-0301, text-davinci-003, text-davinci-002, gpt-3.5-turbo-instruct, text-ada-001, text-babbage-001, text-curie-001, ada, babbage, curie, davinci, gpt-35-turbo-16k, gpt-35-turbo, gpt-35-turbo-0125, gpt-35-turbo-1106, gpt-35-turbo-0613, gpt-35-turbo-16k-0613"
16 comments
n
L
H
Hey guys - Quick one - is there a way of using llama index without using RAG? i.e. just for generating a response using Azure OpenAI ?
3 comments
L
H
Hi there. I have a use case for my engine, which is essentially to store all documents as a list index, then fire use the entirety of that index, i.e. all nodes without any filtering - how best to achieve this? - Also, is there a way I can use the LLM to filter if I wanted to? use case is basically to put a chat inteface infront of a monitoring solution, so I envisage a two step process - 1) use the LLM to extract useful stuff from loads of logs 2) send the relevant logs to the LLM along with the users view of the symptoms. I also think I may have just described the base functionality of llama-index 😄
3 comments
H
V
L
Hi! I've been trying to stream into my front end (Javascript) but having no joy. I can get the streaming to work to the console but can't figure out how to stream the text to the front end. Any help is much appreciated!
1 comment
a
Hi guys - so I would like some guidance on how to tune my bot to give more detailed answers. The source data are different formatted word documents. At the moment it summarises "too much" and I really need to generate large volumes of text with the output. Where should I start?
7 comments
H
L
Quick question on the basic SummaryIndex - If I have 1000 equivalent pages of C sharp, and I build a SummaryIndex, then say "Show me this code as a mermaid diagram" - would this work? im assuming that the main challenge is token limits of the LLM?
3 comments
L
H
Hi - I want to get all nodes from my index, just in their raw format so I can log node_id and corresponding text for later evaluation

GPTVectorStoreIndex.from_vector_store

How do I do this?
4 comments
H
W
Good afternoon everyone! We are trying to move from OpenAI to Azure OpenAI but are hitting rate limits on embeddings straight away. We think this is because Azure OpenAI allows 240k tokens per min and OpenAI allows 1m. I understand that by lowering the batch size we could potentially reduce the amount of tokens per minute, but I am unsure whether batch size is related to time in anyway, and therefore we still may get this same problem

The only solution I can think of is to introduce a delay somehow between the execution of batches ?
6 comments
L
H
H
Harrison
·

Choose llm

hi guys - quick question. I have two python scripts - 1 which builds the indicies and 1 which retrieves the index from storage and queries it - Its clear how to choose and LLM to build hte indicies, but I'm missing on how to specifiy a LLM to query the index? I want to try different combinations - I've probably missed something easy, buy can anyone help?
2 comments
H
L
Another question - I used azure openapi's capability to build a chatbot over your data to see how it compared.. Lots of thoughts I can share there ... But one neat feature was that it included the source document and position within the source document as references. I want to reverse engineer this into my engine - how do I go about doing this ?
8 comments
H
L
Hey guys. So I've created a few prototypes for enterprise use using llamaindex, its brilliant ! One of my prototypes is an evaluation engine - other than what's built into llama index are there any other leading ways of scoring questions and answers ?
2 comments
H
L