Hey everyone, I have around 20k PDF files of 2-4 pages each. I am using MetadataExtractor to extract metadata with the Llama 2 LLM, but my kernel crashed. How can I resolve this?
"What are the key distinctions between using 'as_chat_engine' and 'as_query_engine' in terms of the responses they generate when the same question is asked?"
How can I pass the key and value to ExactMatchFilter dynamically? Meaning, if a user is using my RAG chatbot, how can I define a key and value that are relevant to their query?
I have implemented RAG using llama-cpp-python with the Mistral 7B OpenOrca model, but the response time is too high, even though the API is hosted on a server with 2 NVIDIA RTX A4000 GPUs. Can someone help me out?
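A common cause of this symptom is that llama-cpp-python runs on CPU by default unless layers are explicitly offloaded. This sketch shows the relevant `llama_cpp.Llama` constructor knobs (parameter names are from llama-cpp-python; the values and the `load_model` wrapper are illustrative):

```python
def load_model(model_path: str):
    """Sketch: load a GGUF model with GPU offload enabled.
    Without n_gpu_layers, inference stays on the CPU even on a GPU server."""
    from llama_cpp import Llama  # local import; requires a CUDA-enabled build
    return Llama(
        model_path=model_path,
        n_gpu_layers=-1,  # offload all layers to the GPU (0 = CPU-only, the default)
        n_ctx=4096,       # context window; larger windows cost more per token
        n_batch=512,      # prompt-processing batch size
    )
```

Note that `n_gpu_layers` only has an effect if the wheel was built with CUDA support; a plain `pip install llama-cpp-python` gives a CPU-only build.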
I have developed a RAG system and now I want to integrate it into my website as a chatbot. What are the ways I can do that? For example, I am thinking of creating an API in Django, but the problem is: how can I return the response in streaming mode?
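As a sketch of what streaming from Django could look like: `StreamingHttpResponse` is Django's API for this, and the token generator below is a placeholder for wherever the RAG engine's streamed tokens come from (`answer_tokens` and `sse_events` are illustrative names, not Django or llama-index APIs):

```python
def chat_view(request):
    """Sketch of a Django view that streams the answer as Server-Sent Events."""
    from django.http import StreamingHttpResponse  # local import for the sketch
    question = request.GET.get("q", "")
    return StreamingHttpResponse(
        sse_events(answer_tokens(question)),
        content_type="text/event-stream",
    )

def answer_tokens(question):
    """Placeholder generator: swap in the RAG engine's streaming response,
    e.g. iterating over a streamed query result token by token."""
    for token in ["This ", "is ", "a ", "streamed ", "answer."]:
        yield token

def sse_events(tokens):
    """Wrap each token as a Server-Sent Events 'data:' frame so a browser
    EventSource (or fetch reader) can consume it incrementally."""
    for t in tokens:
        yield f"data: {t}\n\n"
```

Because `StreamingHttpResponse` takes any iterator, the view starts sending frames as soon as the first token is yielded instead of waiting for the full answer.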
Hello folks, I am new to RAG, so can you help me out? I have built a RAG system using only 2 text files and a Weaviate vector database. I am getting pretty decent responses, but it usually takes approx. 2 minutes to get the whole response. What should I do to decrease the response time? I need to scale this system to 1000k text files.