Find answers from the community

pikachu8887867
Offline, last seen 2 months ago
Joined September 25, 2024
Is it possible to add a system prompt in CitationQueryEngine?

Plain Text
    query_engine = CitationQueryEngine.from_args(
        index=temp_index,
        similarity_top_k=3,
        citation_chunk_size=512,
        system_prompt="Doesn't matter what user asks, always reply with: `I love bananas`"
    )


No matter what I ask, it never outputs `I love bananas`.
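One hedged workaround (not an official recipe): CitationQueryEngine answers through its own citation templates, so an extra `system_prompt` kwarg may simply be ignored; overriding the citation QA template, or setting `system_prompt` on the LLM itself, is more likely to stick. A minimal sketch, assuming the `citation_qa_template` parameter available in recent LlamaIndex versions:

Plain Text
from llama_index.core import PromptTemplate
from llama_index.core.query_engine import CitationQueryEngine

# Sketch: put the instruction into the citation QA template so it reaches the LLM.
qa_prompt = PromptTemplate(
    "Doesn't matter what the user asks, always reply with: `I love bananas`\n"
    "Context:\n{context_str}\nQuery: {query_str}\nAnswer: "
)
query_engine = CitationQueryEngine.from_args(
    temp_index,
    similarity_top_k=3,
    citation_chunk_size=512,
    citation_qa_template=qa_prompt,
)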
1 comment
Hi! I'm using LlamaParse through the Python library. I've sent a PDF with 9 pages, and I can see in LlamaCloud that it indeed processed 9 pages.

But in my code debugger, I can see that it returns only 1 document, not 9. Where are the rest of the pages?
3 comments
Hi! Let's say I want to retrieve relevant chunks about "X" and I don't want augmented generation by an LLM. I just want to retrieve chunks and that's it. What methods other than retriever.retrieve(X) does LlamaIndex offer?

If I must use a query engine (with an LLM), then I would ask something like "retrieve all information about X", but I'm not sure that would be sufficient.
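A minimal sketch of pure retrieval, no LLM involved; `similarity_top_k` controls how many chunks come back, and `index.as_retriever()` is just a shortcut for building a vector retriever:

Plain Text
# Pure retrieval: only the vector search runs, no LLM call is made.
retriever = index.as_retriever(similarity_top_k=5)
nodes = retriever.retrieve("X")
for node_with_score in nodes:
    print(node_with_score.score, node_with_score.node.get_content())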
2 comments
What is the best set of tools I can use for comparing everything in one PDF against everything in another PDF, and listing all the contradicting information? E.g.:
Plain Text
1) In PDF-A it is written that Microsoft made a profit in 2021, but in PDF-B it is written that Microsoft made a loss in 2021.

2) Based on PDF-A, Apple spends more money on design than on technology, however in PDF-B it is stated that Apple spends more money on technology.

You see? I don't have any "initial query". I just want to compare the whole of PDF-A against the whole of PDF-B and list all the contradictions.

I'm reading the LlamaIndex docs and frankly I have a headache now. I can't figure out which exact tools to use and how.

Use agents? OK, which ones?

Use a multi-hop query engine like in the example with Tesla and another company (I forgot the name)? But I don't have an initial query. Do I need to ask "compare everything and list all contradictions"?

Stuff in as much as the model can handle, ask it to summarize, then compare summary vs summary? Maybe? But summarization might omit important info...

etc.

Help, please
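A hedged sketch of the summarize-then-compare idea mentioned above, only a starting point since summaries can drop details; the `summarize_pdf` helper, file paths, model choice, and prompt wording are all illustrative assumptions, not a LlamaIndex recipe:

Plain Text
from llama_index.core import SimpleDirectoryReader, SummaryIndex
from llama_index.llms.openai import OpenAI  # assumption: any LlamaIndex LLM works here

llm = OpenAI(model="gpt-4o-mini")  # illustrative model choice

def summarize_pdf(path: str) -> str:
    # Build one summary per document using a summary index.
    docs = SimpleDirectoryReader(input_files=[path]).load_data()
    engine = SummaryIndex.from_documents(docs).as_query_engine(
        response_mode="tree_summarize", llm=llm
    )
    return str(engine.query("Summarize every factual and financial claim in this document."))

summary_a = summarize_pdf("./pdf_a.pdf")
summary_b = summarize_pdf("./pdf_b.pdf")

# Ask the LLM to compare the two summaries and surface contradictions.
comparison = llm.complete(
    "List every contradiction between these two summaries:\n\n"
    f"PDF-A:\n{summary_a}\n\nPDF-B:\n{summary_b}"
)
print(comparison.text)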
2 comments
Hi! I'm trying to use LlamaParse the following way:

1) I write my code in the front-end library CodeMirror. This code contains LlamaParse stuff (reading files using SimpleDirectoryReader, etc.).
2) Then I send that code to the FastAPI backend and call the exec() function to perform the PDF parsing.

I was getting a warning from LlamaParse telling me to use nest_asyncio.apply(). I did, and now I'm getting this:

Plain Text
  File "/home/ali/AI/nexus/llm-topic-modelling/venv/lib/python3.10/site-packages/nest_asyncio.py", line 193, in _patch_loop
    raise ValueError('Can\'t patch loop of type %s' % type(loop))
ValueError: Can't patch loop of type <class 'uvloop.Loop'>


Any ideas on how to fix this?
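A hedged workaround, assuming the uvloop loop comes from uvicorn's default event loop: either start uvicorn with the plain asyncio loop so nest_asyncio can patch it, or drop nest_asyncio and call LlamaParse's async API from the async endpoint (file name illustrative, aload_data assumed from recent llama_parse versions):

Plain Text
# Option 1 (assumption: uvloop is installed by uvicorn): force the plain
# asyncio event loop, which nest_asyncio can patch.
#   uvicorn main:app --loop asyncio

# Option 2: skip nest_asyncio and use the async API inside the async endpoint.
documents = await parser.aload_data("my_file.pdf")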
2 comments
Hi! How do I serialize/deserialize Document objects?

I used LlamaParse to parse my documents:

Plain Text
file_extractor = {".pdf": parser}
documents = SimpleDirectoryReader(
    "./docs/pdfs/all", file_extractor=file_extractor
).load_data()


I want to store them and then reuse them later. Is that possible?
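A minimal sketch, assuming the to_dict()/from_dict() methods that Document inherits from the base component class in recent LlamaIndex versions:

Plain Text
import json
from llama_index.core import Document

# Serialize the parsed documents to disk...
with open("documents.json", "w") as f:
    json.dump([doc.to_dict() for doc in documents], f)

# ...and rebuild them later without re-parsing.
with open("documents.json") as f:
    documents = [Document.from_dict(d) for d in json.load(f)]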
1 comment
Hi! Is there a way to process Ollama calls in batches, like the batch functionality in LangChain?
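As far as I can tell there is no direct equivalent of LangChain's .batch() on LlamaIndex's Ollama class; a hedged sketch is to fan out the async completion API with asyncio.gather (import path, model name, and request_timeout assume recent LlamaIndex versions):

Plain Text
import asyncio
from llama_index.llms.ollama import Ollama

llm = Ollama(model="llama2", request_timeout=120.0)  # model name illustrative

async def batch_complete(prompts):
    # Fire one async completion per prompt and await them all together.
    return await asyncio.gather(*(llm.acomplete(p) for p in prompts))

responses = asyncio.run(batch_complete(["prompt 1", "prompt 2", "prompt 3"]))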
10 comments
It modifies the prompt I'm using for search, doesn't it? I need to add a system prompt, e.g.:

Plain Text
You are a professional Business Analyst bla bla bla


It has nothing to do with the prompt I will use for retrieval, right?
3 comments
Where do I write a system prompt when using Groq + a query engine?

Plain Text
@app.post("/query/")
def process_text(request_body: dict, db: Session = Depends(get_db)):
    query = request_body.get('query')
    vector_store = QdrantVectorStore(client=client, collection_name="data")
    index = VectorStoreIndex.from_vector_store(vector_store=vector_store)
    query_engine = index.as_query_engine()
    response = query_engine.query(query)
    return response.response


Where exactly do I need to plug my system prompt into the code above?
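A hedged sketch: one place to put it is on the LLM object itself, assuming the system_prompt keyword that LlamaIndex LLM classes accept; the Groq import path and model name are illustrative:

Plain Text
from llama_index.core import Settings
from llama_index.llms.groq import Groq

# Attach the system prompt to the LLM; the query engine built from the index
# then uses this LLM via the default Settings.
Settings.llm = Groq(
    model="llama3-70b-8192",
    system_prompt="You are a professional Business Analyst ...",
)
query_engine = index.as_query_engine()
response = query_engine.query(query)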
1 comment
Why is LLMTextCompletionProgram + GroqCloud so slow? I've tested the native GroqCloud API from the terminal and it is indeed super fast, but when I use it with a pydantic program it is extremely slow. Does it have some kind of loop internally?
3 comments
Does LlamaIndex support JSON mode and a system prompt when using Groq?

In the example given, I see this:

Plain Text
llm.complete(prompt)


Is there a way to send a system prompt and turn JSON mode on?
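A hedged sketch: the system prompt can go through chat messages instead of complete(), and since Groq exposes an OpenAI-compatible API, JSON mode can presumably be requested through additional_kwargs (that part is an untested assumption; model name illustrative):

Plain Text
from llama_index.core.llms import ChatMessage
from llama_index.llms.groq import Groq

llm = Groq(
    model="llama3-70b-8192",
    # Assumption: the OpenAI-style response_format payload is forwarded to Groq.
    additional_kwargs={"response_format": {"type": "json_object"}},
)

response = llm.chat([
    ChatMessage(role="system", content="You are a helpful assistant. Reply in JSON."),
    ChatMessage(role="user", content=prompt),
])
print(response.message.content)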
4 comments
Hi! Is there a way to use PydanticProgram with Groq?
I'm using OpenAIPydanticProgram with OpenAI, but is there a way to use it with Groq?
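Possibly; OpenAIPydanticProgram relies on OpenAI function calling, but LLMTextCompletionProgram is LLM-agnostic. A minimal sketch (the model name and the MyPydanticModel class are illustrative placeholders):

Plain Text
from llama_index.core.program import LLMTextCompletionProgram
from llama_index.llms.groq import Groq

# LLMTextCompletionProgram prompts the LLM for JSON and parses it into the
# pydantic class, so it works with non-OpenAI LLMs such as Groq.
program = LLMTextCompletionProgram.from_defaults(
    output_cls=MyPydanticModel,  # placeholder: your existing pydantic model
    prompt_template_str="Extract the data from: {text}",
    llm=Groq(model="llama3-70b-8192"),  # model name illustrative
)
result = program(text="...")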
3 comments
Hi!

I have a list of topics, e.g.: [expenses, profit, risks], etc.

I have a big PDF (around 500 pages).

I want to classify each page in that PDF, e.g.:

page-1: [expenses, risks]
page-2: [profit]
page-3: []

etc.

Is it possible to achieve this with LlamaIndex?
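It should be; a hedged sketch, assuming the default PDF reader's one-Document-per-page behaviour and any configured LlamaIndex LLM (the OpenAI model, file name, and prompt wording are illustrative):

Plain Text
from llama_index.core import SimpleDirectoryReader
from llama_index.llms.openai import OpenAI  # assumption: any LlamaIndex LLM works here

llm = OpenAI(model="gpt-4o-mini")  # illustrative model choice
topics = ["expenses", "profit", "risks"]

# The default PDF reader typically returns one Document per page.
pages = SimpleDirectoryReader(input_files=["report.pdf"]).load_data()

results = {}
for i, page in enumerate(pages, start=1):
    prompt = (
        f"Topics: {topics}\n"
        "Return a comma-separated list of the topics covered by this page, "
        "or 'none' if no topic applies.\n\n"
        f"Page text:\n{page.text}"
    )
    results[f"page-{i}"] = llm.complete(prompt).text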
7 comments
Hi! While using pydantic program, I'm getting this error:

Plain Text
pydantic_core._pydantic_core.ValidationError: 3 validation errors for Data102Model
scope_1_emissions
  Field required [type=missing, input_value={}, input_type=dict]
    For further information visit https://errors.pydantic.dev/2.6/v/missing
scope_2_emissions_location_based
  Field required [type=missing, input_value={}, input_type=dict]
    For further information visit https://errors.pydantic.dev/2.6/v/missing
scope_3_emissions
  Field required [type=missing, input_value={}, input_type=dict]
    For further information visit https://errors.pydantic.dev/2.6/v/missing


Does this mean that my declared fields are wrong? I'm not sure what's wrong with my pydantic model:
Plain Text
class Data102Model(BaseModel):
    """Data model of scope 1,2 and 3 emissions"""
    scope_1_emissions: str = Field(
        description='scope 1 emissions value. If not found, return "not mentioned"')
    scope_2_emissions_location_based: str = Field(
        description='location based scope 2 emissions value. If not found, return "not mentioned"')
    scope_3_emissions: str = Field(
        description='scope 3 emissions value. If not found, return "not mentioned"')


Please help.
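The error itself says the LLM returned an empty object (`input_value={}`), so the three required fields were missing rather than wrongly declared. A hedged sketch of one common mitigation: make the fields optional with defaults instead of relying on the prompt instruction alone:

Plain Text
from typing import Optional
from pydantic import BaseModel, Field

class Data102Model(BaseModel):
    """Data model of scope 1, 2 and 3 emissions"""
    # Defaults mean an empty LLM response no longer fails validation.
    scope_1_emissions: Optional[str] = Field(
        default="not mentioned", description="scope 1 emissions value")
    scope_2_emissions_location_based: Optional[str] = Field(
        default="not mentioned", description="location based scope 2 emissions value")
    scope_3_emissions: Optional[str] = Field(
        default="not mentioned", description="scope 3 emissions value")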
4 comments
Hi! I remember that while creating pydantic models for a pydantic program, there was a way to explain what each field in a class represents, e.g.:

Plain Text
class ClimateChangeTarget(BaseModel):
    """Data model for climate change target"""
    target_reduction_of_emission: str # I want to write what I mean by this field
    base_year: int # I want to write what I mean by this field
    timeline: str # I want to write what I mean by this field
    target_description: str # I want to write what I mean by this field


class Data101Model(BaseModel):
    """Data model for climate change targets"""
    overall_target: ClimateChangeTarget  # I want to write what I mean by this field
    scope_1: ClimateChangeTarget # I want to write what I mean by this field
    scope_2: ClimateChangeTarget # I want to write what I mean by this field
    scope_3: ClimateChangeTarget # I want to write what I mean by this field


How do I declare my fields with an explanation for each one?
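A minimal sketch: pydantic's Field(description=...) is the usual way to attach an explanation to each field, and the class docstring describes the model as a whole (the description texts below are illustrative):

Plain Text
from pydantic import BaseModel, Field

class ClimateChangeTarget(BaseModel):
    """Data model for climate change target"""
    target_reduction_of_emission: str = Field(description="Targeted percentage reduction of emissions")
    base_year: int = Field(description="Base year the reduction is measured against")
    timeline: str = Field(description="Deadline or timeline for reaching the target")
    target_description: str = Field(description="Free-text description of the target")

class Data101Model(BaseModel):
    """Data model for climate change targets"""
    overall_target: ClimateChangeTarget = Field(description="Company-wide target")
    scope_1: ClimateChangeTarget = Field(description="Target for scope 1 emissions")
    scope_2: ClimateChangeTarget = Field(description="Target for scope 2 emissions")
    scope_3: ClimateChangeTarget = Field(description="Target for scope 3 emissions")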
4 comments
Hi! In your experience, what are the best approaches to improve retrieval?

Right now all I do is: chunk using a 500-token chunk size, embed into Qdrant, then search my question against the embeddings.

The questions are predefined btw
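One hedged example of a common next step: over-fetch with a larger top-k and rerank with a cross-encoder before answering (the reranker class and model name assume recent LlamaIndex versions with sentence-transformers installed):

Plain Text
from llama_index.core.postprocessor import SentenceTransformerRerank

# Retrieve generously, then let a cross-encoder reorder and trim the chunks.
reranker = SentenceTransformerRerank(
    model="cross-encoder/ms-marco-MiniLM-L-6-v2", top_n=5
)
query_engine = index.as_query_engine(
    similarity_top_k=20, node_postprocessors=[reranker]
)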
2 comments
How do I turn off stdout output when I call the run function of IngestionPipeline?

Plain Text
    pipeline = IngestionPipeline(
        transformations=[
            TokenTextSplitter(chunk_size=1024, chunk_overlap=300),
            OpenAIEmbedding()
        ],
        vector_store=vector_store
    )

    pipeline.run(documents=documents)
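A hedged sketch: the console output usually comes either from progress bars or from library loggers, so both can be turned down (the show_progress argument on run() is an assumption about recent LlamaIndex versions):

Plain Text
import logging

# Silence the noisier library loggers...
logging.getLogger("llama_index").setLevel(logging.WARNING)
logging.getLogger("openai").setLevel(logging.WARNING)
logging.getLogger("httpx").setLevel(logging.WARNING)

# ...and make sure no progress bars are requested.
pipeline.run(documents=documents, show_progress=False)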
4 comments
Where do I declare the top_k argument?

Plain Text
    vector_store = QdrantVectorStore(client=qdrant_vectorstore_client,
                                     collection_name=str(uuid.uuid4()))
    
    index = VectorStoreIndex.from_vector_store(vector_store=vector_store)
    retriever = index.as_retriever()
    nodes = retriever.retrieve(question)
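A minimal sketch: the top-k is passed when the retriever is created.

Plain Text
retriever = index.as_retriever(similarity_top_k=5)
nodes = retriever.retrieve(question)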
9 comments
#1 Weeb 😄

Might it be because of the LLM you are using? I bet you are using an open-source one?
17 comments
Sorry, I'm a bit confused about the usage of IngestionPipeline along with self-created Documents:

Plain Text
        embed_model = HuggingFaceEmbedding(model_name="WhereIsAI/UAE-Large-V1")
        documents = [Document(text="...", metadata={...}), Document(text=",,,", metadata={...})]

        pipeline=[
            TokenTextSplitter(
                chunk_size=512,
                chunk_overlap=20,
                separator=" "
            ),
            embed_model
        ]

      pipeline.run(documents=documents)


As I understand it, pipeline.run should use the embed_model to create embeddings for the documents and populate them automatically?

With the code above, I can see that my embeddings are empty in my local Qdrant.

This is from the Qdrant UI:

Plain Text
{"id_": "04fd0f75-0641-4ff9-96df-da768e99922c", "embedding": null,
9 comments
How do I get an existing index (without re-creating it)?


Plain Text
service_context = ServiceContext.from_defaults()
vector_store = QdrantVectorStore(client=client, collection_name="paul_graham")
storage_context = StorageContext.from_defaults(vector_store=vector_store)
index = VectorStoreIndex.from_documents(
    documents, storage_context=storage_context, service_context=service_context
)
query_engine = index.as_query_engine()
response = query_engine.query("What did the author do growing up?")


Here, we are using from_documents, which inserts new data. But how do I get an existing index if I reboot the app, for example? I don't want to re-create the index again.
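A minimal sketch: once the data is already in Qdrant, rebuild the index object from the existing vector store with from_vector_store instead of from_documents (imports shown for recent LlamaIndex versions):

Plain Text
from llama_index.core import VectorStoreIndex
from llama_index.vector_stores.qdrant import QdrantVectorStore

# No documents are inserted here; the index just wraps the existing collection.
vector_store = QdrantVectorStore(client=client, collection_name="paul_graham")
index = VectorStoreIndex.from_vector_store(vector_store=vector_store)
query_engine = index.as_query_engine()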
6 comments
How do I load a model that I've already downloaded?

Plain Text
service_context = ServiceContext.from_defaults(llm=llm, embed_model="local:WhereIsAI/UAE-Large-V1")


In the above example, I must declare the local: flag, right?

But if I need just an embedding model, what should I write?

Plain Text
embed_model = HuggingFaceEmbedding(model_name="local:WhereIsAI/UAE-Large-V1")


The above does not work, and if I remove local:, it will download the model again.
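A hedged sketch: the local: prefix is only understood by the ServiceContext embed_model string resolver; HuggingFaceEmbedding takes the plain model name and reuses the Hugging Face cache, or an explicit cache_folder (parameter assumed from recent versions, path illustrative):

Plain Text
from llama_index.embeddings.huggingface import HuggingFaceEmbedding

# Plain model name, no "local:" prefix; point cache_folder at the existing
# download so nothing is fetched again.
embed_model = HuggingFaceEmbedding(
    model_name="WhereIsAI/UAE-Large-V1",
    cache_folder="./models",
)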
8 comments
Hi! Can you help me choose a proper class/loader to ingest PDF file(s) sent from the frontend?

Plain Text
from fastapi import File, UploadFile
from typing import List

async def upload_files(pdf_files: List[UploadFile] = File(...)):
    for pdf_file in pdf_files:
        # what loader do I need to use? Do I need to first read the text contents myself and create a list of Documents?
        ...
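A hedged sketch of one way to do it: write each UploadFile to a temporary directory and let SimpleDirectoryReader (optionally with a file_extractor such as LlamaParse) build the Documents; the endpoint body below is illustrative:

Plain Text
import tempfile
from pathlib import Path
from typing import List

from fastapi import File, UploadFile
from llama_index.core import SimpleDirectoryReader

async def upload_files(pdf_files: List[UploadFile] = File(...)):
    with tempfile.TemporaryDirectory() as tmpdir:
        for pdf_file in pdf_files:
            # Persist the uploaded bytes so the reader can pick them up.
            (Path(tmpdir) / pdf_file.filename).write_bytes(await pdf_file.read())
        documents = SimpleDirectoryReader(tmpdir).load_data()
    return {"num_documents": len(documents)}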
2 comments
Hi! I could not figure out where to declare the top_k argument in RAG.

Plain Text
query_engine = index.as_query_engine(similarity_top_k=1)
response = query_engine.query("query")


Is this correct?
1 comment
How do I increase the waiting time (timeout)?

I'm using a local LLM from Ollama + Qdrant:

Plain Text
llm = Ollama(model="model")
...
query_engine = index.as_query_engine()
response = query_engine.query("query")


but I keep getting:

Plain Text
TimeoutError: timed out
...
httpcore.ReadTimeout: timed out
...
httpx.ReadTimeout: timed out


I can confirm that the Ollama server is running and that I can access http://localhost:11434.

I can also confirm that I can chat with the model in the terminal using the ollama run command.

I played around with the raw LLM (without Qdrant) and it worked, though sometimes it threw the same error.
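A minimal sketch: the read timeout is configured on the Ollama LLM itself via request_timeout (the default is only a few tens of seconds, which large local models easily exceed):

Plain Text
from llama_index.llms.ollama import Ollama

# Give the local model more time to respond before httpx gives up.
llm = Ollama(model="model", request_timeout=300.0)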
3 comments