Thanks @Logan M !
I am new to LLM ecosystem (huggingface, llama_index, langchain), trying to learn llama_index. Thank you so much for helping out π
I am trying to build few applications with open datasets, index those using llama_index and hoping a decent enough llm would be able to help analyse the dataset and provide relevant information:
1: This is the first use case and first dataset:
https://drive.google.com/file/d/1WFvu8dnVwZV5WuluHFS_eCMJv3qOaXr1/viewThis has lending loan information and whether the loan was bad loan or not. Based on this dataset (which could be actually huge in case of production dataset), I want the llm to predict if an upcoming loan request could result into a bad/good loan in future?
2: This is the second use case and second dataset:
https://huggingface.co/datasets/consumer-finance-complaints/viewer/default/train?row=0This has consumer finance related complaints with finance institutions. Based on this dataset, I want llm to summarise things like:
- Most pain-points of the consumers
- For a finance institution, what is the most common occurring issue
3: This is the third use case and third dataset:
https://huggingface.co/datasets/PolyAI/banking77Given a labled dataset of customer queries, I want the llm to understand upcoming live customer query over the phone and understand it (from previous similar labeling) and then correctly respond.
Are the use-cases I listed above even solvable by LLMs today? Are LLMs matured enough to understand numeric relation(like in first use case) and then make predictions? Does passing the datasets by llama_index is good enough or do we need to fine tune existing llm for these use cases?
I am having hard time wrapping my head around understanding all this, hoping you could help?π