Looking for thoughts on four common issues I seeHey all! I'm working on a few chatbots built with LlamaIndex, a collection of (1000s of) blog posts as a data-source, and GPT. Really impressed right out of the box, but as I continue to work I've found a few commons ways in which responses are bad. I'm working through mitigating each issue - all of which I think are very solvable.
Issues- Failing to account for recency. Can I somehow get my bot to prioritize more recent context if the same thing is mentioned many times. Maybe I can store date in some metadata?
- Requiring very specifically worded questions. I.e. Ask two questions that mean the same thing to a human. Bot won't be able to find answer for one, will for the other.
- Aggregating vs. Non-aggregating Index. I'm using a simple vector index. Some questions would benefit from an index that could use aggregation of info from across my blog posts.. Others wouldn't. How can I balance this?
- How to handle subjective questions for which there is nothing in the context. I think this comes down to prompt engineering.
If you have any thoughts on the above, please let me know, I'd love to hear them. I'm sure I'm missing some easy improvements.
More infoI wrote about this in depth on my website
https://www.mattambrogi.com/posts/chat-bots/