I have an idea:
I want to build a tool or software for my company. My company is a consultancy company which has 100's of projects. Each project has their own fair share of documentation in DocMan Databases, Jira tickets, GitLab issues, Wiki, teams chats, etc..
I want the tool to be able to collect all data that is text based and collect it in a vector embeddings database. I then want that tool, after automatically collecting all this data from the project, to attach this DB to a local LLM (local for privacy reasons, probably GPT4ALLJ or Vicuna etc..). This local model should then be able to be prompted much like ChatGPT but offer sources from the aforementioned data. This would make project work much easier because any information can simply be asked to this assistant. (There are many ChatGPT UI clones available open source).
I have a few questions:
- Am I right in assuming this is possible?
- How easy is it? From my understanding all of the tools required for this is open source and just need to be integrated
- How much would it cost? (I assumed free due to the open source nature)
- Could you direct me in how exactly I should go about creating this.
Thanks a lot!