Updated 5 months ago

Does anyone have any ideas around best

At a glance

The community member is looking for best practices to ingest documents from Slack while retaining as much metadata as possible for individual messages, while also maintaining the context of the entire conversation. They have tried two approaches:

1. Storing each message as its own document, which successfully captures the permalink, timestamp, and user metadata, but loses the conversational context.

2. Storing multiple messages in a single document, which preserves the conversational context but cannot link to individual messages and does not store the per-message user and timestamp metadata.

The community members discuss a potential approach mentioned in a blog post, but note that it has limitations in handling entire conversation threads, as it treats each message as a standalone "fact". There is no explicitly marked answer, and the community members agree that this is a challenging problem without a single "good" solution.

Does anyone have any ideas around best practices to ingest documents with Slack?

I want to retain as much metadata as possible for individual messages, but I want the document retrieval to maintain context of the entire conversation. I've tried this two ways:

  1. Each message is its own document. I can successfully store the permalink, timestamp, and user in the metadata, but the conversational context is lost.
  2. Multiple messages are stored in each document. The context is kept, but I can't link to individual messages, and the user and timestamp metadata isn't stored because each doc holds multiple messages.
Any ideas how to solve this problem?
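To make the trade-off concrete, here is a minimal plain-Python sketch of the two approaches. It uses no particular library: the `Doc` class, the helper names, and the sample thread are all hypothetical, and the message field names (`user`, `ts`, `permalink`, `text`) are assumptions for illustration rather than a confirmed schema.

```python
from dataclasses import dataclass, field


@dataclass
class Doc:
    """Hypothetical stand-in for whatever document type the index uses."""
    text: str
    metadata: dict = field(default_factory=dict)


# Made-up sample thread; field names are assumed for illustration.
messages = [
    {"user": "alice", "ts": "1700000000.000100",
     "permalink": "https://example.com/p1",
     "text": "How do I reset my API key?"},
    {"user": "bob", "ts": "1700000060.000200",
     "permalink": "https://example.com/p2",
     "text": "Go to Settings and click Regenerate."},
]


def per_message_docs(msgs):
    """Approach 1: one document per message. Metadata is complete,
    but each document knows nothing about its neighbours."""
    return [
        Doc(m["text"],
            {"user": m["user"], "ts": m["ts"], "permalink": m["permalink"]})
        for m in msgs
    ]


def per_thread_doc(msgs):
    """Approach 2: one document for the whole conversation. Context is
    kept, but there is no single user/ts/permalink value to store."""
    return Doc("\n".join(m["text"] for m in msgs),
               {"message_count": len(msgs)})


single = per_message_docs(messages)
combined = per_thread_doc(messages)
```

In the first case `single[1].metadata` can link straight back to the reply, while `combined.metadata` has nothing message-specific to point at.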
3 comments
This is basically what I have now. The limitation is that it can't take entire conversation threads into account, since it handles each message as a standalone "fact". If someone asks a question in a Slack channel and another user replies without directly mentioning the problem, the reply wouldn't come up as relevant in a similarity search. Does that make sense?
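A rough sketch of that failure mode, under loud assumptions: Jaccard token overlap stands in for embedding similarity (real embeddings would likely show a smaller gap), the thread is made up, and `with_context` is a hypothetical helper, not something proposed in this thread. It indexes each message together with the messages just before it while keeping the metadata per message.

```python
import re


def tokens(text):
    """Lowercased alphanumeric tokens as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap(query, text):
    """Jaccard overlap; a crude stand-in for a similarity search score."""
    q, t = tokens(query), tokens(text)
    union = q | t
    return len(q & t) / len(union) if union else 0.0


# Made-up thread: the reply never mentions the problem it answers.
thread = [
    {"user": "alice", "ts": "1700000000.000100",
     "text": "How do I reset my API key?"},
    {"user": "bob", "ts": "1700000060.000200",
     "text": "Go to Settings and click Regenerate."},
]


def with_context(msgs, i, window=1):
    """Hypothetical helper: text to index for message i is the message
    plus up to `window` preceding messages; metadata stays per-message."""
    start = max(0, i - window)
    text = "\n".join(m["text"] for m in msgs[start:i + 1])
    meta = {"user": msgs[i]["user"], "ts": msgs[i]["ts"]}
    return text, meta


query = "reset API key"
standalone = overlap(query, thread[1]["text"])   # reply on its own
ctx_text, ctx_meta = with_context(thread, 1)
contextual = overlap(query, ctx_text)            # reply plus the question
```

Here the standalone reply shares no tokens with the query, while the version that carries the question along does, and `ctx_meta` still identifies the individual reply. Whether a fixed window is enough for long or interleaved threads is an open question.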
Yeah, makes sense. It's a tough problem, I don't think there's a single "good" solution here.