Mapping Unstructured Text to Knowledge Graphs: Exploring the State of the Art
Mapping Unstructured Text to Knowledge Graphs: Exploring the State of the Art
At a glance
The post asks about the state of the art for mapping unstructured text to knowledge graphs (KGs) and what community members have found most useful or powerful. The comments discuss the challenges of using LLMs to extract graphs, such as high computational cost, storage complexities, and issues with entity linking and deduplication. One community member prefers graph-like approaches using metadata tagging instead of involving graphs explicitly. They also differentiate between KGs and property graphs, noting that property graphs offer more customization and better code quality. Overall, the community members express skepticism about the current state of mapping unstructured text to KGs, considering it to be in a proof-of-concept stage.
I haven't found it useful at all tbh. Huge computation/token cost, storage complexities, etc. I don't think its worth the effort right now.
What I have found useful is graph-like approaches. Something like using metadata tagging so that if you retrieve some node from a section of text, you use the metadata to help fetch the full section. Things that retrieve by reference can be achieved without needing to involve graphs explicitly
I just tried GraphRAG directly from MSFT. It looks ... reasonable. It created the entities from the content and related them but I don't see edge labels at all. Time to go do more research. Thanks, @Logan M !