👋 Is there a way to change the default

At a glance

The community members discuss how to customize the default templates in schema.py, specifically the DEFAULT_TEXT_NODE_TMPL. A community member provides an example of how to create a customized Document object with specific metadata and templates. The discussion then explores the differences between Document and Node objects, and the community members conclude that there is nearly no difference between them. They suggest that the community member can either instantiate new Node objects with the desired customization or modify the existing nodes. Finally, the community members provide guidance on how to use the VectorStoreIndex class to build an index from the customized nodes.

Useful resources

PPocketColin

👋 Is there a way to change the default templates in schema.py (specifically DEFAULT_TEXT_NODE_TMPL)? The template is used by the get_content method on the TextNode.

20 comments

LLogan M

There is!

LLogan M

https://docs.llamaindex.ai/en/stable/module_guides/loading/documents_and_nodes/usage_documents.html#summary

LLogan M

Plain Text

document = Document(
    text="This is a super-customized document",
    metadata={
        "file_name": "super_secret_document.txt",
        "category": "finance",
        "author": "LlamaIndex",
    },
    excluded_llm_metadata_keys=["file_name"],
    metadata_seperator="::",
    metadata_template="{key}=>{value}",
    text_template="Metadata: {metadata_str}\n-----\nContent: {content}",
)

PPocketColin

perfect! But now wait what's the difference between a document and a node? I thoguht I was working with nodes here

PPocketColin

oh..... I think I see. So is this something I'd have to connect to SimpleDirectoryReader?

LLogan M

there is nearly zero difference between a document and node

LLogan M

mostly just naming/perception lol

LLogan M

the classes are nearly identical

PPocketColin

ohhhhh haha

PPocketColin

so when I iterate through all of my nodes after pulling them out of a PDF, should I just instantiate new Nodes with all of this customization added? I'm guessing I could do:

Plain Text

node = TextNode(
  text="blah blah",
...
)

LLogan M

Yea you can do that! Or you can just modify the existing nodes if you have them

LLogan M

node.text_template = "..."

PPocketColin

🤯

PPocketColin

of course I can. How did I miss that! Thanks!

PPocketColin

Ok haha followup question! VectorStoreIndex().build_index_from_nodes(nodes) returns an IndexDict type object but I really need the VectorStoreIndex. Should I just be passing nodes to .from_documents(nodes) instead?

PPocketColin

Except no that doesn't work because .from_documents expects the list items to have .get_doc_id

LLogan M

Use VectorStoreIndex(nodes, ...)

PPocketColin

oh ok I thought I'd still need to call some sort of processing method but I guess not

PPocketColin

thanks!

LLogan M

build_index_from_nodes() is actually called from the base constructor 👍

Add a reply

Find answers from the community

👋 Is there a way to change the default