Log in
Log into community
Find answers from the community
View all posts
Related posts
Did this answer your question?
π
π
π
Powered by
Hall
Inactive
Updated 9 months ago
0
Follow
Keyword
Keyword
Inactive
0
Follow
N
Niels
9 months ago
Β·
Is it a known issue that when using the keyword extractor for document with some english terms but mostly other content it stores all of the keywords in english?
Or maybe if we wanted to fix that we would need to make a custom extractor?
L
N
10 comments
Share
Open in Discord
L
Logan M
9 months ago
That makes sense to me since the prompt is in English.
You'd probably have to rewrite the extraction prompt in another language?
N
Niels
9 months ago
Yeah will probably have to add "make sure to keep the keywords in the original language" or something
N
Niels
9 months ago
Probably doesn't make sense to add to llama index itself?
L
Logan M
9 months ago
Oh lol the template is hard-coded
https://github.com/run-llama/llama_index/blob/78af3400ad485e15862c06f0c4972dc3067f880c/llama-index-core/llama_index/core/extractors/metadata_extractors.py#L195
N
Niels
9 months ago
Yeah i noticed
L
Logan M
9 months ago
Yea maybe. In general multilingual isn't really accounted for anywhere
N
Niels
9 months ago
Kind of a limitation to be fair
N
Niels
9 months ago
But i guess a custom keyword extractor would resolve that for us
L
Logan M
9 months ago
Yea I agree -- in general, it's on the roadmap to centralize all prompts into some central registry to make things easier to customize.
N
Niels
9 months ago
Cool!
Add a reply
Sign up and join the conversation on Discord
Join on Discord