Updated 11 months ago
Keyword
Niels
11 months ago
Is it a known issue that when the keyword extractor is run on a document that contains some English terms but is mostly in another language, it stores all of the keywords in English?
Or would we need to write a custom extractor to fix that?
10 comments
Logan M
11 months ago
That makes sense to me since the prompt is in English.
You'd probably have to rewrite the extraction prompt in another language?
Niels
11 months ago
Yeah, will probably have to add "make sure to keep the keywords in the original language" or something.
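For anyone landing here later, a minimal sketch of what that extra instruction could look like in a prompt. The function name `build_keyword_prompt` and the exact wording are assumptions for illustration, not the template LlamaIndex ships, and whether a given model actually follows the instruction would need to be checked.

```python
def build_keyword_prompt(text: str, num_keywords: int = 5) -> str:
    """Build a keyword-extraction prompt that asks the model to keep the source language.

    Hypothetical helper: the wording below is illustrative, not the library's prompt.
    """
    return (
        f"{text}\n\n"
        f"Give {num_keywords} unique keywords for this document. "
        "Keep every keyword in the original language of the document; "
        "do not translate keywords into English. "
        "Format as comma separated. Keywords: "
    )


# Example: a German sentence; the prompt itself stays in English,
# only the instruction about the keyword language is added.
prompt = build_keyword_prompt(
    "Der Vertrag regelt die Haftung des Mieters.", num_keywords=3
)
print(prompt)
```

Keeping the instruction in English keeps the prompt readable for maintainers; translating the whole prompt into the document's language, as suggested above, is the other option.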
Niels
11 months ago
Probably doesn't make sense to add that to LlamaIndex itself?
Logan M
11 months ago
Oh lol the template is hard-coded
https://github.com/run-llama/llama_index/blob/78af3400ad485e15862c06f0c4972dc3067f880c/llama-index-core/llama_index/core/extractors/metadata_extractors.py#L195
Niels
11 months ago
Yeah, I noticed.
Logan M
11 months ago
Yea, maybe. In general, multilingual use isn't really accounted for anywhere.
Niels
11 months ago
Kind of a limitation to be fair
Niels
11 months ago
But I guess a custom keyword extractor would resolve that for us.
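A rough, library-free sketch of what such a custom extractor might look like. Everything here is hypothetical: the class name, the prompt wording, and the `complete` callable, which stands in for whatever LLM call you use. In LlamaIndex itself you would probably wrap the same logic in a subclass of the extractor base class instead; the `excerpt_keywords` metadata key is chosen to mirror the built-in extractor, but treat that as an assumption too.

```python
from typing import Callable, Dict, List

# Hypothetical prompt; not the hard-coded LlamaIndex template.
PROMPT_TEMPLATE = (
    "{text}\n\n"
    "Give {n} unique keywords for this document, written in the same "
    "language as the document. Format as comma separated. Keywords: "
)


class LanguagePreservingKeywordExtractor:
    """Hypothetical extractor: one metadata dict per input text."""

    def __init__(self, complete: Callable[[str], str], keywords: int = 5) -> None:
        if keywords < 1:
            raise ValueError("keywords must be >= 1")
        self.complete = complete  # any function mapping prompt -> completion text
        self.keywords = keywords

    def extract(self, texts: List[str]) -> List[Dict[str, str]]:
        results = []
        for text in texts:
            prompt = PROMPT_TEMPLATE.format(text=text, n=self.keywords)
            raw = self.complete(prompt)
            # Normalise the comma-separated answer: trim whitespace, drop empties.
            cleaned = [kw.strip() for kw in raw.split(",") if kw.strip()]
            results.append({"excerpt_keywords": ", ".join(cleaned)})
        return results


# Stand-in for a real LLM call so the sketch runs on its own.
def fake_llm(prompt: str) -> str:
    return " Haftung, Mieter , Vertrag,"


extractor = LanguagePreservingKeywordExtractor(fake_llm, keywords=3)
metadata = extractor.extract(["Der Vertrag regelt die Haftung des Mieters."])
print(metadata)
```

Passing the LLM in as a plain callable keeps the prompt in one place, so swapping the wording or the model later doesn't touch the parsing code.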
Logan M
11 months ago
Yea I agree -- in general, it's on the roadmap to centralize all prompts into some central registry to make things easier to customize.
Niels
11 months ago
Cool!