Order-based matching of corpus metadata to to tokens #11
-
Hi! Thanks a lot for this nice little library, the timing is perfect :) If I want to provide additional metadata in my corpus, how is it matched to the indexed corpus tokens at retrieval time? Is it entirely based on both structures having the same order such that the indices apply? Just looking for a quick confirmation before using this in a real-world application :) Quick example to illustrate:
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Yes, it is dependent on the structure! If you don't provide |
Beta Was this translation helpful? Give feedback.
-
Thanks for clarifying! |
Beta Was this translation helpful? Give feedback.
Yes, it is dependent on the structure! If you don't provide
corpus=corpus
it will simply return the index, which you can use to match manually. bm25s does not do any index checking (since it only sees lists).