Release BGE-M3 and Activation Beacon
staoxiao released this on 02 Feb, 05:57
BGE-M3
A new member of the BGE model series! BGE-M3 stands for Multi-Linguality, Multi-Granularity (input lengths up to 8192 tokens), and Multi-Functionality (unification of dense, lexical, and multi-vector retrieval). It is the first embedding model that supports all three retrieval methods.
For more details, please refer to the technical report and code.
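To make the three retrieval modes concrete, here is a minimal sketch of how each one might score a query against a document. It is a toy illustration under stated assumptions, not the BGE-M3 implementation: the function names are hypothetical, the numbers are hand-made rather than real model outputs, and the multi-vector score assumes a ColBERT-style late-interaction formula (best document match per query token, averaged over query tokens).

```python
# Toy illustration only: all vectors and weights below are invented,
# and these helpers are hypothetical, not part of any released library.

def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def dense_score(q_vec, d_vec):
    """Dense retrieval: one vector per text, compared by inner product."""
    return dot(q_vec, d_vec)

def lexical_score(q_weights, d_weights):
    """Lexical retrieval: sum the weight products of tokens both texts share."""
    shared = q_weights.keys() & d_weights.keys()
    return sum(q_weights[t] * d_weights[t] for t in shared)

def multi_vector_score(q_vecs, d_vecs):
    """Multi-vector retrieval (assumed late interaction): for each query
    token vector, take its best match among the document token vectors,
    then average over the query tokens."""
    best_matches = [max(dot(q, d) for d in d_vecs) for q in q_vecs]
    return sum(best_matches) / len(q_vecs)

# Hypothetical query and document representations.
q_dense, d_dense = [0.6, 0.8], [0.8, 0.6]
q_lex = {"bge": 0.5, "m3": 0.25}
d_lex = {"bge": 0.5, "embedding": 0.75}
q_multi = [[1.0, 0.0], [0.0, 1.0]]
d_multi = [[0.5, 0.5], [1.0, 0.0]]

print("dense:", dense_score(q_dense, d_dense))
print("lexical:", lexical_score(q_lex, d_lex))
print("multi-vector:", multi_vector_score(q_multi, d_multi))
```

Because one model produces all three representations, the three scores can be computed from a single encoding pass and combined, for example as a weighted sum, when ranking candidates.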
Activation Beacon
An effective, efficient, compatible, and low-cost (in training) method to extend the context length of an LLM by 100x. We extend the context length of Llama-2-chat-7b from 4K to 400K.
For more details, please refer to the paper and code.