UPDF AI

C-Pack: Packaged Resources To Advance General Chinese Embedding

Shitao Xiao,Zheng Liu,Peitian Zhang,Niklas Muennighoff

2023 · DBLP: journals/corr/abs-2309-07597
arXiv.org · 引用数 687

TLDR

The English models achieve state-of-the-art performance on the MTEB benchmark; meanwhile, the released English data and models for English text embeddings is 2 times larger than the Chinese data.