Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models 4 days ago • 31
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 10 days ago • 63
Article Multimodal Embedding & Reranker Models with Sentence Transformers 17 days ago • 50
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval Paper • 2604.04734 • Published 20 days ago • 12
Article BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders 18 days ago • 25
BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs Paper • 2604.02045 • Published 24 days ago • 34
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 7
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 18 days ago • 18
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 54
Article Community Evals: Because we're done trusting black-box leaderboards over the community Feb 4 • 89
Article Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model Feb 4 • 28
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 119
NanoBEIR datasets Collection These datasets are compatible with the (Sparse)NanoBEIREvaluator with Sentence Transformers v5.2+. Also CrossEncoderNanoBEIREvaluator if bm25 column • 16 items • Updated Mar 2 • 17