EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models Paper • 2312.06281 • Published Dec 11, 2023 • 2
Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy Paper • 2508.07485 • Published Aug 10, 2025 • 10
Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive Patterns in Language Models Paper • 2510.15061 • Published Oct 16, 2025 • 1
view post Post fblgit/UNA-dolphin-2.6-mistral-7b-dpo-laserRE-Introducing, some of the best SFT model, he legend: DOLPHIN. This model is very special, a LASER-UNA model: UNA-dolphin-2.6-mistral-7b-dpo-laser @fblgit in collaboration with @fernandofernandes and @ehartford 4 replies · 👍 15 15 + Reply