Hydragen: High-Throughput LLM Inference with Shared Prefixes Paper β’ 2402.05099 β’ Published Feb 7, 2024 β’ 20 β’ 4
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling Paper β’ 2312.15166 β’ Published Dec 23, 2023 β’ 60 β’ 9