view reply Nice work! For reference, here are two related recent papers:Scaling Inference-Efficient Language Models (https://arxiv.org/pdf/2501.18107)Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs (https://arxiv.org/pdf/2510.18245)
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs Paper • 2510.18245 • Published Oct 21 • 6
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs Paper • 2510.18245 • Published Oct 21 • 6 • 2
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs Paper • 2510.18245 • Published Oct 21 • 6