view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 25 days ago • 63
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 129