Alberto Compagnoni

alcompa

AI & ML interests

Multimodal LLMs, Reasoning MLLMs, RAG

Recent Activity

upvoted a paper 15 days ago

ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

authored a paper 18 days ago

ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

updated a collection 18 days ago

ReAG

upvoted a paper 15 days ago

ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

Paper • 2511.22715 • Published Mar 31 • 2

upvoted an article 8 months ago

There is no such thing as a tokenizer-free lunch

Article • catherinearnett • Sep 25, 2025 • 98

upvoted an article 9 months ago

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Article • smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98

upvoted an article 11 months ago

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Article • toslali-ibm, mirinflim, qgallouedec, esnible, rganti, mudhakar • Jun 3, 2025 • 101

upvoted 2 articles 12 months ago

The N Implementation Details of RLHF with PPO

Article • vwxyzjn, tianlinliu0121, lvwerra • Oct 24, 2023 • 72

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • NormalUhr • Feb 7, 2025 • 292

upvoted an article about 1 year ago

How to generate text: using different decoding methods for language generation with Transformers

Article • patrickvonplaten • Mar 1, 2020 • 297

upvoted an article over 1 year ago

Decoding Strategies in Large Language Models

Article • mlabonne • Oct 29, 2024 • 113
