Chanuk Lee

tally0818

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

upvoted a paper 6 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 8 days ago

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

Organizations

None yet

upvoted 2 papers 6 days ago

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Paper • 2603.22341 • Published 10 days ago • 35

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 150

upvoted a paper 8 days ago

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

Paper • 2603.12151 • Published 19 days ago • 2

updated a model 13 days ago

tally0818/ContextGRPO_2x4_random

Text Generation • Updated 13 days ago • 13

published a model 13 days ago

tally0818/ContextGRPO_2x4_random

Text Generation • Updated 13 days ago • 13

published a model 14 days ago

tally0818/ContextGRPO_2x4

Text Generation • Updated 14 days ago • 10

updated a model 14 days ago

tally0818/ContextGRPO_2x4

Text Generation • Updated 14 days ago • 10

upvoted a paper 19 days ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 21 days ago • 146

upvoted a paper 20 days ago

MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents

Paper • 2603.09827 • Published 21 days ago • 29

upvoted 2 papers about 1 month ago

MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models

Paper • 2602.17602 • Published Feb 19 • 56

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 146

upvoted 3 papers about 2 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 126

Self-Hinting Language Models Enhance Reinforcement Learning

Paper • 2602.03143 • Published Feb 3 • 31

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published Feb 4 • 37

liked a Space about 2 months ago

Music Flamingo

🎵

160

Ask questions about any song and get detailed answers

upvoted 2 papers about 2 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 110

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published Jan 30 • 39

upvoted a paper 3 months ago

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published Jan 2 • 57
