bostero's picture

1

bostero

anon

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published Mar 25 • 30