The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer Paper ⢠2602.02557 ⢠Published 8 days ago ⢠21
D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing Paper ⢠2605.25893 ⢠Published 12 days ago ⢠39
PolySAE: Modeling Feature Interactions in Sparse Autoencoders via Polynomial Decoding Paper ⢠2602.01322 ⢠Published Feb 1 ⢠8