Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

CLAIRE Lab @EPFL

university
https://github.com/CLAIRE-Labo
CLAIRE-Labo
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

skandermoalla  authored a paper about 4 hours ago
Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers
skandermoalla  authored a paper about 4 hours ago
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
skandermoalla  authored a paper about 4 hours ago
Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions
View all activity

Papers

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

View all Papers

Justin Deschenaux's profile picture Caglar Gulcehre's profile picture Mikhail Terekhov's profile picture Amin Mansouri's profile picture Skander Moalla's profile picture
CLAIRE-Labo 's Papers 1
Submitted by
Alexander Panfilov
5

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

CLAIRE-Labo CLAIRE Lab @EPFL
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs