AIM Intelligence

company

https://aim-intelligence.com

AIM-Intelligence

AI & ML interests

AI Safety & AI Security

Recent Activity

Dasool updated a dataset about 22 hours ago

AIM-Intelligence/XL-SafetyBench

Dasool updated a dataset 2 days ago

AIM-Intelligence/XL-SafetyBench

Dasool authored a paper 4 months ago

What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models

Papers

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates

updated a dataset about 22 hours ago

AIM-Intelligence/XL-SafetyBench

Updated about 22 hours ago • 5.5k • 280 • 1

authored a paper 4 months ago

What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models

Paper • 2601.06165 • Published Jan 7 • 16

submitted a paper to Daily Papers 4 months ago

What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models

Paper • 2601.06165 • Published Jan 7 • 16

authored 3 papers 4 months ago

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28, 2025 • 22

Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting

Paper • 2409.14747 • Published Sep 23, 2024

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

Paper • 2601.01836 • Published Jan 5 • 10

authored 8 papers 5 months ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24, 2025 • 77

One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs

Paper • 2503.04856 • Published Mar 6, 2025 • 2

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

Paper • 2502.04757 • Published Feb 7, 2025 • 2

When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs

Paper • 2508.03365 • Published Aug 5, 2025 • 5

Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models

Paper • 2508.04196 • Published Aug 6, 2025 • 1

ObjexMT: Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge under Multi-Turn Jailbreaks

Paper • 2508.16889 • Published Aug 23, 2025 • 2

X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates

Paper • 2509.08729 • Published Sep 10, 2025 • 1

sudo rm -rf agentic_security

Paper • 2503.20279 • Published Mar 26, 2025 • 1

authored a paper 6 months ago

AI PB: A Grounded Generative Agent for Personalized Investment Insights

Paper • 2510.20099 • Published Oct 23, 2025

authored 5 papers 7 months ago

Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap

Paper • 2501.02448 • Published Jan 5, 2025

Multi-Step Reasoning in Korean and the Emergent Mirage

Paper • 2501.05712 • Published Jan 10, 2025

Improving Fine-grained Visual Understanding in VLMs through Text-Only Training

Paper • 2412.12940 • Published Dec 17, 2024

KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services

Paper • 2310.04313 • Published Oct 6, 2023

HRET: A Self-Evolving LLM Evaluation Toolkit for Korean

Paper • 2503.22968 • Published Mar 29, 2025