Miaow Lab @ CityUHK

university

https://ningmiao.space

AI & ML interests

LLM reasoning

Recent Activity

TorresYang authored a paper 8 days ago

Step-Level Sparse Autoencoder for Reasoning Process Interpretation

TorresYang updated a collection 8 days ago

TorresYang updated a model 8 days ago

Miaow-Lab/SSAE-Checkpoints

authored a paper 8 days ago

Step-Level Sparse Autoencoder for Reasoning Process Interpretation

Paper • 2603.03031 • Published 9 days ago

updated a collection 8 days ago

SSAE

Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation' • 3 items • Updated 8 days ago • 1

updated a model 8 days ago

Miaow-Lab/SSAE-Checkpoints

Feature Extraction • Updated 8 days ago

updated a dataset 8 days ago

Miaow-Lab/SSAE-Dataset

Viewer • Updated 8 days ago • 1.28M • 43

updated a collection 14 days ago

SSAE

Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation' • 3 items • Updated 8 days ago • 1

published a model 15 days ago

Miaow-Lab/SSAE-Checkpoints

Feature Extraction • Updated 8 days ago

published a dataset 15 days ago

Miaow-Lab/SSAE-Dataset

Viewer • Updated 8 days ago • 1.28M • 43

updated a dataset about 1 month ago

Miaow-Lab/RLVR-Linearity-Dataset

Viewer • Updated Feb 1 • 40.3k • 26

updated a model about 1 month ago

Miaow-Lab/RLVR-Linearity-Checkpoints

Text Generation • Updated Jan 29

updated a collection about 1 month ago

RLVR Linearity

RL training and evaluation datasets, and checkpoints in 'Not All Steps are Informative: On the Linearity of LLMs’ RLVR Training' • 3 items • Updated Jan 26

updated a collection about 2 months ago

RLVR Linearity

RL training and evaluation datasets, and checkpoints in 'Not All Steps are Informative: On the Linearity of LLMs’ RLVR Training' • 3 items • Updated Jan 26

published a model about 2 months ago

Miaow-Lab/RLVR-Linearity-Checkpoints

Text Generation • Updated Jan 29

published a dataset about 2 months ago

Miaow-Lab/RLVR-Linearity-Dataset

Viewer • Updated Feb 1 • 40.3k • 26

authored a paper over 1 year ago

HAF-RM: A Hybrid Alignment Framework for Reward Model Training

Paper • 2407.04185 • Published Jul 4, 2024

authored 2 papers almost 2 years ago

ARKS: Active Retrieval in Knowledge Soup for Code Generation

Paper • 2402.12317 • Published Feb 19, 2024

ALaRM: Align Language Models via Hierarchical Rewards Modeling

Paper • 2403.06754 • Published Mar 11, 2024

authored a paper over 2 years ago

DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation

Paper • 2211.11501 • Published Nov 18, 2022