Jiongxiao Wang

Jayfeather1024

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published 7 days ago • 16

submitted a paper to Daily Papers 1 day ago

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published 7 days ago • 16

upvoted a paper 8 days ago

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published 9 days ago • 114

updated a model 10 months ago

Jayfeather1024/DeepSeek-R1-Distill-Qwen-32B-target

Text Generation • 33B • Updated Feb 28 • 9

published a model 10 months ago

Jayfeather1024/DeepSeek-R1-Distill-Qwen-32B-target

Text Generation • 33B • Updated Feb 28 • 9

updated a dataset over 1 year ago

Jayfeather1024/PKU-SafeRLHF-30K-Embedding-Reward

Viewer • Updated May 30, 2024 • 26.9k • 29

authored a paper over 1 year ago

ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback

Paper • 2305.18090 • Published May 29, 2023

updated 2 models almost 2 years ago

Jayfeather1024/rm_30k

Updated Mar 21, 2024 • 8

Jayfeather1024/sft

Text Generation • Updated Mar 21, 2024 • 7

upvoted a paper almost 2 years ago

On the Exploitability of Instruction Tuning

Paper • 2306.17194 • Published Jun 28, 2023 • 9

updated a model almost 2 years ago

Jayfeather1024/alpaca_struq

Text Generation • 7B • Updated Mar 8, 2024 • 12 • 1

updated 2 datasets almost 2 years ago

Jayfeather1024/Reward-Embeddings

Preview • Updated Jan 4, 2024 • 44

Jayfeather1024/Reward-Embeddings-30k

Preview • Updated Jan 4, 2024 • 55

upvoted a paper about 2 years ago

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Paper • 2310.20624 • Published Oct 31, 2023 • 13

authored a paper over 2 years ago

On the Exploitability of Instruction Tuning

Paper • 2306.17194 • Published Jun 28, 2023 • 9
