AI & ML interests

A one-year-long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

christopher 
in bigscience/bloom 13 days ago

[SPAM] Deleted

3
#289 opened 13 days ago by
sarthak-saxena
stas 
posted an update 15 days ago
Good news! Ulysses Sequence Parallelism, from the Snowflake AI Research and DeepSpeed teams, has been integrated into the Hugging Face Trainer, Accelerate, and TRL.

For extensive details please see this writeup:
https://huggingface.co/blog/ulysses-sp

Thanks a lot to Kashif Rasul for helping make it happen, and to the others on the HF team who helped with the integration.
christopher 
in bigscience/bloom 20 days ago

pretokenizer Regex issues?

8
#278 opened over 1 year ago by
hpcpony
christopher 
in bigscience/bloom 25 days ago

Test PR

#286 opened 25 days ago by
FIRSTACCOUNT69

Test discussion

#287 opened 25 days ago by
FIRSTACCOUNT69

Test discussion

#288 opened 25 days ago by
FIRSTACCOUNT69

Bloom

#2 opened 4 months ago by
Raz-Test