Kashif Rasul
kashif
393 followers · 92 following
krasul
kashif
AI & ML interests
Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning
Recent Activity
liked
a Space
1 day ago
multimodalart/LLaDA-2-1
new
activity
2 days ago
kuleshov-group/bd3lm-owt-block_size16:
Add post_init() and register_buffer(persistent=False) for transformers v5
new
activity
2 days ago
kuleshov-group/bd3lm-owt-block_size8:
Add post_init() and register_buffer(persistent=False) for transformers v5
kashif's activity
New activity in
kuleshov-group/bd3lm-owt-block_size16
2 days ago
Add post_init() and register_buffer(persistent=False) for transformers v5
#3 opened 2 days ago by
kashif
New activity in
kuleshov-group/bd3lm-owt-block_size8
2 days ago
Add post_init() and register_buffer(persistent=False) for transformers v5
#2 opened 2 days ago by
kashif
commented
a paper
3 days ago
LLaDA2.1: Speeding Up Text Diffusion via Token Editing
Paper • 2602.08676 • Published Feb 9
New activity in
kuleshov-group/bd3lm-owt-block_size4
3 days ago
Add post_init() call for transformers v5 compatibility
#2 opened 3 days ago by
kashif
New activity in
inclusionAI/LLaDA2.1-mini
9 days ago
fixes for transformers v5
#4 opened 13 days ago by
kashif
New activity in
inclusionAI/LLaDA2.0-mini-CAP
13 days ago
fix: align RotaryEmbedding and _init_weights with Qwen2Moe for transformers compat
#2 opened 13 days ago by
kashif
New activity in
inclusionAI/LLaDA2.1-flash
13 days ago
fix: align RotaryEmbedding with Qwen2Moe pattern for transformers compat
#4 opened 13 days ago by
kashif
New activity in
nyu-visionx/RAE-mae-base-p16-ViTXL-n08
27 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened 27 days ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
27 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened 27 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
27 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened 27 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-small-ViTXL-n08
27 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened 27 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-base-ViTXL-n08-i512
27 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened 27 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-base-ViTXL-n08
27 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened 27 days ago by
kashif
New activity in
google/timesfm-2.5-200m-transformers
30 days ago
updated config and weights
#3 opened 30 days ago by
kashif
Upload 2 files
#2 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
about 1 month ago
Remap encoder keys to match SiglipVisionModel key layout
#3 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
about 1 month ago
Add encoder_num_hidden_layers=24 to config
#3 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-mae-base-p16-ViTXL-n08
about 1 month ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
about 1 month ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
about 1 month ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 1 month ago by
kashif