Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Dhruvil Satasiya's picture
5 2

Dhruvil Satasiya

dikro
upgraedd's profile picture 21world's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
updated a Space 7 days ago
dikro/Genre-Classification
published a Space 7 days ago
dikro/Genre-Classification
View all activity

Organizations

Cohere Labs Community's profile picture

upvoted a paper 4 days ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 8 days ago • 74
upvoted 3 papers over 1 year ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 94

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published Dec 10, 2024 • 28

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 72
upvoted a collection over 1 year ago

SpeechT5

Collection
The SpeechT5 framework consists of a shared seq2seq and six modal-specific (speech/text) pre/post-nets that can address a few audio-related tasks. • 8 items • Updated May 1, 2025 • 28
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs