Rongjiehuang
/

FastDiff

diffusion probabilistic model

Model card Files Files and versions

FastDiff Model Card

Model Details

Model type: Diffusion-based text-to-speech generation model
Language(s): English
Model Description: A conditional diffusion probabilistic model capable of generating high fidelity speech efficiently.
Resources for more information: FastDiff GitHub Repository, FastDiff Paper.

Cite as:

@inproceedings{huang2022fastdiff,
   title={FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis},
   author={Huang, Rongjie and Lam, Max WY and Wang, Jun and Su, Dan and Yu, Dong and Ren, Yi and Zhao, Zhou},
   booktitle = {Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, {IJCAI-22}},
   year={2022}

This model card was written based on the DALL-E Mini model card.

Downloads last month: -; Downloads are not tracked for this model. How to track

Paper for Rongjiehuang/FastDiff

FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis

Paper • 2204.09934 • Published Apr 21, 2022