martin
commited on
Commit
·
e719f93
1
Parent(s):
cf583b0
update urls
Browse files
README.md
CHANGED
|
@@ -12,32 +12,32 @@ This repository contains the Multimodal Large Language Model (LLM) component of
|
|
| 12 |
### Clone audio
|
| 13 |
| role | prompt wav | clone wav |
|
| 14 |
|:-------:|:-------:|:-------:|
|
| 15 |
-
|于谦| <audio controls src="https://
|
| 16 |
-
|李雪琴| <audio controls src="https://
|
| 17 |
|
| 18 |
### Speed control
|
| 19 |
| prompt | response |
|
| 20 |
|:-------:|:-------:|
|
| 21 |
-
|Human: 说一个绕口令<br>Assistant: 吃葡萄不吐葡萄皮,不吃葡萄倒吐葡萄皮<br>Human: 哎,你能把这个绕口令说的再快一点吗?|<audio controls src="https://
|
| 22 |
-
|Human: 说一个绕口令<br>Assistant: 吃葡萄不吐葡萄皮,不吃葡萄倒吐葡萄皮<br>Human: 哎,你能把这个绕口令说的再快一点吗?<br>Assistant: 吃葡萄不吐葡萄皮,不吃葡萄倒吐葡萄皮<br>Human: 呃,你再用非常非常慢的速度说一遍的。|<audio controls src="https://
|
| 23 |
|
| 24 |
### High EQ(emotional control & tone control)
|
| 25 |
| prompt | response |
|
| 26 |
|:-------:|:-------:|
|
| 27 |
-
|Human: 你这语气又不撒娇又不卖萌的,要不你撒个娇卖个萌吧。|<audio controls src="https://
|
| 28 |
-
|Human: 怎么办?我感觉我的人生很失败。|<audio controls src="https://
|
| 29 |
-
|Human: 小跃。你真的是。特别厉害。|<audio controls src="https://
|
| 30 |
|
| 31 |
### Multilingual (e.g., Chinese, English, Japanese)
|
| 32 |
| prompt | response |
|
| 33 |
|:-------:|:-------:|
|
| 34 |
-
|Human: What did the speaker mean when they said, it's raining cats and dogs?<br>Assistant: When they say "It's raining cats and dogs," it just means it's raining really hard. The speaker isn't literally saying cats and dogs are falling from the sky! It's just a fun way to describe heavy rain.|<audio controls src="https://
|
| 35 |
-
|Human: こんにちは。(你好)<br>Assistant
|
| 36 |
|
| 37 |
### Rap & Vocal
|
| 38 |
| prompt | response |
|
| 39 |
|:-------:|:-------:|
|
| 40 |
-
|human:唱一段rap|<audio controls src="https://
|
| 41 |
|
| 42 |
|
| 43 |
## 3. Evaluation
|
|
|
|
| 12 |
### Clone audio
|
| 13 |
| role | prompt wav | clone wav |
|
| 14 |
|:-------:|:-------:|:-------:|
|
| 15 |
+
|于谦| <audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/prompt_wav_yuqian.wav"></audio>|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/clone_wav_yuqian.wav"></audio>|
|
| 16 |
+
|李雪琴| <audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/prompt_wav_lixueqin.wav"></audio>|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/clone_wav_lixueqin.wav"></audio>|
|
| 17 |
|
| 18 |
### Speed control
|
| 19 |
| prompt | response |
|
| 20 |
|:-------:|:-------:|
|
| 21 |
+
|Human: 说一个绕口令<br>Assistant: 吃葡萄不吐葡萄皮,不吃葡萄倒吐葡萄皮<br>Human: 哎,你能把这个绕口令说的再快一点吗?|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/speed_control1.wav"></audio>|
|
| 22 |
+
|Human: 说一个绕口令<br>Assistant: 吃葡萄不吐葡萄皮,不吃葡萄倒吐葡萄皮<br>Human: 哎,你能把这个绕口令说的再快一点吗?<br>Assistant: 吃葡萄不吐葡萄皮,不吃葡萄倒吐葡萄皮<br>Human: 呃,你再用非常非常慢的速度说一遍的。|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/speed_control2.wav"></audio>|
|
| 23 |
|
| 24 |
### High EQ(emotional control & tone control)
|
| 25 |
| prompt | response |
|
| 26 |
|:-------:|:-------:|
|
| 27 |
+
|Human: 你这语气又不撒娇又不卖萌的,要不你撒个娇卖个萌吧。|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/tone_control.wav"></audio>|
|
| 28 |
+
|Human: 怎么办?我感觉我的人生很失败。|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/emotional_control1.wav"></audio>|
|
| 29 |
+
|Human: 小跃。你真的是。特别厉害。|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/emotional_control2.wav"></audio>|
|
| 30 |
|
| 31 |
### Multilingual (e.g., Chinese, English, Japanese)
|
| 32 |
| prompt | response |
|
| 33 |
|:-------:|:-------:|
|
| 34 |
+
|Human: What did the speaker mean when they said, it's raining cats and dogs?<br>Assistant: When they say "It's raining cats and dogs," it just means it's raining really hard. The speaker isn't literally saying cats and dogs are falling from the sky! It's just a fun way to describe heavy rain.|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/multilingual2.wav"></audio>|
|
| 35 |
+
|Human: こんにちは。(你好)<br>Assistant:こんにちは!何か手伝いましょうか?(您好!我可以帮你做点什么吗���)|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/multilingual1.wav"></audio>|
|
| 36 |
|
| 37 |
### Rap & Vocal
|
| 38 |
| prompt | response |
|
| 39 |
|:-------:|:-------:|
|
| 40 |
+
|human:唱一段rap|<audio controls src="https://huggingface.co/stepfun-ai/Step-Audio-Examples/resolve/main/rap.wav"></audio>|
|
| 41 |
|
| 42 |
|
| 43 |
## 3. Evaluation
|