Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,32 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
+
datasets:
|
| 4 |
+
- OpenAssistant/oasst1
|
| 5 |
+
language:
|
| 6 |
+
- en
|
| 7 |
---
|
| 8 |
+
|
| 9 |
+
## 🐋 Humback
|
| 10 |
+
|
| 11 |
+
The proposed Humback is a novel framework that can augment the instruction data for supervised fine-tuning with high quality.
|
| 12 |
+
|
| 13 |
+
This is a backward model $M_{yx}$ for [Humback](https://arxiv.org/pdf/2308.06259.pdf) reproduction.
|
| 14 |
+
|
| 15 |
+
This model is trained on the seed data in a reversed order (generate instruction given response).
|
| 16 |
+
|
| 17 |
+
The seed data is a sampled dataset from [oasst1](https://huggingface.co/datasets/OpenAssistant/oasst1).
|
| 18 |
+
|
| 19 |
+
You may find more details and usage examples in [Spico197/Humback](https://github.com/Spico197/Humback) .
|
| 20 |
+
|
| 21 |
+
## 📜 Reference
|
| 22 |
+
|
| 23 |
+
```bibtex
|
| 24 |
+
@misc{li2023selfalignment,
|
| 25 |
+
title={Self-Alignment with Instruction Backtranslation},
|
| 26 |
+
author={Xian Li and Ping Yu and Chunting Zhou and Timo Schick and Luke Zettlemoyer and Omer Levy and Jason Weston and Mike Lewis},
|
| 27 |
+
year={2023},
|
| 28 |
+
eprint={2308.06259},
|
| 29 |
+
archivePrefix={arXiv},
|
| 30 |
+
primaryClass={cs.CL}
|
| 31 |
+
}
|
| 32 |
+
```
|