Improve model card: Add metadata and update paper links and usage snippet
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,10 +1,16 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# Explore–Execute Chain (E2C) Model
|
| 2 |
|
| 3 |
This repository hosts the **pretrained and fine-tuned Explore–Execute Chain (E2C) models**.
|
| 4 |
|
| 5 |
-
**Paper:**
|
| 6 |
-
*Kaisen Yang, Lixuan He, Rushi Shah, Kaicheng Yang, Qinwei Ma, Dianbo Liu, Alex Lamb*
|
| 7 |
-
> Under review at ICLR 2026
|
| 8 |
|
| 9 |
**Code:** [GitHub – Explore–Execute Chain](https://github.com/yks23/Explore-Execute-Chain)
|
| 10 |
|
|
@@ -14,24 +20,24 @@ This repository hosts the **pretrained and fine-tuned Explore–Execute Chain (E
|
|
| 14 |
|
| 15 |
E2C is a **two-stage reasoning framework** designed to improve the efficiency and interpretability of large language models (LLMs):
|
| 16 |
|
| 17 |
-
1.
|
| 18 |
-
2.
|
| 19 |
|
| 20 |
**Benefits:**
|
| 21 |
-
- Efficient reasoning with minimal computation
|
| 22 |
-
- Explicit, interpretable exploration traces
|
| 23 |
-
- Easy domain adaptation with minimal supervision
|
| 24 |
|
| 25 |
---
|
| 26 |
|
| 27 |
## 🚀 Key Features
|
| 28 |
|
| 29 |
-
-
|
| 30 |
-
|
| 31 |
-
|
| 32 |
-
-
|
| 33 |
-
-
|
| 34 |
-
-
|
| 35 |
|
| 36 |
---
|
| 37 |
|
|
@@ -39,25 +45,26 @@ E2C is a **two-stage reasoning framework** designed to improve the efficiency an
|
|
| 39 |
|
| 40 |
```python
|
| 41 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
|
|
|
| 42 |
|
| 43 |
model_name = "KaisenYang/Explore-Execute-Chain"
|
| 44 |
model_type = "8B-Final" # change to the subfolder you want to use
|
| 45 |
|
| 46 |
tokenizer = AutoTokenizer.from_pretrained(model_name, subfolder=model_type)
|
| 47 |
-
model = AutoModelForCausalLM.from_pretrained(model_name, subfolder=model_type)
|
| 48 |
|
| 49 |
# Test example: Fibonacci sequence
|
| 50 |
-
inputs = tokenizer("What is the 10th number in the Fibonacci sequence?", return_tensors="pt")
|
| 51 |
outputs = model.generate(**inputs)
|
| 52 |
print(tokenizer.decode(outputs[0]))
|
| 53 |
-
```
|
| 54 |
|
| 55 |
---
|
| 56 |
|
| 57 |
## 🔗 Links
|
| 58 |
|
| 59 |
-
*
|
| 60 |
-
*
|
| 61 |
|
| 62 |
---
|
| 63 |
|
|
@@ -80,4 +87,4 @@ If you use this work, please cite:
|
|
| 80 |
## 🧾 License
|
| 81 |
|
| 82 |
This project is licensed under the **MIT License**.
|
| 83 |
-
See the [LICENSE](https://github.com/yks23/Explore-Execute-Chain/blob/main/LICENSE) file for details.
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
library_name: transformers
|
| 4 |
+
pipeline_tag: text-generation
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
# Explore–Execute Chain (E2C) Model
|
| 8 |
|
| 9 |
This repository hosts the **pretrained and fine-tuned Explore–Execute Chain (E2C) models**.
|
| 10 |
|
| 11 |
+
**Paper:** [Explore-Execute Chain: Towards an Efficient Structured Reasoning Paradigm](https://huggingface.co/papers/2509.23946)
|
| 12 |
+
*Kaisen Yang, Lixuan He, Rushi Shah, Kaicheng Yang, Qinwei Ma, Dianbo Liu, Alex Lamb*
|
| 13 |
+
> Under review at ICLR 2026
|
| 14 |
|
| 15 |
**Code:** [GitHub – Explore–Execute Chain](https://github.com/yks23/Explore-Execute-Chain)
|
| 16 |
|
|
|
|
| 20 |
|
| 21 |
E2C is a **two-stage reasoning framework** designed to improve the efficiency and interpretability of large language models (LLMs):
|
| 22 |
|
| 23 |
+
1. **Exploration** — Generate lightweight reasoning sketches (plans).
|
| 24 |
+
2. **Execution** — Execute selected plans faithfully for high-quality results.
|
| 25 |
|
| 26 |
**Benefits:**
|
| 27 |
+
- Efficient reasoning with minimal computation
|
| 28 |
+
- Explicit, interpretable exploration traces
|
| 29 |
+
- Easy domain adaptation with minimal supervision
|
| 30 |
|
| 31 |
---
|
| 32 |
|
| 33 |
## 🚀 Key Features
|
| 34 |
|
| 35 |
+
- **Two-stage training**
|
| 36 |
+
- **E2C-SFT** — Supervised fine-tuning on exploration–execution pairs
|
| 37 |
+
- **E2C-RL** — Reinforcement learning to refine execution
|
| 38 |
+
- **Efficient adaptation (EF-SFT)** — Adapt with exploration-only data
|
| 39 |
+
- **Test-time scaling** — Aggregate multiple explorations for better results
|
| 40 |
+
- Benchmarked on **mathematical** and **medical reasoning** datasets
|
| 41 |
|
| 42 |
---
|
| 43 |
|
|
|
|
| 45 |
|
| 46 |
```python
|
| 47 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
| 48 |
+
import torch
|
| 49 |
|
| 50 |
model_name = "KaisenYang/Explore-Execute-Chain"
|
| 51 |
model_type = "8B-Final" # change to the subfolder you want to use
|
| 52 |
|
| 53 |
tokenizer = AutoTokenizer.from_pretrained(model_name, subfolder=model_type)
|
| 54 |
+
model = AutoModelForCausalLM.from_pretrained(model_name, subfolder=model_type, torch_dtype=torch.bfloat16, device_map="auto")
|
| 55 |
|
| 56 |
# Test example: Fibonacci sequence
|
| 57 |
+
inputs = tokenizer("What is the 10th number in the Fibonacci sequence?", return_tensors="pt").to(model.device)
|
| 58 |
outputs = model.generate(**inputs)
|
| 59 |
print(tokenizer.decode(outputs[0]))
|
| 60 |
+
```
|
| 61 |
|
| 62 |
---
|
| 63 |
|
| 64 |
## 🔗 Links
|
| 65 |
|
| 66 |
+
* 📂 **Full code and experiments:** [GitHub Repository](https://github.com/yks23/Explore-Execute-Chain)
|
| 67 |
+
* 📜 **Paper (under review):** [ICLR 2026 submission](https://huggingface.co/papers/2509.23946)
|
| 68 |
|
| 69 |
---
|
| 70 |
|
|
|
|
| 87 |
## 🧾 License
|
| 88 |
|
| 89 |
This project is licensed under the **MIT License**.
|
| 90 |
+
See the [LICENSE](https://github.com/yks23/Explore-Execute-Chain/blob/main/LICENSE) file for details.
|