Improve model card: Add metadata and update paper links and usage snippet

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +27 -20
README.md CHANGED
@@ -1,10 +1,16 @@
 
 
 
 
 
 
1
  # Explore–Execute Chain (E2C) Model
2
 
3
  This repository hosts the **pretrained and fine-tuned Explore–Execute Chain (E2C) models**.
4
 
5
- **Paper:** *ExploreExecute Chain: Towards an Efficient Structured Reasoning Paradigm*
6
- *Kaisen Yang, Lixuan He, Rushi Shah, Kaicheng Yang, Qinwei Ma, Dianbo Liu, Alex Lamb*
7
- > Under review at ICLR 2026
8
 
9
  **Code:** [GitHub – Explore–Execute Chain](https://github.com/yks23/Explore-Execute-Chain)
10
 
@@ -14,24 +20,24 @@ This repository hosts the **pretrained and fine-tuned Explore–Execute Chain (E
14
 
15
  E2C is a **two-stage reasoning framework** designed to improve the efficiency and interpretability of large language models (LLMs):
16
 
17
- 1. **Exploration** — Generate lightweight reasoning sketches (plans).
18
- 2. **Execution** — Execute selected plans faithfully for high-quality results.
19
 
20
  **Benefits:**
21
- - Efficient reasoning with minimal computation
22
- - Explicit, interpretable exploration traces
23
- - Easy domain adaptation with minimal supervision
24
 
25
  ---
26
 
27
  ## 🚀 Key Features
28
 
29
- - **Two-stage training**
30
- - **E2C-SFT** — Supervised fine-tuning on exploration–execution pairs
31
- - **E2C-RL** — Reinforcement learning to refine execution
32
- - **Efficient adaptation (EF-SFT)** — Adapt with exploration-only data
33
- - **Test-time scaling** — Aggregate multiple explorations for better results
34
- - Benchmarked on **mathematical** and **medical reasoning** datasets
35
 
36
  ---
37
 
@@ -39,25 +45,26 @@ E2C is a **two-stage reasoning framework** designed to improve the efficiency an
39
 
40
  ```python
41
  from transformers import AutoModelForCausalLM, AutoTokenizer
 
42
 
43
  model_name = "KaisenYang/Explore-Execute-Chain"
44
  model_type = "8B-Final" # change to the subfolder you want to use
45
 
46
  tokenizer = AutoTokenizer.from_pretrained(model_name, subfolder=model_type)
47
- model = AutoModelForCausalLM.from_pretrained(model_name, subfolder=model_type)
48
 
49
  # Test example: Fibonacci sequence
50
- inputs = tokenizer("What is the 10th number in the Fibonacci sequence?", return_tensors="pt")
51
  outputs = model.generate(**inputs)
52
  print(tokenizer.decode(outputs[0]))
53
- ````
54
 
55
  ---
56
 
57
  ## 🔗 Links
58
 
59
- * 📂 **Full code and experiments:** [GitHub Repository](https://github.com/yks23/Explore-Execute-Chain)
60
- * 📜 **Paper (under review):** ICLR 2026 submission
61
 
62
  ---
63
 
@@ -80,4 +87,4 @@ If you use this work, please cite:
80
  ## 🧾 License
81
 
82
  This project is licensed under the **MIT License**.
83
- See the [LICENSE](https://github.com/yks23/Explore-Execute-Chain/blob/main/LICENSE) file for details.
 
1
+ ---
2
+ license: mit
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ ---
6
+
7
  # Explore–Execute Chain (E2C) Model
8
 
9
  This repository hosts the **pretrained and fine-tuned Explore–Execute Chain (E2C) models**.
10
 
11
+ **Paper:** [Explore-Execute Chain: Towards an Efficient Structured Reasoning Paradigm](https://huggingface.co/papers/2509.23946)
12
+ *Kaisen Yang, Lixuan He, Rushi Shah, Kaicheng Yang, Qinwei Ma, Dianbo Liu, Alex Lamb*
13
+ > Under review at ICLR 2026
14
 
15
  **Code:** [GitHub – Explore–Execute Chain](https://github.com/yks23/Explore-Execute-Chain)
16
 
 
20
 
21
  E2C is a **two-stage reasoning framework** designed to improve the efficiency and interpretability of large language models (LLMs):
22
 
23
+ 1. **Exploration** — Generate lightweight reasoning sketches (plans).
24
+ 2. **Execution** — Execute selected plans faithfully for high-quality results.
25
 
26
  **Benefits:**
27
+ - Efficient reasoning with minimal computation
28
+ - Explicit, interpretable exploration traces
29
+ - Easy domain adaptation with minimal supervision
30
 
31
  ---
32
 
33
  ## 🚀 Key Features
34
 
35
+ - **Two-stage training**
36
+ - **E2C-SFT** — Supervised fine-tuning on exploration–execution pairs
37
+ - **E2C-RL** — Reinforcement learning to refine execution
38
+ - **Efficient adaptation (EF-SFT)** — Adapt with exploration-only data
39
+ - **Test-time scaling** — Aggregate multiple explorations for better results
40
+ - Benchmarked on **mathematical** and **medical reasoning** datasets
41
 
42
  ---
43
 
 
45
 
46
  ```python
47
  from transformers import AutoModelForCausalLM, AutoTokenizer
48
+ import torch
49
 
50
  model_name = "KaisenYang/Explore-Execute-Chain"
51
  model_type = "8B-Final" # change to the subfolder you want to use
52
 
53
  tokenizer = AutoTokenizer.from_pretrained(model_name, subfolder=model_type)
54
+ model = AutoModelForCausalLM.from_pretrained(model_name, subfolder=model_type, torch_dtype=torch.bfloat16, device_map="auto")
55
 
56
  # Test example: Fibonacci sequence
57
+ inputs = tokenizer("What is the 10th number in the Fibonacci sequence?", return_tensors="pt").to(model.device)
58
  outputs = model.generate(**inputs)
59
  print(tokenizer.decode(outputs[0]))
60
+ ```
61
 
62
  ---
63
 
64
  ## 🔗 Links
65
 
66
+ * 📂 **Full code and experiments:** [GitHub Repository](https://github.com/yks23/Explore-Execute-Chain)
67
+ * 📜 **Paper (under review):** [ICLR 2026 submission](https://huggingface.co/papers/2509.23946)
68
 
69
  ---
70
 
 
87
  ## 🧾 License
88
 
89
  This project is licensed under the **MIT License**.
90
+ See the [LICENSE](https://github.com/yks23/Explore-Execute-Chain/blob/main/LICENSE) file for details.