Machine981 commited on
Commit
cb87745
·
verified ·
1 Parent(s): 12b666e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +60 -0
README.md ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
+ tags:
6
+ - text-generation
7
+ - nlp
8
+ datasets:
9
+ - DeepMath103K
10
+ metrics:
11
+ - avg@1 / pass@k
12
+ base_model:
13
+ - Deepseek-R1-Distill-Qwen-1.5B
14
+ pipeline_tag: text-generation
15
+ library_name: transformers
16
+ arxiv: 2604.10688
17
+ ---
18
+
19
+ # Model Name
20
+
21
+ SCOPE-Deepseek-R1-Distill-Qwen-1.5B
22
+
23
+ This model is introduced in the paper
24
+ [SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting](https://arxiv.org/abs/2604.10688)
25
+ and is developed by the **Longcat Interaction Team**.
26
+
27
+ ## Model Details
28
+
29
+ ### Model Description
30
+
31
+ - **Developed by:** Longcat Interaction Team
32
+ - **Language(s) (NLP):** English
33
+ - **License:** Apache 2.0
34
+ - **Finetuned from model:** Deepseek-R1-Distill-Qwen-1.5B
35
+ - **Paper:** [arxiv.org/abs/2604.10688](https://arxiv.org/abs/2604.10688)
36
+
37
+ ### Model Sources
38
+
39
+ - **Repository:** [https://github.com/machine981/SCOPE](https://github.com/machine981/SCOPE)
40
+ - **Paper:** [https://arxiv.org/abs/2604.10688](https://arxiv.org/abs/2604.10688)
41
+
42
+ ## Uses
43
+
44
+ ### Direct Use
45
+
46
+ This model can be used directly for text generation (like MATH reasoning) without any additional fine-tuning.
47
+
48
+ ## How to Get Started with the Model
49
+
50
+ Use the code below to get started with the model:
51
+
52
+ ```python
53
+ from transformers import AutoTokenizer, AutoModelForCausalLM # adjust as needed
54
+
55
+ tokenizer = AutoTokenizer.from_pretrained("Machine981/SCOPE-Deepseek-R1-Distill-Qwen-1.5B")
56
+ model = AutoModelForCausalLM.from_pretrained("Machine981/SCOPE-Deepseek-R1-Distill-Qwen-1.5B")
57
+
58
+ inputs = tokenizer("Your input text here", return_tensors="pt")
59
+ outputs = model.generate(**inputs)
60
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))