agentlans/crash-course
Viewer • Updated • 1.33M • 89 • 1
The model fails to end replies properly when used with some system prompts. If this is a problem, consider using agentlans/Llama3.1-Daredevilish-Instruct in instruct mode.
The model was created using mergekit with the following merge configuration:
models:
- model: DreadPoor/LemonP-8B-Model_Stock
parameters:
density: 0.6
weight: 0.16
- model: Youlln/1PARAMMYL-8B-ModelStock
parameters:
density: 0.6
weight: 0.13
- model: jaspionjader/f-2-8b
parameters:
density: 0.6
weight: 0.10
- model: Etherll/SuperHermes
parameters:
density: 0.6
weight: 0.08
merge_method: dare_ties
base_model: meta-llama/Llama-3.1-8B
dtype: bfloat16
This experimental model is designed for research and development purposes. Users should be aware of potential biases and limitations inherent in language models. Always validate outputs and use the model responsibly.
Further evaluation and fine-tuning may be necessary to optimize performance across various tasks. Researchers are encouraged to build upon this experimental merge to advance the capabilities of Llama-based models.
Detailed results can be found here! Summarized results can be found here!
| Metric | Value (%) |
|---|---|
| Average | 25.54 |
| IFEval (0-Shot) | 62.92 |
| BBH (3-Shot) | 29.20 |
| MATH Lvl 5 (4-Shot) | 12.76 |
| GPQA (0-shot) | 6.82 |
| MuSR (0-shot) | 11.60 |
| MMLU-PRO (5-shot) | 29.96 |