File size: 2,283 Bytes
3643128
9e74b5e
 
 
 
3643128
 
 
9e74b5e
 
3643128
 
9e74b5e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
---
title: MAKER Agent
emoji: πŸ”§
colorFrom: indigo
colorTo: purple
sdk: gradio
app_file: app.py
pinned: false
license: mit
short_description: Reliable AI Agent with Voting
---

# πŸ”§ MAKER Agent

**Reliable AI with Voting & Red-Flagging**

Based on the paper: [Solving a Million-Step LLM Task with Zero Errors](https://arxiv.org/abs/2511.09030)

## What is MAKER?

MAKER (Massively Decomposed Agentic Processes) is a framework that achieves **near-zero errors** over long task sequences by:

| Pillar | Description |
|--------|-------------|
| **Maximal Decomposition** | Break tasks into single-step atomic operations |
| **K-Voting** | Sample multiple times, winner needs K votes ahead |
| **Red-Flagging** | Discard suspicious outputs (don't repair them) |

## Key Insight

> *"Reliability is an engineering problem, not a model problem."*

Instead of waiting for better models, you can achieve near-zero errors TODAY using smaller, cheaper models with statistical voting.

The paper achieved **1,000,000 steps with ZERO errors** using gpt-4.1-mini!

## Features

- πŸ” **Web Search**: Optional DuckDuckGo search (free, no API key)
- πŸ“ **File Upload**: PDF, DOCX, TXT, images, and more
- πŸ—³οΈ **Voting**: Configurable K parameter for reliability vs speed
- 🚩 **Red-Flagging**: Automatic detection and discarding of suspicious outputs
- πŸ”Œ **Multiple LLMs**: OpenAI, Anthropic, Groq, Together, OpenRouter

## Usage

1. **Setup**: Enter your API key and configure K (votes needed)
2. **Query**: Ask questions with optional web search or file context
3. **Results**: See the winning answer with vote distribution

## Supported LLM Providers

| Provider | Models |
|----------|--------|
| OpenAI | gpt-4o-mini, gpt-4o, etc. |
| Anthropic | claude-sonnet, claude-opus |
| Groq | llama-3.3-70b (fast!) |
| Together | Llama, Mistral, etc. |
| OpenRouter | 100+ models |

## Local Development

```bash
git clone https://huggingface.co/spaces/YOUR_USERNAME/maker-agent
cd maker-agent
pip install -r requirements.txt
python app.py
```

## Paper Citation

```bibtex
@article{meyerson2025maker,
  title={Solving a Million-Step LLM Task with Zero Errors},
  author={Meyerson, Elliot and others},
  journal={arXiv preprint arXiv:2511.09030},
  year={2025}
}
```

## License

MIT