| | ---
|
| | title: FARA - Computer Use Agent
|
| | emoji: π€
|
| | colorFrom: blue
|
| | colorTo: purple
|
| | sdk: docker
|
| | pinned: false
|
| | license: mit
|
| | app_port: 7860
|
| | suggested_hardware: cpu-upgrade
|
| | tags:
|
| | - computer-use
|
| | - browser-automation
|
| | - ai-agent
|
| | - vision-language-model
|
| | ---
|
| |
|
| | # π€ FARA - Computer Use Agent Demo
|
| |
|
| | FARA (Fara Agent for Real-world Automation) is an AI agent that can browse the web and complete tasks autonomously.
|
| |
|
| | ## Features
|
| |
|
| | - π **Autonomous Web Navigation** - The agent can browse websites on its own
|
| | - π **Web Search** - Search for information across the web
|
| | - π **Form Filling** - Fill out forms automatically
|
| | - π±οΈ **Point and Click** - Click buttons, links, and elements
|
| | - β¨οΈ **Text Input** - Type text into fields
|
| | - π **Page Scrolling** - Scroll through content
|
| |
|
| | ## How to Use
|
| |
|
| | 1. Enter a task in natural language (e.g., "Search for the latest news about AI")
|
| | 2. Click "Run Task" and watch the agent work!
|
| | 3. View the screenshots to see each step the agent takes
|
| |
|
| | ## Powered By
|
| |
|
| | - **Microsoft Fara-7B** - Vision-Language Model for computer use
|
| | - **Playwright** - Browser automation framework
|
| | - **Modal** - Model hosting and inference
|
| |
|
| | ## Links
|
| |
|
| | - [GitHub Repository](https://github.com/microsoft/fara)
|
| |
|
| | ## License
|
| |
|
| | MIT License
|
| |
|