---
license: apache-2.0
pipeline_tag: text-to-image
tags:
- text-to-image
- image-generation
- yandex
---
Alice AI ART dev
---
by Yandex

![teaser_figure.JPG](teaser_figure.JPG)

Alice AI ART dev is 4.8B parameter diffusion UNet model capable of generating images from text prompts.

Key features
---

* **Relevance** A considerable amount of work was done to improve text-to-image alignment. According to the Side-by-Side evaluation, our model is competitive with Qwen-Image, despite being significantly smaller (4.8B parameters vs 20B parameters).
* **Aesthetics** Our model is capable of generating high-quality images with a wide range of styles and themes.
* **Accessibility** Alice AI ART dev is runnable on consumer-grade[^1] GPUs (for instance, NVIDIA RTX 3090) making it accessible to a wider audience.

[^1] with weight offloading

Usage
---
The image generation pipeline can be loaded a follows
```python
pipe = YandexArtOSPipeline.from_pretrained(
    "yandex_art_os",
    cpu_offload=True
)
```
For memory-constrained GPUs we recommend to turn on `cpu_offload` flag:

By default we use following sampling parameters:
```python
{
    "num_inference_steps": 32,
    "cond_scale": 2.75,
    "unet_switch_timestep": 8,
    "karras_rho": 6.0,
    "method_name": "dpm-multistep",
    "sampler_kwargs": {
        "num_train_timesteps": 1000,
        "beta_start": 0.00001013,
        "beta_end": 0.019771934,
        "use_karras_sigmas": True,
        "algorithm_type": "sde-dpmsolver++"
    }
}
```