--- license: apache-2.0 pipeline_tag: text-to-image tags: - text-to-image - image-generation - yandex --- Alice AI ART dev --- by Yandex ![teaser_figure.JPG](teaser_figure.JPG) Alice AI ART dev is 4.8B parameter diffusion UNet model capable of generating images from text prompts. Key features --- * **Relevance** A considerable amount of work was done to improve text-to-image alignment. According to the Side-by-Side evaluation, our model is competitive with Qwen-Image, despite being significantly smaller (4.8B parameters vs 20B parameters). * **Aesthetics** Our model is capable of generating high-quality images with a wide range of styles and themes. * **Accessibility** Alice AI ART dev is runnable on consumer-grade[^1] GPUs (for instance, NVIDIA RTX 3090) making it accessible to a wider audience. [^1] with weight offloading Usage --- The image generation pipeline can be loaded a follows ```python pipe = YandexArtOSPipeline.from_pretrained( "yandex_art_os", cpu_offload=True ) ``` For memory-constrained GPUs we recommend to turn on `cpu_offload` flag: By default we use following sampling parameters: ```python { "num_inference_steps": 32, "cond_scale": 2.75, "unet_switch_timestep": 8, "karras_rho": 6.0, "method_name": "dpm-multistep", "sampler_kwargs": { "num_train_timesteps": 1000, "beta_start": 0.00001013, "beta_end": 0.019771934, "use_karras_sigmas": True, "algorithm_type": "sde-dpmsolver++" } } ```