Multimodal Markup Document Models (MarkupDM)

arxiv paper

teaser

This repository provides the pre-trained MarkupDM model for graphic design completion. The model can automatically complete partially designed graphics by generating appropriate text/visual content, positioning, and styling, as demonstrated in our paper Multimodal Markup Document Models for Graphic Design Completion.

Usage

For detailed usage instructions, please refer to the MarkupDM GitHub repository.

License

This repository is released under the Apache-2.0 license.

Citation

@inproceedings{Kikuchi2025,
  title     = {Multimodal Markup Document Models for Graphic Design Completion},
  author    = {Kotaro Kikuchi and Ukyo Honda and Naoto Inoue and Mayu Otani and Edgar Simo-Serra and Kota Yamaguchi},
  booktitle = {ACM International Conference on Multimedia},
  year      = {2025},
  doi       = {10.1145/3746027.3755420}
}
Downloads last month
83
Safetensors
Model size
7B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for cyberagent/markupdm

Finetuned
(4)
this model

Dataset used to train cyberagent/markupdm

Paper for cyberagent/markupdm