hal-utokyo/Manga109-s
Updated • 122 • 31
How to use bluolightning/PaddleOCRv5-Server-Det-For-Manga with PaddleOCR:
# 1. See https://www.paddlepaddle.org.cn/en/install to install paddlepaddle
# 2. pip install paddleocr
from paddleocr import TextDetection
model = TextDetection(model_name="PaddleOCRv5-Server-Det-For-Manga")
output = model.predict(input="path/to/image.png", batch_size=1)
for res in output:
res.print()
res.save_to_img(save_path="./output/")
res.save_to_json(save_path="./output/res.json")This is a fine-tuned version of the PaddleOCR v5 Server Detection Model. It has been trained on a dataset of manga speech bubble crops to improve detection for:
This model outputs bounding boxes (polygons) for text regions. It does not perform text recognition; you will need a separate recognition model for that.
Note that this model is still being worked on, and may improve with a better dataset or hyperparameters.
The dataset consisted largely of synthetic data due to the limited real samples available.
This project was done with the usage of:
Base model
PaddlePaddle/PP-OCRv5_server_det