No examples in readme

by jadbox - opened Oct 24, 2024

Discussion

jadbox

Oct 24, 2024

Just an example or two would be ideal as I'm not sure if the model is rendering json and in what kind of schema.

pipilok

Oct 25, 2024

https://github.com/microsoft/OmniParser
https://microsoft.github.io/OmniParser/

JacobAsmuth

Oct 25, 2024

There's no thumbs down emoji, so here's my comment: @pipilok - :thumbsdown:

nmstoker

Oct 26, 2024

Thanks @pipilok

iiBLACKii

Oct 28, 2024

https://colab.research.google.com/drive/1ZU-cJtsaRHJZ4pi2NexQuajsUC6D6If7#scrollTo=wIx_8HzgvTAZ

ryytro

Nov 1, 2024

This comment has been hidden

franperic

Nov 1, 2024

@iiBLACKii - thank you for your notebook!
could you please elaborate on the blip2 model import?

Where does the icon_caption_blip2 model come from? Here on the huggingface repo are two bin files pytorch_model-00001-of-00002.bin and pytorch_model-00002-of-00002.bin.

In your notebook you are using this line:
caption_model_processor = get_caption_model_processor(model_name="blip2", model_name_or_path="/content/drive/MyDrive/OmniParser/icon_caption_blip2", device="cuda")
On my end that loads the salesforce blip2 model & looks locally for the icon_caption_blip2 model - thats where i get the error:
OSError: Incorrect path_or_model_id: '/Omniparser/icon_caption_blip2'. Please provide either the path to a local folder or the repo_id of a model on the Hub.

adamlu1

Nov 1, 2024

Hi @franperic , it seems the folder structure is incorrect. you can refer to the documentation: https://github.com/microsoft/OmniParser?tab=readme-ov-file#install, and see if it helps. Thanks.

bard1

Nov 3, 2024

This comment has been hidden

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment