Instructions to use Salesforce/blip-image-captioning-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Salesforce/blip-image-captioning-base with Transformers:
# Use a pipeline as a high-level helper
# Warning: Pipeline type "image-to-text" is no longer supported in transformers v5.
# You must load the model directly (see below) or downgrade to v4.x with:
# 'pip install "transformers<5.0.0"'
from transformers import pipeline

pipe = pipeline("image-to-text", model="Salesforce/blip-image-captioning-base")

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
model = AutoModelForImageTextToText.from_pretrained("Salesforce/blip-image-captioning-base")

- Notebooks
- Google Colab
- Kaggle
Is it possible to upload multiple images at once and generate a list of captions for all of them, for example 10 images?
#25
by Kinglyz - opened
Because I have broken a video down into separate frames, and I would like to put all those images in
You can probably write a Python script that takes these frames and processes them inside a for loop.
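A rough sketch of that loop, in case it helps. It assumes the frames are already saved as `.jpg` files in one folder, and it passes them to the model in groups of 10 instead of one at a time. The helper names `batched` and `caption_frames`, the folder argument, the batch size, and the `max_new_tokens=30` cap are all made up for illustration. It loads the model with the `BlipProcessor` and `BlipForConditionalGeneration` classes from Transformers, which is one way to do it and not necessarily the only one.

```python
from pathlib import Path
from typing import Dict, Iterator, List, Sequence


def batched(items: Sequence, batch_size: int) -> Iterator[List]:
    """Yield consecutive chunks of at most `batch_size` items (hypothetical helper)."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])


def caption_frames(frame_dir: str, batch_size: int = 10) -> Dict[str, str]:
    """Caption every .jpg in `frame_dir`; returns {file name: caption}."""
    # Heavy imports are kept local so `batched` can be used without them.
    import torch
    from PIL import Image
    from transformers import BlipForConditionalGeneration, BlipProcessor

    model_id = "Salesforce/blip-image-captioning-base"
    processor = BlipProcessor.from_pretrained(model_id)
    model = BlipForConditionalGeneration.from_pretrained(model_id)
    model.eval()

    # Assumption: frames are .jpg files whose sorted names follow video order.
    paths = sorted(Path(frame_dir).glob("*.jpg"))
    captions: Dict[str, str] = {}

    for batch in batched(paths, batch_size):
        images = [Image.open(p).convert("RGB") for p in batch]
        # The processor accepts a list of images and stacks them into one tensor.
        inputs = processor(images=images, return_tensors="pt")
        with torch.no_grad():
            output_ids = model.generate(**inputs, max_new_tokens=30)
        texts = processor.batch_decode(output_ids, skip_special_tokens=True)
        for path, text in zip(batch, texts):
            captions[path.name] = text.strip()

    return captions


if __name__ == "__main__":
    import sys

    # Usage: python caption_frames.py path/to/frames
    if len(sys.argv) > 1:
        for name, caption in caption_frames(sys.argv[1]).items():
            print(f"{name}: {caption}")
```

Batching is optional: setting `batch_size=1` gives the plain one-image-per-iteration loop, while larger batches are usually faster but need more memory.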