Reduce VRAM consumption by swapping `cuda()` and `to(torch.bfloat16)`

#2
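Calling `.cuda()` on the full-precision model first copies the fp32 weights (4 bytes per parameter) onto the GPU before the cast, so peak VRAM during loading is roughly the size of the fp32 model. Casting to `torch.bfloat16` while the weights are still in CPU RAM means only the half-size (2 bytes per parameter) copy is ever transferred. A minimal sketch of the effect, using a toy `nn.Sequential` as a hypothetical stand-in for the model that app.py actually loads:

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the OCR model; the real model is far larger,
# which is where the ordering of the two calls actually matters.
model = nn.Sequential(*[nn.Linear(4096, 4096) for _ in range(8)])

# Old order: fp32 weights land on the GPU first and are cast there,
# so peak VRAM at load time is roughly the full fp32 model size.
# model_gpu = model.cuda().to(torch.bfloat16)

# New order: the bf16 cast happens in CPU RAM, and only the half-size
# weights are transferred, roughly halving peak VRAM at load time.
model_gpu = model.to(torch.bfloat16).cuda()  # requires a CUDA device
```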
Files changed (1)
  1. app.py +1 -1
app.py CHANGED
@@ -43,7 +43,7 @@ def process_ocr_task(image, model_size, task_type, ref_text):
   return "Please upload an image first.", None
 
   print("🚀 Moving model to GPU...")
-  model_gpu = model.cuda().to(torch.bfloat16)
+  model_gpu = model.to(torch.bfloat16).cuda()
   print("✅ Model is on GPU.")
 
   with tempfile.TemporaryDirectory() as output_path: