Spaces:

natasa365
/

whisper.cpp

Sleeping

App Files Files Community

irbull commited on Feb 10, 2024

Commit

c276f12

unverified ·

1 Parent(s): 5cffd6f

metal : use autoreleasepool to avoid memory leaks (llama/5437)

Browse files

There appears to be a known memory leak when using the
`MLTCommandBuffer`. It is suggested to use `@autoreleasepool` in
[1,2]

[1] https://developer.apple.com/forums/thread/662721
[2] https://forums.developer.apple.com/forums/thread/120931

This change-set wraps the `ggml_metal_graph_compute` in a
`@autoreleasepool`.

This commit addresses https://github.com/ggerganov/llama.cpp/issues/5436

Files changed (1) hide show

ggml-metal.m +2 -0

ggml-metal.m CHANGED Viewed

@@ -696,6 +696,7 @@ static bool ggml_metal_graph_compute(
         struct ggml_metal_context * ctx,
                struct ggml_cgraph * gf) {
     MTLComputePassDescriptor * edesc = MTLComputePassDescriptor.computePassDescriptor;
     edesc.dispatchType = MTLDispatchTypeSerial;
@@ -2281,6 +2282,7 @@ static bool ggml_metal_graph_compute(
         [[MTLCaptureManager sharedCaptureManager] stopCapture];
     }
     return true;
 }

         struct ggml_metal_context * ctx,
                struct ggml_cgraph * gf) {
+    @autoreleasepool {
     MTLComputePassDescriptor * edesc = MTLComputePassDescriptor.computePassDescriptor;
     edesc.dispatchType = MTLDispatchTypeSerial;
         [[MTLCaptureManager sharedCaptureManager] stopCapture];
     }
+    }
     return true;
 }