Can it be used for transcribing streaming audio?

#20

by AB200 - opened Feb 29, 2024

Feb 29, 2024

Hi, I was looking for an ASR solution for streaming. While canary-1b and parakeet 1.1b are good for non-streaming use cases, their accuracy drops significantly when I use them for streaming. I understand this happens due to the lack of future context, and that the problem can be overcome with the audio buffer method. However, while I found documentation for the buffer method with the "stt_en_conformer_ctc_large" model, there are no such docs for these two models (canary-1b and parakeet 1.1b). They don't give any output with the current implementation of the buffer method.

So I want to know whether the Canary model can be used for streaming. If so, is there any documentation or example of that?

Fabian96

Jul 23, 2024

Any news on this?

JLouisBiz

Mar 10, 2025

Of course it can be used, but it is up to an external program to provide the buffering and run the transcription.
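
As a rough sketch of what such external buffering could look like: the snippet below only shows the windowing arithmetic of the buffer method mentioned above (each chunk is padded with some past and future samples, and only the central part is kept). It is not NeMo's API. `buffered_windows`, `transcribe_buffered`, and the `transcribe` callback are hypothetical names, and it assumes the model can return output restricted to a given sample range, which may not hold for every model.

```python
from typing import Callable, Iterator, List, Sequence, Tuple


def buffered_windows(
    samples: Sequence[float],
    chunk_len: int,
    left_ctx: int,
    right_ctx: int,
) -> Iterator[Tuple[Sequence[float], int, int]]:
    """Yield (window, keep_start, keep_end) for each chunk of `samples`.

    `window` is the chunk plus up to `left_ctx` past and `right_ctx` future
    samples; [keep_start, keep_end) marks the chunk itself inside the window.
    """
    if chunk_len <= 0 or left_ctx < 0 or right_ctx < 0:
        raise ValueError("chunk_len must be > 0 and contexts must be >= 0")
    total = len(samples)
    for start in range(0, total, chunk_len):
        end = min(start + chunk_len, total)
        win_start = max(0, start - left_ctx)    # clip at the start of the audio
        win_end = min(total, end + right_ctx)   # clip at the end of the audio
        yield samples[win_start:win_end], start - win_start, end - win_start


def transcribe_buffered(
    samples: Sequence[float],
    transcribe: Callable[[Sequence[float], int, int], str],
    chunk_len: int,
    left_ctx: int,
    right_ctx: int,
) -> List[str]:
    """Call the (hypothetical) `transcribe` callback once per buffered window.

    The callback receives the window and the sample range to keep, and is
    expected to return only the text that falls inside that range.
    """
    return [
        transcribe(window, keep_start, keep_end)
        for window, keep_start, keep_end in buffered_windows(
            samples, chunk_len, left_ctx, right_ctx
        )
    ]
```

In a live setting the future context adds latency, since a chunk can only be decoded once `right_ctx` further samples have arrived, so the context sizes are a trade-off between accuracy and delay.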

riccardodemaria

Apr 10

@AB200 how did you end up implementing this?

AB200

Apr 11

We didn't. As far as I remember, we ended up using a different ASR model (Whisper large) for the live transcription.

AB200 changed discussion status to closed Apr 11
