Removed NEST layer outputs and added Streaming Sortformer v2 weights

by GradientDescent2718 - opened 13 days ago

base: refs/heads/main

←

from: refs/pr/6

Discussion Files changed

+23076

-0

Removed NEST layer outputs and added Streaming Sortformer v2 weightscc6c7488

GradientDescent2718

13 days ago

•

edited 13 days ago

The NEST layer outputs used to be part of the model to be used as speaker embeddings, but testing revealed that they actually encoded the arrival order slot rather than the speaker identity, so they were removed.
I also added variants for StreamingSortformer v2 (the old one was v2.1), which performed better in DIHARD III and may work better outside of meeting environments (unconfirmed).

bweng changed pull request status to merged 13 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment