Skip to content

Conversation

@salah-zaiem
Copy link
Collaborator

Code for the best technique in the paper "Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study" : https://arxiv.org/abs/2303.06740, allowing for sequence downsampling during fine-tuning of SSL models. This leads to lower inference times with low performance drops.

@salah-zaiem salah-zaiem reopened this Mar 16, 2023
Copy link
Collaborator

@TParcollet TParcollet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks! See my comments.

from speechbrain.utils.distributed import run_on_main
from hyperpyyaml import load_hyperpyyaml
from pathlib import Path
from pyctcdecode import build_ctcdecoder
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it should be optional nop? Now this is mandatory to pip install pyctcdecode in order to use the CTC wav2vec...

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes it should be optional, will put the import later

Copy link
Collaborator

@TParcollet TParcollet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@TParcollet TParcollet merged commit 2f86201 into speechbrain:develop Mar 24, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants