Downsampling #1888
Conversation
TParcollet
left a comment
Thanks! See my comments.
recipes/LibriSpeech/ASR/CTC/hparams/downsampled/train_hf_wavlm_average_downsampling.yaml
Quick fix.
from speechbrain.utils.distributed import run_on_main
from hyperpyyaml import load_hyperpyyaml
from pathlib import Path
from pyctcdecode import build_ctcdecoder
It should be optional, no? Right now it is mandatory to pip install pyctcdecode in order to use the CTC wav2vec recipe...
Yes, it should be optional; I will move the import later.
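As a minimal sketch (not the actual change in this PR), one common way to make such a dependency optional is to defer the import until decoding is requested; the helper name below is illustrative:

```python
# Hypothetical sketch: defer the pyctcdecode import so the recipe only
# requires the package when beam-search decoding is actually used.
def build_decoder(labels, kenlm_path=None):
    try:
        from pyctcdecode import build_ctcdecoder
    except ImportError as err:
        raise ImportError(
            "pyctcdecode is required for beam-search decoding: "
            "pip install pyctcdecode"
        ) from err
    return build_ctcdecoder(labels, kenlm_model_path=kenlm_path)
```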
TParcollet
left a comment
LGTM
Code for the best technique in the paper "Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study" (https://arxiv.org/abs/2303.06740), allowing for sequence downsampling during fine-tuning of SSL models. This leads to lower inference times with only small performance drops.
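To illustrate the general idea behind the average-downsampling variant (a sketch of the concept, not the exact SpeechBrain implementation; the module and parameter names are made up), consecutive frame-level SSL features can be averaged so that the layers and the CTC head that follow operate on a shorter sequence:

```python
import torch

# Hypothetical sketch of average downsampling: shorten the SSL feature
# sequence by averaging every `factor` consecutive frames.
class AverageDownsampler(torch.nn.Module):
    def __init__(self, factor: int = 2):
        super().__init__()
        self.pool = torch.nn.AvgPool1d(kernel_size=factor, stride=factor)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (batch, time, features) -> average-pool along the time axis
        return self.pool(feats.transpose(1, 2)).transpose(1, 2)

# Usage: the downsampled sequence is roughly `factor` times shorter in time,
# which reduces the cost of everything downstream (including decoding).
ssl_out = torch.randn(4, 200, 768)            # e.g. WavLM frame-level features
shorter = AverageDownsampler(factor=2)(ssl_out)
print(shorter.shape)                          # torch.Size([4, 100, 768])
```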