Starting a recipe for ESC50 #1605

ycemsubakan · 2022-10-15T21:58:07Z

Starting a recipe for ESC50. Not ready to merge yet, but wanted to start the work!

I used the UrbanSound8k recipe as the base.

I am seeing that the ECAPA-TDNN network reaches ~50% accuracy on test and ~60% accuracy on valid when trained on folds [1, 2, 3] with fold 4 used as the validation set.
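For readers unfamiliar with fold-based splits, the partition described above can be sketched roughly as follows. This is a minimal illustration, not the code in `esc50_prepare.py`: the helper `split_by_fold` and the toy file names are hypothetical, and using fold 5 as the test set is an assumption (the comment above only names the training and validation folds).

```python
# Hypothetical helper for illustration; not taken from the recipe.
def split_by_fold(rows, train_folds=(1, 2, 3), valid_folds=(4,), test_folds=(5,)):
    """Partition metadata rows into train/valid/test by their 'fold' field."""
    splits = {"train": [], "valid": [], "test": []}
    for row in rows:
        fold = int(row["fold"])
        if fold in train_folds:
            splits["train"].append(row)
        elif fold in valid_folds:
            splits["valid"].append(row)
        elif fold in test_folds:  # assumption: fold 5 is held out for testing
            splits["test"].append(row)
    return splits


# Toy rows standing in for dataset metadata entries (file names are made up).
rows = [{"filename": f"clip_{i}.wav", "fold": str(1 + i % 5)} for i in range(10)]
splits = split_by_fold(rows)
print({name: len(items) for name, items in splits.items()})
```

Keeping the fold lists as arguments makes it easy to rotate the folds for cross-validation instead of fixing a single split.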

TParcollet · 2023-03-15T09:55:55Z

@ycemsubakan can you comply with the new recipe testing? We need to be able to test this recipe as well :-)

fpaissan · 2023-03-18T22:42:51Z

hey @TParcollet @mravanelli, just finished fixing the recipe testing. now everything is ready for review! 😃

recipes/ESC50/classification/confusion_matrix_fig.py

anautsch

This PR looks quite ready. I put some comments regarding docstring polishing. Lmk if I should go more nitty gritty - or if it is ok for your taste.

The recipe tests worked out just fine!

(1/6) Running test for ESC50_row_2...
	... 56.49s
(2/6) Running test for ESC50_row_3...
	... 27.63s
(3/6) Running test for ESC50_row_4...
	... 9.81s
(4/6) Running test for ESC50_row_5...
	... 16.64s
(5/6) Running test for ESC50_row_6...
	... 13.01s
(6/6) Running test for ESC50_row_7...
	... 12.71s
TEST PASSED

The training calls from the README run (with fast epochs), and the dataset is available through git. So the data preparation works too (which is outside the scope of recipe testing).

I like the +5,030 −0 changes :)

Three files were added to speechbrain/lobes/models. They contain docstring examples (which run as part of the PR workflows).
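As a rough illustration of what a runnable docstring example looks like, here is a minimal sketch using Python's standard `doctest` module. The function `scale` is hypothetical and unrelated to the files in this PR, and the way the project's workflows actually invoke these examples is not shown here.

```python
import doctest


def scale(values, factor):
    """Multiply every value by a factor (hypothetical example function).

    Example
    -------
    >>> scale([1, 2, 3], 2)
    [2, 4, 6]
    """
    return [v * factor for v in values]


# Collect the examples found in the docstring and execute them.
finder = doctest.DocTestFinder()
runner = doctest.DocTestRunner(verbose=False)
for test in finder.find(scale, name="scale", module=False, globs={"scale": scale}):
    runner.run(test)

# TestResults(failed=..., attempted=...) summarizes the run.
results = runner.summarize(verbose=False)
print(results.failed, results.attempted)
```

A failing example shows up as a non-zero `failed` count, which is what lets an automated check reject a docstring that has drifted from the code.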

recipes/ESC50/classification/esc50_prepare.py

recipes/ESC50/interpret/esc50_prepare.py

recipes/ESC50/interpret/hparams/l2i_cnn14.yaml

recipes/ESC50/interpret/train_nmf.py

speechbrain/lobes/models/L2I.py

recipes/ESC50/esc50_prepare.py

speechbrain/lobes/models/L2I.py

speechbrain/lobes/models/PIQ.py

anautsch

lgtm

ycemsubakan added 2 commits October 15, 2022 17:33

starting a recipe for ESC50

4a37ba8

cosmetic updates

a55169f

ycemsubakan requested a review from fpaissan October 15, 2022 21:58

ycemsubakan and others added 27 commits October 16, 2022 00:32

added the model from kumar et al. and implemented pretraining

1291b49

trying to replicate

018db06

adding custom_models.py

65cb77b

implemented loading of the SSL-pretrained model

302955d

wip - psi net

053f0f0

add asserts on hidden layers shapes

8647d26

wip - need new nmf decoder

a793af0

wip - l_nmf diverging

90c70b7

wip

c4721de

need device fix

b7c5ba4

supports gpu now -- unit testing time activations

46e044d

wip - NMF training with speechbrain's data loading

58ddefb

Merge branch 'esc50' of github.com:ycemsubakan/speechbrain-1 into esc50

4d06e2e

implemented theta and fidelity loss

3bc0113

pushing classifier to device

3cd7efb

wip - interpretation reconstruction

809f6e1

new interpretation generation

395dd8f

few changes to keep things in torch

7c51811

transposed conv. psi

61c3221

fix the broken code due to permutes

bb642af

updated the nmf training script

b0282c0

using our own nmf decoder

d4c5d95

wip - feature extraction refactoring

4e29a27

Merge branch 'esc50' of github.com:ycemsubakan/speechbrain-1 into esc50

9f2c7dd

merge wip with train_nmf

wip - speechbrain preprocessing

c034818

removed some useless lines

a343767

new pre-processing in compute_obj and loss_fdi bug fix

5141f8c

fpaissan added 2 commits March 14, 2023 10:06

Fixed docstring tests

7920675

Merge branch 'develop' of https://github.com/speechbrain/speechbrain …

53c8b4a

…into esc50

fpaissan added 9 commits March 17, 2023 20:53

starting recipe testing

fa16535

recipe testing

b4ccd88

passing recipe test for cnn14

6b6adaf

add use_pretrained in class

0159687

recipe testing

e01d84b

From pretrained as int not string

26e10da

remove wrong links

043855c

made examples smaller

6cc7c03

cosmetic

798e62d

anautsch reviewed Mar 20, 2023

recipes/ESC50/classification/confusion_matrix_fig.py

anautsch reviewed Mar 20, 2023

fpaissan added 4 commits March 20, 2023 16:49

Addressing comments

48169b5

Addressing comments

39940e7

fixed cnn14 hyperparams

c285433

fixed credits

db11f86