-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Starting a recipe for ESC50 #1605
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
merge wip with train_nmf
|
@ycemsubakan can you comply with the new recipe testing? We need to be able to test this recipe as well :-) |
|
hey @TParcollet @mravanelli, just finished fixing the recipe testing. now everything is ready for review! 😃 |
anautsch
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR looks quite ready. I put some comments regarding docstring polishing. Lmk if I should go more nitty gritty - or if it is ok for your taste.
The recipe tests worked out just fine!
(1/6) Running test for ESC50_row_2...
... 56.49s
(2/6) Running test for ESC50_row_3...
... 27.63s
(3/6) Running test for ESC50_row_4...
... 9.81s
(4/6) Running test for ESC50_row_5...
... 16.64s
(5/6) Running test for ESC50_row_6...
... 13.01s
(6/6) Running test for ESC50_row_7...
... 12.71s
TEST PASSED
The README train calls run (fast epochs); the dataset is git-available. So, the data preparation works (which is outside of the recipe testing scope).
I like the +5,030 −0 changes :)
Three files were added to speechbrain/lobes/models. They contain docstring examples (which run as of PR workflows).
anautsch
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm
Starting a recipe for ESC50. Not ready to merge yet, but wanted to start the work!
I used the UrbanSound8k recipe as the base.
I am seeing the ecapa-tdnn network is able to get ~50 % accuracy on test, and ~60 % accuracy on valid, when trained on folds [1, 2, 3] and fold 4 is used validation set.