Fixing shuffle argument for distributed sampler in core.py #1518

RuABraun · 2022-07-26T17:55:36Z

Currently shuffling of the sample indices is not done before distributing the indices across nodes, this means the distribution will be the same for each epoch. This causes a small but noticeable decrease in performance.

Fixes #1495

I don't think there's ever a reason to have it set to False, therefore hardcoded to True.

I also took the liberty of adding a tiny bit of extra info to why DistributedSamplerWrapper needs to exist.

TParcollet

LGTM!

TParcollet · 2022-07-26T18:02:11Z

If the tests pass, we can merge @RuABraun

Fixing shuffle argument for distributed sampler in core.py

6bc72af

RuABraun mentioned this pull request Jul 26, 2022

wav2vec2 pretraining implemented with speechbrain #1312

Merged

TParcollet approved these changes Jul 26, 2022

View reviewed changes

mravanelli merged commit 40fd44a into speechbrain:develop Jul 26, 2022

asumagic mentioned this pull request Nov 25, 2022

[Bug]: sorting does not works in ASR train #1722

Closed

anautsch mentioned this pull request Nov 28, 2022

fix sorting bug #1730

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fixing shuffle argument for distributed sampler in core.py #1518

Fixing shuffle argument for distributed sampler in core.py #1518

RuABraun commented Jul 26, 2022

Uh oh!

TParcollet left a comment

Uh oh!

TParcollet commented Jul 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fixing shuffle argument for distributed sampler in core.py #1518

Fixing shuffle argument for distributed sampler in core.py #1518

Conversation

RuABraun commented Jul 26, 2022

Uh oh!

TParcollet left a comment

Choose a reason for hiding this comment

Uh oh!

TParcollet commented Jul 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants