add LinearWarmupScheduler #1537
Conversation
return self.value_at_epoch[old_index], self.value_at_epoch[index]
...
class LinearWarmupScheduler:
Hi and thanks! Is this resumable? I see a "current_step": shouldn't the scheduler be saved as well in case of resuming an experiment? This can be easily done with hooks (see the other schedulers with states). What do you think?
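For illustration of what would need to be persisted: essentially just the step counter. A minimal sketch, assuming a hypothetical `current_step` attribute and plain `torch.save`/`torch.load` rather than SpeechBrain's actual checkpoint-hook mechanism:

```python
import torch

# Illustration only: `current_step` is assumed from the comment above;
# this is not the SpeechBrain checkpoint-hook API itself.
def save_scheduler_state(scheduler, path):
    # Persist the counter so a resumed run continues the warmup where it stopped.
    torch.save({"current_step": scheduler.current_step}, path)

def load_scheduler_state(scheduler, path):
    scheduler.current_step = torch.load(path)["current_step"]
```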
I agree. On a side note, StepScheduler also does not have hooks; we should fix that in a separate PR.
Hi, this is a very good question. TBH, I am not very familiar with the concept of hooks. But I will take a look at how other schedulers are implemented.
I have added the checkpoint hooks. Please take a look at it.
Huge thanks!
I notice the design is quite different from PyTorch's native schedulers, which have a step() function as well as load_state_dict() and state_dict() functions. We also ended up changing the interface a bit, as I wanted something where you could step on both minibatches and epochs. [In our case it's not part of a unified interface, though, because for now, for flexibility in early development, our model is to put most of the complexity in local scripts rather than in any central place.]
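For context, a rough sketch of that kind of interface, illustrative only (the class name, parameters, and decay formula are assumptions, not this PR's or any project's actual code): a scheduler that can be stepped per minibatch and per epoch and is checkpointable via torch-style state_dict()/load_state_dict():

```python
# Illustrative sketch only; names and the decay formula are assumptions.
class WarmupDecaySketch:
    def __init__(self, optimizer, warmup_batches=1000, epoch_decay=0.9):
        self.optimizer = optimizer
        self.warmup_batches = warmup_batches
        self.epoch_decay = epoch_decay
        self.batch = 0
        self.epoch = 0
        self.base_lrs = [g["lr"] for g in optimizer.param_groups]

    def step_batch(self):
        # Called once per minibatch; drives the warmup.
        self.batch += 1
        self._set_lr()

    def step_epoch(self):
        # Called once per epoch; drives the decay.
        self.epoch += 1
        self._set_lr()

    def _set_lr(self):
        warmup = min(1.0, self.batch / max(1, self.warmup_batches))
        decay = self.epoch_decay ** self.epoch
        for group, base_lr in zip(self.optimizer.param_groups, self.base_lrs):
            group["lr"] = base_lr * warmup * decay

    def state_dict(self):
        # Exclude the optimizer itself, as torch schedulers do.
        return {k: v for k, v in self.__dict__.items() if k != "optimizer"}

    def load_state_dict(self, state_dict):
        self.__dict__.update(state_dict)
```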
Agreed, torch schedulers are a bit rigid, although you can use them natively with SB as well. As you may have seen, we follow the opposite direction for now: more central places and less complexity in local scripts. I guess it's a balance between how much maintenance a coordinated team can provide (central) and how much you wish to rely on the community for that (local scripts). At least, this is a personal opinion; I find it hard to properly maintain recipes as they tend to grow way too rapidly in number :p
Hm yes, for now we are aiming to get the best possible WER with reasonable latency before we add lots of recipes; at a later time we might consider centralizing things a bit. I figure if people really need recipes that work for a specific dataset, they can always get them from speechbrain or ESPNet.
The numbers you get with Transducers are really impressive; I really hope we soon get enough resources to put someone on this full-time. The last intern who tried did not succeed, but he had other things to do as well (that was the PR where he tried your nice pruned transducer loss).
Create a schedule with a learning rate that decreases linearly from the initial lr set in the optimizer to 0, after a warmup period during which it increases linearly from 0 to the initial lr set in the optimizer.
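As a worked illustration of that description, here is the multiplicative factor such a schedule applies to the initial lr; the parameter names `warmup_steps` and `total_steps` are placeholders, not necessarily those of the PR:

```python
# Sketch of the schedule described above; parameter names are illustrative.
def linear_warmup_then_decay(current_step: int, warmup_steps: int, total_steps: int) -> float:
    """Multiplicative factor applied to the initial lr at `current_step`."""
    if current_step < warmup_steps:
        # Warmup: factor rises linearly from 0 to 1.
        return current_step / max(1, warmup_steps)
    # Decay: factor falls linearly from 1 back to 0 at `total_steps`.
    return max(0.0, (total_steps - current_step) / max(1, total_steps - warmup_steps))

# Example with warmup_steps=4, total_steps=10:
# steps 0..4 give 0.0, 0.25, 0.5, 0.75, 1.0; steps 5..10 then decrease to 0.0.
```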