Bump torchaudio version + add bytes #2821

Adel-Moumen · 2025-02-10T20:23:29Z

What does this PR do?

This PR aims at "fixing" the issue #2820 brought by PR #2781. In the latter PR, I introduced a backward incompatible change to torchaudio.load. Indeed, specifying the backend to the load function only came in torchaudio 2.1.0. It was also available in 2.0.0 and 2.0.1 but at the cost of enabling the flag export TORCHAUDIO_USE_BACKEND_DISPATCHER=1 before importing torchaudio.

After working on a workaround, I found it slower as I had to check at each read_audio call the torchaudio function, and check if there was an argument backend to the audio function. I think, it safer to upgrade our pytorch requirements to 2.1.0 for both: torch and torchaudio. PyTorch is now at version 2.6.0 and I think it should be safe for us to slowly increase the requirements as the novelties comes to make sure that the toolkit is still working. To be honest, I don't have any clues if there's still people using speechbrain with torch<2.0.0, or if speechbrain is still actually working with a very old version of torch.

Also, with torch==2.0.0 the torch.compile was immature with many bugs (sorry PyTorch team, you made a lot of nice progress), and so latter versions of pytorch are getting better and better at handling complex cases of computation graph. So, I would be in favour of slowly increasing our torch requirements, and in this case, bumping to torch>=2.1.0.

Any comments/feedbacks @TParcollet @mravanelli, and/or @pplantinga ? :)

Before submitting

Did you read the contributor guideline?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you make sure to update the documentation with your changes? (if necessary)
Did you write any new necessary tests? (not for typos and docs)
Did you verify new and existing tests pass locally with your changes?
Did you list all the breaking changes introduced by this pull request?
Does your code adhere to project-specific code style and conventions?

PR review

Reviewer checklist

Is this pull request ready for review? (if not, please submit in draft mode)
Check that all items from Before submitting are resolved
Make sure the title is self-explanatory and the description concisely explains the PR
Add labels and milestones (and optionally projects) to the PR so it can be classified
Confirm that the changes adhere to compatibility requirements (e.g., Python version, platform)
Review the self-review checklist to ensure the code is ready for review

TParcollet · 2025-02-10T21:39:51Z

I vote for the bump, it will help for rope as well.

pplantinga

I'm good with the version bump, minor comments can be addressed or not

speechbrain/utils/torch_audio_backend.py

pplantinga · 2025-02-10T21:45:05Z

speechbrain/dataio/dataio.py

    if backend not in [None, "ffmpeg", "sox", "soundfile"]:
        raise ValueError(
-            "backend must be one of 'ffmpeg', 'sox', 'soundfile' or None"
+            "backend must be one of 'ffmpeg', 'sox', 'soundfile' or None",
+            "Available backends on your system: ",
+            torchaudio.list_audio_backends(),
        )


Not a huge deal but this code seems repeated from above, any way to combine (e.g. unified audio backend checker function)

TParcollet · 2025-02-11T10:03:43Z

@Adel-Moumen could you fix the tests?

Co-authored-by: Peter Plantinga <plantinga.peter@proton.me>

TParcollet

LGTM

Adel-Moumen added 5 commits February 10, 2025 19:59

fix backend when not avail in torchaudio.load

5641102

typo

74edcfa

remove safe call

1a61e63

bump torch versions

cfa4533

Update torch_audio_backend.py

26d78c5

Adel-Moumen changed the title ~~fix backend when not avail in torchaudio.load~~ Bump torchaudio version + add bytes Feb 10, 2025

pplantinga approved these changes Feb 10, 2025

View reviewed changes

Adel-Moumen and others added 5 commits February 11, 2025 10:08

Update speechbrain/utils/torch_audio_backend.py

6a08141

Co-authored-by: Peter Plantinga <plantinga.peter@proton.me>

add docstring + helper function + bytes case

ba3cdf3

fix pre-commit

51bab8d

pre-commit

8ad4959

remove example

9c47d97

Adel-Moumen marked this pull request as ready for review February 11, 2025 10:52

Adel-Moumen requested a review from TParcollet February 11, 2025 11:04

Adel-Moumen self-assigned this Feb 11, 2025

TParcollet approved these changes Feb 11, 2025

View reviewed changes

TParcollet merged commit c436f61 into speechbrain:develop Feb 11, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bump torchaudio version + add bytes #2821

Bump torchaudio version + add bytes #2821

Uh oh!

Adel-Moumen commented Feb 10, 2025 •

edited

Loading

Uh oh!

TParcollet commented Feb 10, 2025

Uh oh!

pplantinga left a comment

Uh oh!

Uh oh!

pplantinga Feb 10, 2025

Uh oh!

TParcollet commented Feb 11, 2025

Uh oh!

TParcollet left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Bump torchaudio version + add bytes #2821

Bump torchaudio version + add bytes #2821

Uh oh!

Conversation

Adel-Moumen commented Feb 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

PR review

Uh oh!

TParcollet commented Feb 10, 2025

Uh oh!

pplantinga left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

pplantinga Feb 10, 2025

Choose a reason for hiding this comment

Uh oh!

TParcollet commented Feb 11, 2025

Uh oh!

TParcollet left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Adel-Moumen commented Feb 10, 2025 •

edited

Loading