Fix torchcodec audio decoding to respect 'num_channels' by AsymptotaX · Pull Request #8028 · huggingface/datasets

AsymptotaX · 2026-02-27T10:28:46Z

Fixes torchcodec audio decoding when num_channels is set on Audio.

Before this change, AudioDecoder["array"] reduced multi-channel audio to mono by averaging channels, so the requested channel behavior was not respected.

With this PR:

multi-channel decoded arrays are preserved by default;
mono output is returned only when num_channels == 1 is explicitly requested.

Previous behavior

ds_stereo = ds.cast_column("audio", Audio(num_channels={...})) - None, 1, 2

Original file: '(16000, 2)' - stereo ✓
'Audio(num_channels=None)': '(16000,)' - mono ✗
'Audio(num_channels=2)': '(16000,)' - mono ✗
'Audio(num_channels=1)': '(16000,)' - mono ✓

New behavior

'num_channels=None' preserves the original number of channels from the source file.
'num_channels=2' preserves/converts to stereo output with shape '(2, num_samples)'.
'num_channels=1' downmixes to mono with shape '(num_samples,)'.

Results

Original file shape (via soundfile): (16000, 2)
HF datasets shape with num_channels=None: (2, 16000)
HF datasets shape with num_channels=1: (16000,)
HF datasets shape with num_channels=2: (2, 16000)

Fixes #8005.

…#8005)

HuggingFaceDocBuilderDev · 2026-02-27T23:19:34Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

lhoestq

Hi ! I believe ["array"] is used a lot in transformers and has been expecting mono for a long time - from before we switched to torchcodec. So we might need to keep mono as default for ["array"] unless num_channels is specified explicitly. Would it be ok for you ?

I see in your tests that you expect stereo even if num_channels is not specified in Audio()

tests/features/test_audio.py

huggingface#8005)

Fix torchcodec AudioDecoder array shape for num_channels (huggingface…

aaffd3d

…#8005)

lhoestq reviewed Feb 27, 2026

View reviewed changes

tests/features/test_audio.py Outdated Show resolved Hide resolved

AsymptotaX added 2 commits February 28, 2026 11:29

Merge branch 'main' into fix/issue-8005-audio-num-channels

a640f1a

Fix AudioDecoder default mono behavior and honor explicit num_channels (

eaea167

huggingface#8005)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix torchcodec audio decoding to respect 'num_channels'#8028

Fix torchcodec audio decoding to respect 'num_channels'#8028
AsymptotaX wants to merge 3 commits intohuggingface:mainfrom
AsymptotaX:fix/issue-8005-audio-num-channels

AsymptotaX commented Feb 27, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Feb 27, 2026

Uh oh!

lhoestq left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AsymptotaX commented Feb 27, 2026

Previous behavior

New behavior

Results

Uh oh!

HuggingFaceDocBuilderDev commented Feb 27, 2026

Uh oh!

lhoestq left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants