How are audio channels arranged in a .wav file?

Started by
13 comments, last by blueshogun96 8 years, 4 months ago

Okay, I've been given the task of writing a tool that gets the volume level of each audio channel within a .wav file with 4 channels. I don't know how the data is arranged (i.e. how the channels are laid out byte-wise), but I do know how to read a .wav file from scratch. Let's say the samples are 16-bit: do I read every fourth word to get the data for one channel?

And if you're thinking of saying "use .ogg instead", just know that I can't: the tool we're using here for my automation testing generates .wav files, and I have to build my automation around this and more, so I won't bother with that.

Thanks,

Shogun.

If you can use an audio library, use that to do the heavy lifting for you. Otherwise, take a look at the WAV format specification. It won't be as simple as reading every other byte, because not only are WAV files chunked, they could also be compressed.

What library would you recommend, fastcall? I understand how RIFF works; it's just the audio channel layout I don't understand. When I reach the data chunk, how do I separate the audio channels from the data?

Shogun.

According to the DirectSound Programming Guide, you can use the mmio* functions (mmioOpen, mmioRead; winmm.lib, desktop only) provided by Win32. And I suppose OpenAL would be a suitable cross-platform alternative.

Thanks fastcall, but not quite. The mmio API doesn't appear to let me select one channel to read from. I'm not going to be playing back any of these audio samples either. I'm just going to be reading the sound samples to average out the sound levels from each.

Shogun.

Inside the RIFF container there is a WAVE form (though in fact a RIFF file can also carry non-WAVE payloads, such as MP3 audio).

Assuming you have a WAVE, the sample data layout depends on the compression, bitrate, number of channels, etc.

http://soundfile.sapp.org/doc/WaveFormat/
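Since the chunks can appear in any order with other chunks (LIST, fact, cue) in between, you have to walk them rather than assume fixed offsets. A minimal sketch of that walk, assuming a little-endian host (the function name and layout here are my own, not from a library):

```c
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Each RIFF chunk starts with a 4-byte ID and a 32-bit little-endian size. */
typedef struct { char id[4]; uint32_t size; } ChunkHeader;

/* Scan the file for a chunk with the given FourCC (e.g. "fmt " or "data").
   Returns the file offset of the chunk's payload, or -1 if not found.
   Assumes the first 12 bytes are the "RIFF" <size> "WAVE" header. */
long find_chunk(FILE *f, const char *fourcc)
{
    ChunkHeader h;
    fseek(f, 12, SEEK_SET);                      /* skip RIFF/WAVE header */
    while (fread(&h, sizeof h, 1, f) == 1) {
        if (memcmp(h.id, fourcc, 4) == 0)
            return ftell(f);                     /* payload starts here   */
        fseek(f, (h.size + 1) & ~1u, SEEK_CUR);  /* chunks are word-aligned */
    }
    return -1;
}
```

Once `find_chunk(f, "fmt ")` succeeds you can read the format fields, then seek to `find_chunk(f, "data")` for the samples.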

The data is WAVE_FORMAT_EXTENSIBLE. I guess I have to understand this format first before moving forward.

Shogun.
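For reference, WAVE_FORMAT_EXTENSIBLE (tag 0xFFFE) just appends a few fields after the base fmt fields; the "real" format tag lives in the first two bytes of the SubFormat GUID. A sketch of those extra fields, with field names loosely following the Microsoft headers (note the on-disk extension is packed into 22 bytes, so read it field by field rather than fread-ing a struct):

```c
#include <stdint.h>

/* Extension fields that WAVE_FORMAT_EXTENSIBLE appends to the fmt chunk.
   For a 4-channel file, channel_mask tells you which speakers the four
   interleaved channels map to (quad is typically 0x33: front L/R + back L/R). */
typedef struct {
    uint16_t valid_bits_per_sample; /* often equal to bits_per_sample */
    uint32_t channel_mask;          /* speaker-position bitmask       */
    uint8_t  sub_format[16];        /* GUID; bytes 0-1 = real format tag */
} FmtExtension;

/* Pull the underlying format tag out of the SubFormat GUID.
   1 means the data chunk is plain PCM after all. */
uint16_t extensible_tag(const uint8_t sub_format[16])
{
    return (uint16_t)(sub_format[0] | (sub_format[1] << 8));
}
```

So an "extensible" file with a PCM SubFormat is read exactly like an ordinary PCM file; only the header grows.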

For PCM (almost all wav files), audio data is stored as an array of interleaved channels of whatever the sample size is (8, 16, 24 common, 32, 64, 32fp, 64fp possible). Integer values are signed little-endian.

e.g.

sample1L sample1R sample2L sample2R sample3L sample3R...


Okay, I had a feeling that was it. If that's the case, then I can write this tool easily and quickly. Thanks.

Shogun.

wav isn't always raw PCM, though. Make sure to check the format tag in the fmt chunk to see if you need to decode it first.
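That check can be a simple gate on the fmt chunk's format tag before treating the data chunk as raw samples. A sketch (the function name is mine; the tag values are the standard WAVE format tags):

```c
#include <stdint.h>

/* Return nonzero if the data chunk needs a decoder before the
   interleaved-sample reading described above will work.
   1 = PCM and 3 = IEEE float are directly readable; 0xFFFE
   (WAVE_FORMAT_EXTENSIBLE) defers to its SubFormat GUID, which
   should be inspected separately. */
int needs_decoder(uint16_t format_tag)
{
    switch (format_tag) {
    case 0x0001: /* WAVE_FORMAT_PCM */
    case 0x0003: /* WAVE_FORMAT_IEEE_FLOAT */
    case 0xFFFE: /* WAVE_FORMAT_EXTENSIBLE: check SubFormat too */
        return 0;
    default:     /* e.g. 0x0011 IMA ADPCM, 0x0055 MP3 */
        return 1;
    }
}
```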

void hurrrrrrrr() {__asm sub [ebp+4],5;}

There are ten kinds of people in this world: those who understand binary and those who don't.

This topic is closed to new replies.
