Sign in to follow this  

XAudio2 - Size of FMT Chunk for ADPCM Files larger than WAVEFORMATEXTENSIBLE struct

Recommended Posts

I'm writing a parser for wave files in C++, designed to work with XAudio2.


I have no problems reading in PCM wave files. My parser extracts the chunk size for the format chunk of a PCM file as 16 bytes. I copy this data into a WAVEFORMATEX or WAVEFORMATEXTENSIBLE struct and it fills struct as I expect.


I used the "AdpcmEncode" utility that comes with the June 2010 DirectX SDK to encode my PCM file to ADPCM.


Now my parser extracts the size of the format chunk as 50 bytes.


I notice that the size of the WAVEFORMATEXTENSIBLE struct is 40 bytes. Is there something wrong with my parser here or is there another type of struct that I should be copying this data to?


The parser seems to be working OK as from the ADPCM file I extract wFormatTag = 2 (i.e. ADPCM encoding) and cbSize = 32 (i.e. 32 extra bytes + 18 bytes for the WAVEFORMATEX struct = 50 bytes total). I'm wondering why this chunk won't fit into a WAVEFORMATEXTENSIBLE struct and what the extra 10 bytes of data might be for.


Any help here will be much appreciated.

Share this post

Link to post
Share on other sites

Just trying to figure this out myself as I go . . . 


From -


Note  Data formats such as ADPCM that require more information than provided by WAVEFORMATEX have aWAVEFORMATEX structure as the first member in their format structures. When you create a source voice with one of those formats, cast the format's structure as a WAVEFORMATEX structure and use it as the value forpSourceFormat.


I assumed I'd be using WAVEFORMATEXTENSIBLE, but it doesn't actually say that here.


From -


In order to use ADPCM encoded data in XAudio2, you need to initialize a ADPCMWAVEFORMAT structure with ADPCM specific values, and pass it as an argument to IXAudio2::CreateSourceVoice when you create a source voice. For an example of loading and playing a sound in XAudio2, see How to: Play a Sound with XAudio2.


Getting closer. There doesn't appear to be an MSDN page for ADPCMWAVEFORMAT, however the Japanese site has:

Edited by pknight

Share this post

Link to post
Share on other sites

I think, that the additional 10 bytes are meta-information for the specific format appended to WAVEFORMATEXTENSIBLE, accordingly to this page. Quote:


If wFormatTag = WAVE_FORMAT_EXTENSIBLE, set cbSize to sizeof(WAVEFORMATEXTENSIBLE)-sizeof(WAVEFORMATEX) plus the size of any format-specific data that is appended to the WAVEFORMATEXTENSIBLE structure.



Share this post

Link to post
Share on other sites

Thanks for the reply Ashaman. I think that I have got to the bottom of what the format specific data is for ADPCM files and exactly what type of struct I need to fill. Here are my comments below in case it helps anyone else with the same question:
This is from the Japanese MSDN page, here
typedef struct adpcmwaveformat_tag {
WORD wSamplesPerBlock;
WORD wNumCoef;


(apologies for the formatting)


Now we have in this struct:


WAVEFORMATEX = 18 bytes, as usual

2 x WORD = 2 x 2 bytes


Then a variable size component which will be 7 x sizeof(ADPCMCOEFSET), which is 7 x 4 bytes, or 28 bytes. I got the 7 as per the msdn site's comment:



wNumCoef The number of entries in aCoef. Note (AdpcmEncode) ADPCM command line encoder always, seven entries are included in the array of data compression factor of the ADPCM data that is generated by.


So total size = 18 + 4 + 28 = 50 bytes, which is what my parser is pulling out of the ADPCM file.

Share this post

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

Sign in to follow this