ADPCM support #41

thomcc · 2021-07-09T18:36:13Z

I'd like to use Symphonia with some WAV files containing ADPCM data. Specifically, I'd like to support the "Microsoft ADPCM" (RIFF format 0x02¹) and "IMA/DVI ADPCM" (RIFF format 0x11) variants. There are also possibly a couple extensions² that I need to support.

Just these two account for most of the ADPCM I've encounted in the wild, and to some extent rounds out the set of codecs that you most commonly find in WAV files, IME. The various other variants use can of course be added later when/if needed — I think there are a couple that likely will be useful (like QuickTime's "IMA4 ADPCM").

Anyway, I'd be willing to provide the code for those two (I have some parsing code lying around for them, and encoding is easy enough as well).

The reason I'm filing an issue first (rather than just a PR) is:

To make sure you'd accept it.
I am unsure where they should live. Or rather, I suspect the answer is "A new symphonia-codec-adpcm crate, but it seems worth asking about this first.

I think there's a strong possibility you considered the existence of ADPCM when writing the WAV code, and possibly have thoughts on where it should live, but I could be wrong
So I have somewhere to ask questions when I inevitably get lost due to my vague-at-best understanding of how Symphonia is structured 😅

One final note is that while libavcodec (codec library behind ffmpeg)'s support for ADPCM is pretty thorough, ffmpeg's usage of it used to introduce many strange bugs. So, I'm a little concerned about testing against it as with symphonia-check... I guess we'll cross that it it becomes an issue, though, since it may have been fixed in the 3 years since I tried last.

(footnotes)

I'm specifying the format ID because pretty much every name you could use to describe the different flavors of ADPCM is somewhat ambiguous. There are at least 3 codecs reasonably called "DVI ADPCM" and two reasonably called "Microsoft ADPCM", and these have overlap (it's kind of a disaster).
Specifically, the two important extensions are:
- Non-hardcoded block sizes, as documented https://docs.microsoft.com/en-us/windows/win32/xaudio2/adpcm-overview.
- Arbitrary channel counts — I believe think the format inside WAV only supports 2 channels, according to some documentation. In practice I've seen upwards of 8, which decoded fine in windows and on macOS (using AVFoundation.
Note that I'm unsure that these are actually extensions.

The text was updated successfully, but these errors were encountered:

pdeljanov · 2021-07-09T23:51:22Z

To make sure you'd accept it.

Yep.

Generally speaking, even if a format or codec is not on the "roadmap" I'll accept it as long as it works and the code is maintainable.

I think ADPCM support would be a great addition.

I am unsure where they should live. Or rather, I suspect the answer is "A new symphonia-codec-adpcm crate, but it seems worth asking about this first.
I think there's a strong possibility you considered the existence of ADPCM when writing the WAV code, and possibly have thoughts on where it should live, but I could be wrong

You're correct.

I think we should keep ADPCM and PCM separate. To that end, I think you should just copy symphonia-codec-pcm and use it as a starting point for symphonia-codec-adpcm. PcmDecoder is a very good example of a single Decoder that can decode multiple codecs (you'll likely want to consider the different variants to be different codecs).

You'll also need to extend the WavReader to support the ADPCM format. This would involve a few things:

Add support for the ADPCM format in the ext and fmt chunks.
As per the Microsoft docs, it looks like the sample data is stored in the data chunk in discrete blocks with a known (calculatable) size. This lends itself well to packetization. Currently, the WavReader only supports PCM formats which is just a continuous stream of audio frames with no packet boundaries. Therefore, the reader simulates packets by chunking the PCM data into blocks of 1152 audio frames (same as MP3) each. So you'll have to find a way to support both the PCM case where we simulate packets, and the ADPCM case where we have actual packets (blocks).
Seeking would also need to updated to support the ADPCM case.

I actually think most of your work will end up being in WavReader rather than the codec.

Other than that, let me brain dump a few other things you'll have to do to tie everything together:

Update the Cargo.toml for thesymphonia crate to add a adpcm feature. This can be enabled by default if you're reasonably certain the decoder will not infringe on any patents. I imagine it won't.
In the symphonia crate, re-export the AdpcmDecoder here.
Also in the symphonia crate, register the AdpcmDecoder into the default CodecRegistry here.
In symphonia-core add new CODEC_TYPES for all of your ADPCM variants (1 per variant) here. Perhaps start it at 0x200 to allow the uncompressed PCM codecs room to grow.
As mentioned above, symphonia-codec-pcm is a good example of a single Decoder supporting multiple codecs. Each of the CODEC_TYPES you declared in 4 can be supported by your one AdpcmDecoder. Then, you can adjust the behaviour of AdpcmDecoder based on the CODEC_TYPE provided in the CodecParameters on instantiation.

Once that's done, everything should just work ™️ .

One final thought. As per the Microsoft doc, the blocks can be as small as 32 samples. That's pretty small considering the work involved to get a parse a new block, allocate a packet, decode the packet, write to audio output, etc. Since the block size can be calculated, it may make sense to provide > 1 block per Packet, then the AdpcmDecoder can divide the packet buffer length by the block length to know how many blocks were provided. In this way we can reduce the number of next_packet -> decode iterations and amortize the costs over a greater number of playable samples. This is an optimization and comes with some caveats, but might be something to think about later.

One final note is that while libavcodec (codec library behind ffmpeg)'s support for ADPCM is pretty thorough, ffmpeg's usage of it used to introduce many strange bugs. So, I'm a little concerned about testing against it as with symphonia-check... I guess we'll cross that it it becomes an issue, though, since it may have been fixed in the 3 years since I tried last.

I think symphonia-check should be able to use other reference decoders. FFMpeg was just the gold standard in my mind when I wrote it. Any decoder that can output a wave file to standard out can be used with minimal changes.

Probably the best thing to do is just try to implement it, submit a PR, and we can iterate on it.

pdeljanov assigned pdeljanov and thomcc and unassigned pdeljanov Jul 12, 2021

pdeljanov added the enhancement New feature or request label Aug 10, 2021

pdeljanov mentioned this issue May 30, 2022

Valid files(checked by FFprobe) aren't properly parsed in Symphonia #124

Open

geckoxx mentioned this issue Nov 4, 2022

Add ADPCM Decoder #160

Merged

pdeljanov closed this as completed in #160 Nov 8, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADPCM support #41

ADPCM support #41

thomcc commented Jul 9, 2021

pdeljanov commented Jul 9, 2021

ADPCM support #41

ADPCM support #41

Comments

thomcc commented Jul 9, 2021

pdeljanov commented Jul 9, 2021