Driver/Input: Migrate audio backend to Symphonia #89

FelixMcFelix · 2021-08-23T21:01:04Z

This begins work on #67, which will be a huge step towards purely in-process audio processing (i.e., one day eliminating ffmpeg).

Audiopus-based Symphonia codec and DCA1 framing.
Ensure that Opus packet passthrough can be performed through Symphonia.
Port mixing infrastructure (and packet passthrough).
Rework input interface.
Implement basic DASH handling for youtube and other multimedia handlers extracted from youtube-dl.
Investigate upstreaming MKV support to Symphonia.
Port our stream-cacheing wrappers to this ecosystem.

There will be some slight issues stemming from the fact that we will now be parsing metadata from bytestreams as they arrive:

How will metadata be accessible to users before it enters the mixer? Is this just a spawn_blocking on the initial Format parse phase? Pass messages out from the main mixer context?
Users will need to be able to both pass in raw MediaSourceStreams, Reads, and fully processed (Format, Metadata, ...) sets -- due to the above, and as some extra metadata will only be exposed by youtube-dl, for instance.
Until MKV support is impl'd, we'll probably need to peek at bytes internally and pass over to ffmpeg on a decode failure. Having such a fallback should probably be toggleable.
We'll probably need to offer one or two wrappers to hand over AsyncReads into the mixing system (as well as automating extension/MIME type extraction from Files or HTTP responses).

Export is mainly for testing, but I think it'd also be a nice backport.

…rt. different input packet sizes.

) * Depend on Serenity's `next` branch on Songbird's `next` branch * Update the examples to use the `next` branch

Serenity's cache design has changed, this (finally) prevents the examples from failing to compile on CI.

Adds generics to any `Id` types on `Call`. Also includes the overlooked `Songbird::get_or_insert`. Closes serenity-rs#94. Tested using `cargo make ready`.

This matches a recent serenity change where `ClientBuilder` no longer has an explicit lifetime parameter. This was tested using `cargo make ready`.

MP3s now work great under the convert -> resample -> mix pipeline, even across unclean frame boundaries. Main issue seemed to be the number of internal subchunks in the resampler, which is still totally opaque... Oh well. Actual instantiation of Lazy->Wrapped and Wrapped->Parsed elements in the driver is not yet covered. Other formats are broken due to handling of the default track id, which is currently a) messy, and b) incorrect.

Oggs have some frames whose length is 0, yet must be decoded. I assume this covers the entire coder delay.

Tested with a URL extracted from bandcamp via youtube-dl, also includes the scaffolding necessary to have Reads/Seeks pass between sync/async boundaries. This format is MP3, streamed in over an HTTP request via reqwest. Seeks not tested with e.g. Async Files -- they are programmed, though.

FelixMcFelix · 2021-11-25T23:25:13Z

Rough status update: we now have a decent mixing pipeline for audio files of different formats, samples rates, packet lengths etc. into a single buffer. This is horrifically messy. Opus packet passthrough isn't there yet, but should be possible with some care around codec resets. There are some issues around track data cleanup right now.

We also have a wrapper for running seeks/reads over async audio sources (i.e., reqwest calls) and passing those bytes in via ring buffer, which seem to be working great for exposing some amount of flexibility around input sources. In tandem, mkv support in symphonia seems to be coming along well -- hopefully with that youtube-dl should integrate nicely to allow e.g. WebMs. I hope that will come down to simple parsing, with occasional renegotiation for a new DASH source if we don't want the stream to cut out halfway through.

Moves all sync creation/parsing/seeking etc. over to an elastically-sized thread pool. Since basically everything is now a restartable, this means that the `ForwardOnly` distinction can be handled really cleanly in general and we can try to recreate sources as needed.

Asks for any streams without webm, since they're likely to be golden right now.

* Remove unnecessary poison messages * Re-add Poison for CoreMessage

This is most relevant for queue users -- an extra track would cause opus passthrough to end, even though only one track was being actively used in mixing.

Corollary: this same new clippy lint adds a ton of false positives which *will* fail to compile.

Will leave 're-computing filesize guesses' to a future issue.

FelixMcFelix · 2022-07-23T22:26:11Z

I've put a fairly in-depth review and commenting round into the core driver changes, and I've had a more cursory look over the rest of the changes (inputs, adapters, tracks). I'll be merging shortly -- between this and the adventurous use in prod from a handful of users I think this is really quite stable now. Most importantly, it's fast and it's lightweight as a pure Rust implementation should be.

One caveat is that we're waiting on one symphonia bug fix being merged and released before this can go to crates.io, at some point this might need to be pushed out with a disclaimer that the current symphonia fork should be Cargo patched in instead. Hopefully this won't be the case!

This extensive PR rewrites the internal mixing logic of the driver to use symphonia for parsing and decoding audio data, and rubato to resample audio. Existing logic to decode DCA and Opus formats/data have been reworked as plugins for symphonia. The main benefit is that we no longer need to keep yt-dlp and ffmpeg processes alive, saving a lot of memory and CPU: all decoding can be done in Rust! In exchange, we now need to do a lot of the HTTP handling and resumption ourselves, but this is still a huge net positive. `Input`s have been completely reworked such that all default (non-cached) sources are lazy by default, and are no longer covered by a special-case `Restartable`. These now span a gamut from a `Compose` (lazy), to a live source, to a fully `Parsed` source. As mixing is still sync, this includes adapters for `AsyncRead`/`AsyncSeek`, and HTTP streams. `Track`s have been reworked so that they only contain initialisation state for each track. `TrackHandles` are only created once a `Track`/`Input` has been handed over to the driver, replacing `create_player` and related functions. `TrackHandle::action` now acts on a `View` of (im)mutable state, and can request seeks/readying via `Action`. Per-track event handling has also been improved -- we can now determine and propagate the reason behind individual track errors due to the new backend. Some `TrackHandle` commands (seek etc.) benefit from this, and now use internal callbacks to signal completion. Due to associated PRs on felixmcfelix/songbird from avid testers, this includes general clippy tweaks, API additions, and other repo-wide cleanup. Thanks go out to the below co-authors. Co-authored-by: Gnome! <45660393+GnomedDev@users.noreply.github.com> Co-authored-by: Alakh <36898190+alakhpc@users.noreply.github.com>

FelixMcFelix and others added 10 commits August 20, 2021 22:27

Initial Symphonia libopus wrapper

95b36f1

Messy DCA(1) framing for Symphonia, DCA export on Compressed

7e8c81e

Export is mainly for testing, but I think it'd also be a nice backport.

Primitive mixer code.

9556871

Early measurements, thinking about right granularity for resampling w…

454637f

…rt. different input packet sizes.

Some structuring, committing for now.

0c9689a

Prelim integration of mix code (not input readying yet...)

10bca95

Minor progress on dropping in new inputs...

4663884

Deps: Depend on Serenity's next branch on Songbird's next branch (#6

ae69e5e

) * Depend on Serenity's `next` branch on Songbird's `next` branch * Update the examples to use the `next` branch

Deps: Bump streamcatcher version -> 1.0 (serenity-rs#93)

3784490

Examples: Fix serenity-next cache accesses (serenity-rs#99)

94bd290

Serenity's cache design has changed, this (finally) prevents the examples from failing to compile on CI.

FelixMcFelix force-pushed the next branch from abe5c35 to 94bd290 Compare October 19, 2021 16:09

FelixMcFelix added 10 commits October 19, 2021 17:28

Gateway: Add generics to Call methods. (serenity-rs#102)

2129422

Adds generics to any `Id` types on `Call`. Also includes the overlooked `Songbird::get_or_insert`. Closes serenity-rs#94. Tested using `cargo make ready`.

Merge branch 'next' into symphonia

6a63e78

Gateway: Remove lifetime from Serenity setup trait (serenity-rs#103)

7956792

This matches a recent serenity change where `ClientBuilder` no longer has an explicit lifetime parameter. This was tested using `cargo make ready`.

Merge branch 'next' into symphonia

b8cf942

Dumb hack for Ogg playback.

069cbd9

Oggs have some frames whose length is 0, yet must be decoded. I assume this covers the entire coder delay.

Lazy input creation pipeline.

0d10631

Use new last_decoded Decoder semantics.

07e278e

Lazy inputs work as expected -- for Files at least.

af88b64

FelixMcFelix added 8 commits December 13, 2021 11:25

Seek support.

64cdb6c

Moves all sync creation/parsing/seeking etc. over to an elastically-sized thread pool. Since basically everything is now a restartable, this means that the `ForwardOnly` distinction can be handled really cleanly in general and we can try to recreate sources as needed.

Basic youtube-dl input, remove old tokio support.

140db80

Asks for any streams without webm, since they're likely to be golden right now.

Update to 2021 edition

04abb1f

Re-enable SetTrack

f6e9253

Redesign cached::Memory, port cached::Compressed

e666131

Opus frame passthrough support restored

c3c67ee

Stereo/mono mix target support.

170fff0

DCA seek support, metadata fixes.

ecc9cf1

GnomedDev and others added 8 commits June 27, 2022 23:20

Remove unnecessary poison messages (#6)

c069ac4

* Remove unnecessary poison messages * Re-add Poison for CoreMessage

Use OnceCell and DashMap (#8)

8b23dcd

Format, alphabetise feature deps.

fa9352b

Fix ShardMessage for serenity::next

185038f

Structify YoutubeDL Json reading

965ba4a

Make url field mandatory in YtDL Output.

b4bd427

FFprobe output as File::aux_metadata

82b29ea

Quick once-over of the serenity examples.

29d8d55

FelixMcFelix mentioned this pull request Jul 5, 2022

Support serenity simd_json #105

Merged

FelixMcFelix and others added 8 commits July 11, 2022 16:56

Review Pt. 1: Doc mix logic, extend passthrough to one *live* track

bcb23d2

This is most relevant for queue users -- an extra track would cause opus passthrough to end, even though only one track was being actively used in mixing.

Some unneeded refs caught by clippy

d29dec1

Corollary: this same new clippy lint adds a ton of false positives which *will* fail to compile.

Minor cleanup: not a fan of From<&Type> vs. dedicated method.

5557e6c

Track removals from event context, sans Vec

d9f0ef7

Review Pt 2: Mixer task.

cf256ad

Act on queued seeks given mid-track--prepare.

c69d11f

Review pt 3: Starting on inputs

afb5cb5

Will leave 're-computing filesize guesses' to a future issue.

Remove unnecessary chrono dep (#9)

71f1e46

FelixMcFelix force-pushed the next branch from 49c3346 to 0d66467 Compare July 22, 2022 21:20

FelixMcFelix added 6 commits July 23, 2022 18:56

Merge branch 'next' into symphonia

5f87e23

Adapt examples to new serenity framework.

20cd609

impl Error for Cached adapters

bfcc932

Review pt 4: adapters.

01e1030

Review pt 5: Inputs

fae374a

Finis

e7a233c

FelixMcFelix marked this pull request as ready for review July 23, 2022 22:15

FelixMcFelix merged commit 5547de2 into serenity-rs:next Jul 23, 2022

FelixMcFelix mentioned this pull request Jul 25, 2022

Migrate mixing/decoding toolchain to Symphonia #67

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Driver/Input: Migrate audio backend to Symphonia #89

Driver/Input: Migrate audio backend to Symphonia #89

FelixMcFelix commented Aug 23, 2021 •

edited

Loading

FelixMcFelix commented Nov 25, 2021

FelixMcFelix commented Jul 23, 2022

Driver/Input: Migrate audio backend to Symphonia #89

Driver/Input: Migrate audio backend to Symphonia #89

Conversation

FelixMcFelix commented Aug 23, 2021 • edited Loading

FelixMcFelix commented Nov 25, 2021

FelixMcFelix commented Jul 23, 2022

FelixMcFelix commented Aug 23, 2021 •

edited

Loading