TextDecoderStream leaks a native decoder resource if its stream errors #13142

h4l · 2021-12-19T15:28:06Z

I've come across what seems to be a bug in TextDecoderStream which allows it to leak the native decoder used by its TextDecoder.

I've made a test module to demonstrate the issue (repro code is at the bottom of that page, under the output): https://gist.github.com/h4l/0199ab7cc24dd13536e01c5ea98b3ae7
The 3 tests trigger the test runner's resource leak detection (a really nice feature!).

What seems to be happening is:

TextDecoderStream uses a TextDecoder to decode its chunks
TextDecoder.decode() creates a native decoder resource, which it holds open when used in streaming mode. It closes the resource when a non-streaming decode() call is made.
TextDecoderStream makes streaming decode() calls in its transform() method, and makes a final non-streaming decode() call in its flush() method.
When a stream pipeline is errored, the flush() method of any Transformer in a TransformStream is not called, so in the case of TextDecoderStream it has no way to know it's no longer in use, and it keeps its decoder open.
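
To illustrate the last point, here is a minimal sketch (not taken from the repro gist; `hooksCalledOnAbort` and the `calls` array are made-up names for illustration) that records which Transformer hooks run when the writable side is aborted. Assuming a spec-compliant TransformStream, flush() should never show up in the record:

```typescript
// Hypothetical helper: returns the names of the Transformer hooks that ran.
async function hooksCalledOnAbort(): Promise<string[]> {
  const calls: string[] = [];
  const ts = new TransformStream<string, string>({
    transform(chunk, controller) {
      calls.push("transform");
      controller.enqueue(chunk);
    },
    flush() {
      // Only reached when the writable side closes normally.
      calls.push("flush");
    },
  });

  const writer = ts.writable.getWriter();
  const reader = ts.readable.getReader();

  // Push one chunk through. The write only completes once a read is pending,
  // so start the write first and await it after the read.
  const written = writer.write("a");
  await reader.read();
  await written;

  // Abort instead of closing: the stream errors and flush() is skipped.
  await writer.abort(new Error("boom"));
  await reader.read().catch(() => {}); // the readable side is errored now

  return calls;
}
```

A transformer that holds a resource open between transform() calls therefore gets no callback in which to release it on this path.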

I was looking through the streams spec when I encountered the leak (before I worked out the cause) to try to work out if I was misusing the streams in some way. It seems to me like an oversight in the spec that Transformers have no way to be told to close/clean up when a stream doesn't close cleanly. Unless I've missed something, flush() is only called when the stream closes normally, and there are no other lifecycle methods available for Transformers.

I can't see any idiomatic way to tell the Transformer to close, but one approach could be to wrap the readable and writable streams of the TransformStream to watch for close/cancel/abort calls.

I've not made any previous contributions, but I'd be happy to help with a PR to fix this if it'd be useful.

h4l · 2022-01-23T11:44:19Z

I played around with the Streams API a bit and came up with a fairly straightforward way to implement a TransformStream whose Transformer gets notified of stream aborts. Basically two parts:

A WritableStream can be monitored for errors by wrapping it with another WritableStream that opens a writer on the monitored stream, exposes the writer's closed promise (which rejects if the monitored stream is aborted), and forwards start/write/close/abort calls to the monitored stream.
That looks like this: https://deno.land/x/shutdown_aware_transform_stream@1.0.0/shutdown_monitor_writable_stream.ts
Then a TransformStream can react to stream aborts by monitoring its writable side with the monitor stream, and using the closed promise to be notified when the stream aborts.
That looks like this: https://deno.land/x/shutdown_aware_transform_stream@1.0.0/shutdown_aware_transform_stream.ts#L98
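
The two parts above can be sketched roughly as follows. This is a cut-down illustration, not the code from the linked module: `monitorWritable` and `withAbortHook` are hypothetical names, and the sketch assumes that erroring either side of the TransformStream rejects the closed promise of a writer held on its writable side.

```typescript
// Hypothetical helper: wraps `inner` and exposes a promise that fulfils when
// `inner` closes normally and rejects when it is aborted or errored.
function monitorWritable<W>(inner: WritableStream<W>): {
  writable: WritableStream<W>;
  closed: Promise<void>;
} {
  const writer = inner.getWriter();
  const writable = new WritableStream<W>({
    // Forward every call to the monitored stream.
    write: (chunk) => writer.write(chunk),
    close: () => writer.close(),
    abort: (reason) => writer.abort(reason),
  });
  return { writable, closed: writer.closed };
}

// Hypothetical helper: calls `onAbort` if the transform stream is torn down
// without closing normally (writable aborted or readable cancelled).
function withAbortHook<I, O>(
  ts: TransformStream<I, O>,
  onAbort: (reason: unknown) => void,
): { readable: ReadableStream<O>; writable: WritableStream<I> } {
  const { writable, closed } = monitorWritable(ts.writable);
  closed.catch(onAbort);
  return { readable: ts.readable, writable };
}
```

A TextDecoder-based transform could pass a callback that makes a final non-streaming decode() call as `onAbort`. Note that the outer writable in this sketch only learns about a cancelled readable side on its next write.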

Would you be interested in a PR for TextDecoderStream that fixes this issue by using a cut down/minimal version of the above to close the TextDecoder on aborts?

Or alternatively, it seems reasonable to me to suggest/request a change to the WHATWG Streams spec to give the controller of a TransformStream a property exposing an AbortSignal which could be used to handle stream aborts. WritableStreamDefaultController provides this already, but TransformStreamDefaultController doesn't. (And/or to give Transformer objects an abort() method, like the underlying sinks of a WritableStream have.)
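
For comparison, this is roughly how the existing `signal` property on WritableStreamDefaultController can be used to react to an abort. It is only a sketch: `signalFiresOnAbort` and the `released` flag are made up for illustration and stand in for real resource cleanup, and it assumes a runtime that implements `controller.signal`.

```typescript
// Hypothetical helper: resolves to true if the abort listener ran.
async function signalFiresOnAbort(): Promise<boolean> {
  let released = false;
  const ws = new WritableStream<Uint8Array>({
    start(controller) {
      // The signal is aborted when the stream is aborted.
      controller.signal.addEventListener("abort", () => {
        released = true; // stand-in for releasing a held resource
      });
    },
    write(_chunk) {
      // A real sink would consume the chunk here.
    },
  });

  await ws.abort(new Error("stop"));
  return released;
}
```

TransformStreamDefaultController has no equivalent property, which is the gap described above.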

What do you think?

crowlKats · 2022-01-23T13:46:01Z

~~This seems to me like something that should be changed in the specification. It does seem to me like a good addition.~~ My bad, flush should take care of that actually already

h4l · 2022-01-23T18:44:01Z

As I understand it, flush is only called when the stream closes normally; it's not called when the stream aborts: https://streams.spec.whatwg.org/#transform-stream-error
Or do you mean you think the spec should change to call flush when the stream aborts?

h4l · 2022-01-24T14:18:59Z

I'll bring it up with the Streams spec team.

nkronlage · 2022-06-28T18:23:13Z

In case this helps anyone hitting this issue when breaking out of async iteration, I was able to work around it with a custom decoder transform stream:

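  // Every decode() call here is non-streaming, so the TextDecoder does not
  // hold a native decoder open between chunks.
  // Caveat: a multi-byte character split across two chunks decodes to U+FFFD.
  const MyTextDecoderStream = () => {
    const textDecoder = new TextDecoder();
    return new TransformStream({
      transform(chunk: Uint8Array, controller: TransformStreamDefaultController) {
        controller.enqueue(textDecoder.decode(chunk));
      },
      flush(controller: TransformStreamDefaultController) {
        controller.enqueue(textDecoder.decode());
      }
    });
  };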

nounder · 2023-05-10T14:49:16Z

Alternative solution with manual decoding, from the duplicate issue #19074

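Deno.test("working alternative", async () => {
  const res = await fetch(
    "https://deno.land/std@0.186.0/json/testdata/test.jsonl"
  )

  // Read one chunk by hand and decode it with a plain TextDecoder
  // instead of piping the body through TextDecoderStream.
  const textDecoder = new TextDecoder()
  const reader = res.body!.getReader()
  const b = await reader.read()
  // decode() is synchronous, so no await is needed here.
  const t = textDecoder.decode(b.value!)

  // Cancel the body so the test does not leak the response resource.
  await reader.cancel()
})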

…llation (#21074) This PR uses the new `cancel` method of `TransformStream` to properly clean up the internal `TextDecoder` used in `TextDecoderStream` if the stream is cancelled. Fixes #13142 Co-authored-by: Bartek Iwańczuk <biwanczuk@gmail.com>
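
As a rough illustration of the approach that PR description mentions, a transformer can release its decoder in a `cancel` hook. This is a sketch under assumptions, not the code from the PR: `sketchTextDecoderStream` and the `calls` array are hypothetical, and `cancel` is only invoked by runtimes that implement that Transformer hook (older runtimes ignore it).

```typescript
// Hypothetical stand-in for TextDecoderStream. `calls` records which
// teardown hook ran, purely for illustration.
function sketchTextDecoderStream(
  calls: string[] = [],
): TransformStream<Uint8Array, string> {
  const decoder = new TextDecoder();
  const transformer = {
    transform(
      chunk: Uint8Array,
      controller: TransformStreamDefaultController<string>,
    ) {
      // Streaming decode: incomplete byte sequences are kept for the next chunk.
      const text = decoder.decode(chunk, { stream: true });
      if (text) controller.enqueue(text);
    },
    flush(controller: TransformStreamDefaultController<string>) {
      calls.push("flush");
      // Final non-streaming call ends the decode on a normal close.
      const text = decoder.decode();
      if (text) controller.enqueue(text);
    },
    cancel(_reason?: unknown) {
      calls.push("cancel");
      // The same final call, made when the stream is cancelled or aborted.
      decoder.decode();
    },
  };
  return new TransformStream<Uint8Array, string>(transformer);
}
```

On a normal close only flush() runs. On cancellation flush() is still skipped, so the `cancel` hook is the only place left to make the final non-streaming decode() call.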

kitsonk added the bug and web labels Dec 19, 2021

h4l mentioned this issue Jan 24, 2022

Handling stream errors in a TransformStream transformer whatwg/streams#1212

Closed

h4l added a commit to h4l/issue_13142_text_decoder_stream that referenced this issue Jan 30, 2022

feat: implement Issue13142TextDecoderStream

821f8ab

A TextDecoderStream that works around the Deno bug: denoland/deno#13142

andreubotella mentioned this issue Mar 22, 2022

It doesn't release a resource when we break the async iteration #14070

Closed

crowlKats mentioned this issue May 10, 2023

TextDecoderStream leak resources #19074

Closed

crowlKats mentioned this issue Jul 27, 2023

ReadableStream is missing asyncIterator and cancel does not cancel #19946

Open

egfx-notifications mentioned this issue Nov 4, 2023

fix(ext/web): Prevent TextDecoderStream resource leak on stream cancellation #21074

Merged

mmastrac closed this as completed in #21074 Nov 12, 2023

iuioiua mentioned this issue Nov 13, 2023

chore(csv): use TextDecoderStream() in test denoland/std#3801

Merged

Clemens-git76 mentioned this issue May 13, 2024

[Snyk] Security upgrade jest from 27.5.1 to 29.0.0 Clemens-git76/deno#1

Open

effigies mentioned this issue Aug 4, 2024

Cancelling a TextDecoderStream that failed to read invalid data raises TypeError #24872

Closed
