How better handle negative NaNs? #1

MaxGraey · 2018-11-17T17:54:45Z

I think need better clarify how handle negative NaNs. Most of implementations in built-ins of LLVM, GCC, Go and Rust use non-sign agnostic for NaNs like:

signbit(+NaN) == false // +NaN => 0x7ff80000_00000000
signbit(-NaN) == true  // -NaN => 0xfff80000_00000000

But in spec this not strictly mentioned and it seems we need always handle signed and unsigned NaNs as false?

Relate to this discussion

The text was updated successfully, but these errors were encountered:

chicoxyzzy · 2018-11-17T18:16:02Z

-NaN is actually evaluates to NaN in JS

> function ident(n) {return n}
< undefined
> ident(-0)
< -0
> ident(-NaN)
< NaN

so I suppose it should be handled as NaN

MaxGraey · 2018-11-17T18:45:23Z

const F64 = new Float64Array(1);
const U64 = new Uint32Array(F64.buffer);

F64[0] = NaN;
console.log('0x' + U64[1].toString(16));

F64[0] =-NaN;
console.log('0x' + U64[1].toString(16));

> 0x7ff80000
> 0xfff80000

chicoxyzzy · 2018-11-17T18:57:49Z

There is no negative NaN in spec though

distinct “Not-a-Number” values of the IEEE Standard are represented in ECMAScript as a single special NaN value

https://tc39.github.io/ecma262/#sec-ecmascript-language-types-number-type

hax · 2019-11-05T12:57:12Z

I found that all engines are actually have different raw bits for NaN and -NaN.

For example chakra implement it in chakra-core/ChakraCore#5905 .

And it seems Chrome recently also implement it (version 79+) though I have no time to find the original PR.

hax · 2019-11-05T16:10:00Z

// use TypedArray to expose the sign bit
// note this also use the coercion `ToNumber` semantic
Math.signbit = (() => {
	const LE = new Uint8Array(new Uint16Array([1]).buffer)[0]
	return function signbit(n) {
		const f64 = new Float64Array([n])
		const i32 = new Uint32Array(f64.buffer)
		return (i32[LE] >>> 31) === 1
	}
})()

console.log(Math.signbit(0))
console.log(Math.signbit(-0))
console.log(Math.signbit(Infinity))
console.log(Math.signbit(-Infinity))
console.log(Math.signbit(NaN))
console.log(Math.signbit(-NaN))
console.log(Math.signbit(-(-NaN)))
const negNaN = Number.POSITIVE_INFINITY / Number.NEGATIVE_INFINITY
console.log(Math.signbit(negNaN))

Chrome 80, FireFox 70 and Safari 13 all return interlaced false and true 😊
Chrome 78, Node 12 ~ 13 returnfalse false false false for NaN cases
Node 10 ~ 11 return false true false true for NaN cases 😊
Node 0.12 ~ 9 return false false false true for NaN cases
Node 0.8 ~ 0.10 return false true false true for NaN cases 😊
Old Chakra return true true true true for NaN cases 🤣

Note all tests are run on my MacBook Air (macOS High Sierra 10.13.6, Intel Core i5)

MaxGraey · 2019-11-05T16:45:29Z

Interesting. Btw you could use simpler approach because JS should use LE for x84:

const F64 = new Float64Array(1);
const U64 = new Uint32Array(F64.buffer);

const signbit = x => (F64[0] = x, Boolean(U64[1] >>> 31));

ghost · 2021-03-19T19:35:59Z

I came along and was wondering why special casing was made for NaNs too.

It wouldn't act like C's signbit at all then, but according to @chicoxyzzy, JS doesn't have a negative NaN.

If it isn't possible to create/use a NaN with an arbitrary bitset, then wouldn't one be able to use the bit manipulation implementations that most other languages use for signbit, without special casing NaNs, relying on the JS VM to canonicalize the NaN upon writing/reading/serializing it?

hax · 2021-04-19T03:57:03Z

JS doesn't have a negative NaN.

I think as my previous tests, engines actually have negative NaNs, currently it could be treated as abstract leak of implementation details in some degree, but if introduce signbit, I suppose it should reflect them as is.

ljharb · 2021-04-19T04:02:51Z

Exposing the bit patterns of NaN is a massive mistake in Typed Arrays, and one we should not extend anywhere else. Math.signbit should, like every non-Typed-Array part of the language, canonicalize NaNs and not distinguish between any bit patterns of any implementation's NaN values.

ghost · 2021-04-19T04:55:41Z

Exposing the bit patterns of NaN is a massive mistake in Typed Arrays

If I may ask, why? NaN is just as much of a number as 53.5 is, as 8 is, as 0 is, as -0 is, as infinity is, etc, as least according to IEEE 754 semantics and rules. All of them have a hard bit-pattern, and because TypedArrays expose any of them, I'd argue that they should all be exposed.

Maybe... just maybe, the language spec should be changed to reflect modern implementations, and have different NaNs?

ljharb · 2021-04-19T05:13:24Z

@crimsoncodes0 because in JS, explicitly and intentionally, there is supposed to only be one observable NaN value.

Typed Arrays expose them because the implementations that led to them didn't canonicalize. That doesn't mean it's a good decision.

Nothing should ever be added to the language that widens this unfortunate exposure.

MaxGraey · 2021-04-19T07:37:01Z

I think as my previous tests, engines actually have negative NaNs, currently it could be treated as abstract leak of implementation details in some degree, but if introduce signbit, I suppose it should reflect them as is.

Yes, according IEEE 754 negative NaN is canonical and fully valid (chould be preserve sign and propagate with sign)

MaxGraey · 2021-04-19T07:42:04Z

Exposing the bit patterns of NaN is a massive mistake in Typed Arrays, and one we should not extend anywhere else.

@ljharb In my opinion the big mistake is try to fix IEEE 754 on software (language or VM) level. Even WebAssembly which try to be most deterministic ISA/VM don't try to do this

ljharb · 2021-04-19T13:15:03Z

All of JavaScript does this already, outside of typed arrays. It’s part of the language design.

ghost · 2021-04-19T14:23:02Z

Would it be a web compatibility-breaking change to add to the TypedArray's spec that implementations must canonicalize NaN values from the Float{32,64}Array numerical accessors and DataView.getFloat{32,64}?

Presently, it sounds like the language is quite frankly... broken. Yes, it's a small thing, but it still breaks a fundamental part of the ES language spec, and explicitly putting a a step into the algorithms for reading memory into JS floats would fill this hole, and clear up this issue, as JS implementations would no-longer expose NaN bit patterns.

ljharb · 2021-04-19T14:26:32Z

It wouldn't likely break the web, but the committee explicitly decided in 2015 to not mandate NaN canonicalization in Typed Arrays, for performance reasons, and I'm quite confident there's no appetite to revisit that decision.

MaxGraey · 2021-04-19T14:28:48Z

It wouldn't likely break the web, but the committee explicitly decided in 2015 to not mandate NaN canonicalization in Typed Arrays, for performance reasons

And this totally make sense. How about relax NaN canonization to other lang parts? I don't think it may break the web

ljharb · 2021-04-19T14:36:37Z

@MaxGraey other language parts aren't used in hot paths or perf-sensitive code like Typed Arrays are (that's their reason for existing). I would be strongly opposed to any attempt to further worsen the situation around NaN canonicalization in the language.

MaxGraey · 2021-04-19T14:39:46Z

attempt to further worsen the situation around NaN canonicalization in the language.

Why? In user space bit signature of NaN doesn't matter at all. It may still canonize for FFI or something like this if it's necessary. Relax this requirement will simplify and speedup js engines

ghost · 2021-04-19T18:31:01Z

Off-topic, but does ECMAScript's canonical NaN value have a canonical bitset?

in JS, explicitly and intentionally, there is supposed to only be one observable NaN value.

And is there any documented reasoning behind that decision? If so, could it be linked, so that we may at least understand this situation (a bit) better?

ljharb · 2021-04-19T19:28:32Z

@crimsoncodes0 no, since the only bits of it are exposed via Typed Arrays.

The spec itself: https://tc39.es/ecma262/#sec-ecmascript-language-types-number-type.

In some implementations, external code might be able to detect a difference between various Not-a-Number values, but such behaviour is implementation-defined; to ECMAScript code, all NaN values are indistinguishable from each other.

ghost · 2021-04-19T21:49:44Z

I can't open the spec's multi-megabyte webpage without causing my entire device to lag, or crashing my (mobile) browser, is there a way to open only a small section of the spec?

Besides that, I have one last question to help me assess this problem: does the ES spec say that the floating point number (5.0) has a bitset? Does it acknowledge that it has one or otherwise say that it does?

If it acknowledges that any numbers have bit-patterns, it should acknowledge that all numbers do, including not-a-number, otherwise the specification makes no sense whatsoever, and ought to be changed.

If it does not acknowledge that any numbers have bit-patterns, then TypedArrays and DataViews are just plain broken features in JavaScript, since they clearly expose these "non-existent" bit-patterns to user scripts.

ljharb · 2021-04-19T22:02:44Z

Here's the same section on the multipage build: https://tc39.es/ecma262/multipage/ecmascript-data-types-and-values.html#sec-ecmascript-language-types-number-type

There's a note in there about the bit pattern; not sure if that answers your question.

That the language here is incongruous between "typed arrays" and "everything else" is true, but doesn't mean anything can change it. It also doesn't mean the incongruity should be worsened.

hax · 2021-04-20T02:35:54Z

I would be strongly opposed to any attempt to further worsen the situation around NaN canonicalization in the language.

But I think the semantic of signbit() should expose the sign bit as is. This is what signbit in any other languages do.

It also keep the simple invariant of signbit(x) === !signbit(-x).

ljharb · 2021-04-20T02:56:44Z

I don't think that invariant is possible; -(-NaN) is not guaranteed to have the same bit pattern as the original NaN. Engines are already allowed to canonicalize NaN in Typed Arrays - many just don't choose to.

There are no guarantees once you have a NaN. even storing it in a variable can change the bit pattern.

ghost · 2021-04-20T03:30:50Z

The first step of the unary negation algorithm canonicalizes the NaN, therefore this is merely a double canonicalization, thus the NaN should be the exact same NaN and consequently have the same bit-pattern, so I don't follow?

If the above is correct, then current engines aren't implementing it at all.

ljharb · 2021-04-20T04:01:22Z

Feel free to experiment with it in various engines - when writing https://npmjs.com/get-nans, i found a lot of unpredictable and unintuitive behavior.

hax · 2021-04-20T04:13:10Z

-(-NaN) is not guaranteed to have the same bit pattern

As my previous test #1 (comment) , most engines keep the bit pattern.

dy · 2024-12-27T01:21:29Z

@hax it's not consistent btw.

// use TypedArray to expose the sign bit
// note this also use the coercion ToNumber semantic
Math.signbit = (() => {
const LE = new Uint8Array(new Uint16Array([1]).buffer)[0]
return function signbit(n) {
const f64 = new Float64Array([n])
const i32 = new Uint32Array(f64.buffer)
return (i32[LE] >>> 31) === 1
}
})()

If you call signbit long enough, it will give different results. Try this in console:

let view = new DataView(new Float32Array(1).buffer)
for (let i = 0; i < 1e5; i++)
{
    view.setFloat32(0,-NaN)
    if ([ view.getUint8(3), view.getUint8(2), view.getUint8(1), view.getUint8(0) ]+'' !== [0, 0, 192, 127]+'') 
    console.log('failed', i)
}

After ~3k calls it gives different result.

ljharb · 2024-12-27T05:29:26Z

@dy as i explained on nodejs/node#56373 (comment)

This is expected, and is the nature of the language. Although the majority of the JS language only has one observable NaN, via Typed Arrays one can view bit patterns of many of the millions of NaN values in IEEE 754.

It would be perfectly reasonable in this non-TA API to canonicalize all NaN values, and treat them all the same.

MaxGraey mentioned this issue Nov 17, 2018

add Math.signbit AssemblyScript/assemblyscript#333

Merged

MaxGraey closed this as completed Jan 15, 2019

chicoxyzzy mentioned this issue Nov 3, 2019

Use cases & motivation #5

Open

MaxGraey reopened this Nov 5, 2019

hax mentioned this issue Apr 27, 2022

Petition to use Number namespace instead of Math #7

Open

chicoxyzzy mentioned this issue May 12, 2023

Math.copysign? rwaldron/proposal-math-extensions#33

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How better handle negative NaNs? #1

How better handle negative NaNs? #1

MaxGraey commented Nov 17, 2018 •

edited

Loading

chicoxyzzy commented Nov 17, 2018 •

edited

Loading

MaxGraey commented Nov 17, 2018

chicoxyzzy commented Nov 17, 2018

hax commented Nov 5, 2019

hax commented Nov 5, 2019 •

edited

Loading

MaxGraey commented Nov 5, 2019 •

edited

Loading

ghost commented Mar 19, 2021 •

edited by ghost

Loading

hax commented Apr 19, 2021

ljharb commented Apr 19, 2021

ghost commented Apr 19, 2021 •

edited by ghost

Loading

ljharb commented Apr 19, 2021

MaxGraey commented Apr 19, 2021 •

edited

Loading

MaxGraey commented Apr 19, 2021 •

edited

Loading

ljharb commented Apr 19, 2021

ghost commented Apr 19, 2021 •

edited by ghost

Loading

ljharb commented Apr 19, 2021

MaxGraey commented Apr 19, 2021

ljharb commented Apr 19, 2021

MaxGraey commented Apr 19, 2021 •

edited

Loading

ghost commented Apr 19, 2021 •

edited by ghost

Loading

ljharb commented Apr 19, 2021

ghost commented Apr 19, 2021

ljharb commented Apr 19, 2021

hax commented Apr 20, 2021

ljharb commented Apr 20, 2021

ghost commented Apr 20, 2021 •

edited by ghost

Loading

ljharb commented Apr 20, 2021

hax commented Apr 20, 2021

dy commented Dec 27, 2024 •

edited

Loading

ljharb commented Dec 27, 2024

How better handle negative NaNs? #1

How better handle negative NaNs? #1

Comments

MaxGraey commented Nov 17, 2018 • edited Loading

chicoxyzzy commented Nov 17, 2018 • edited Loading

MaxGraey commented Nov 17, 2018

chicoxyzzy commented Nov 17, 2018

hax commented Nov 5, 2019

hax commented Nov 5, 2019 • edited Loading

MaxGraey commented Nov 5, 2019 • edited Loading

ghost commented Mar 19, 2021 • edited by ghost Loading

hax commented Apr 19, 2021

ljharb commented Apr 19, 2021

ghost commented Apr 19, 2021 • edited by ghost Loading

ljharb commented Apr 19, 2021

MaxGraey commented Apr 19, 2021 • edited Loading

MaxGraey commented Apr 19, 2021 • edited Loading

ljharb commented Apr 19, 2021

ghost commented Apr 19, 2021 • edited by ghost Loading

ljharb commented Apr 19, 2021

MaxGraey commented Apr 19, 2021

ljharb commented Apr 19, 2021

MaxGraey commented Apr 19, 2021 • edited Loading

ghost commented Apr 19, 2021 • edited by ghost Loading

ljharb commented Apr 19, 2021

ghost commented Apr 19, 2021

ljharb commented Apr 19, 2021

hax commented Apr 20, 2021

ljharb commented Apr 20, 2021

ghost commented Apr 20, 2021 • edited by ghost Loading

ljharb commented Apr 20, 2021

hax commented Apr 20, 2021

dy commented Dec 27, 2024 • edited Loading

ljharb commented Dec 27, 2024

MaxGraey commented Nov 17, 2018 •

edited

Loading

chicoxyzzy commented Nov 17, 2018 •

edited

Loading

hax commented Nov 5, 2019 •

edited

Loading

MaxGraey commented Nov 5, 2019 •

edited

Loading

ghost commented Mar 19, 2021 •

edited by ghost

Loading

ghost commented Apr 19, 2021 •

edited by ghost

Loading

MaxGraey commented Apr 19, 2021 •

edited

Loading

MaxGraey commented Apr 19, 2021 •

edited

Loading

ghost commented Apr 19, 2021 •

edited by ghost

Loading

MaxGraey commented Apr 19, 2021 •

edited

Loading

ghost commented Apr 19, 2021 •

edited by ghost

Loading

ghost commented Apr 20, 2021 •

edited by ghost

Loading

dy commented Dec 27, 2024 •

edited

Loading