Improve performance of unescapeXML #125

mogsie · 2018-09-26T14:39:27Z

Half of #120 fixed by this change.

sonnyp · 2018-09-26T21:34:25Z

test/escape-test.js

@@ -49,6 +49,9 @@ vows.describe('escape').addBatch({
    'unescapes \'': function () {
      assert.strictEqual(unescapeXML('&apos;'), '\'')
    },
+    'leaves invalid entities alone': function () {
+      assert.strictEqual(unescapeXML('&foobar;'), '&foobar;')
+    },


should we throw instead?

That was my first inclination, and my first implementation, but I didn't want to break backwards compatibility, so I decided to write a test that verified that the behaviour didn't change. If anything, that might be better to put in a different PR.

sounds sane :) good candidate for release 3.0

Invalid entities are passed through silently, meaning that "<a>&foobar;</a>" is allowed, even though it technically isn't valid XML. In order to preserve backwards compatibility, this test has been added to avoid changing this behaviour.

The original replace function uses a regular expression to find expressions to parse. It is more efficient to use the indexOf to find the first matching '&' character and then the matching ';' character. Fixes half of xmppjs#120.

sonnyp · 2018-09-29T10:26:48Z

parsers suite

before: ltx x 817,222 ops/sec ±1.94% (88 runs sampled)
after: ltx x 1,098,104 ops/sec ±0.87% (95 runs sampled)

Node.js v10.11.0 - Intel(R) Core(TM) i5-2520M CPU @ 2.50GHz

sonnyp · 2018-10-03T20:51:44Z

https://github.com/xmppjs/ltx/releases/tag/v2.8.0

sonnyp reviewed Sep 26, 2018

View reviewed changes

mogsie mentioned this pull request Sep 28, 2018

Throw on invalid entity #127

Merged

Erik Mogensen added 2 commits September 28, 2018 15:10

Improve performance of unescapeXML

8fccfe3

The original replace function uses a regular expression to find expressions to parse. It is more efficient to use the indexOf to find the first matching '&' character and then the matching ';' character. Fixes half of xmppjs#120.

mogsie force-pushed the performance-escape branch from 83a52f2 to 8fccfe3 Compare September 28, 2018 13:11

sonnyp merged commit 2690f7c into xmppjs:master Sep 30, 2018

mogsie deleted the performance-escape branch September 30, 2018 18:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of unescapeXML #125

Improve performance of unescapeXML #125

mogsie commented Sep 26, 2018

sonnyp Sep 26, 2018

mogsie Sep 27, 2018

sonnyp Sep 27, 2018

sonnyp commented Sep 29, 2018 •

edited

Loading

sonnyp commented Oct 3, 2018

Improve performance of unescapeXML #125

Improve performance of unescapeXML #125

Conversation

mogsie commented Sep 26, 2018

sonnyp Sep 26, 2018

Choose a reason for hiding this comment

mogsie Sep 27, 2018

Choose a reason for hiding this comment

sonnyp Sep 27, 2018

Choose a reason for hiding this comment

sonnyp commented Sep 29, 2018 • edited Loading

sonnyp commented Oct 3, 2018

sonnyp commented Sep 29, 2018 •

edited

Loading