Fix XSS vulnerability on search results page #323

ChrisBAshton · 2023-04-11T07:42:52Z

Pages that are indexed in search results have their entire contents indexed, including any HTML code snippets. These HTML snippets would appear in the search results unsanitised, so it was possible to render arbitrary HTML or run arbitrary scripts:

This is a largely theoretical security issue; to exploit it, an attacker would need to find a way of committing malicious code to a page indexed by a site that uses tech-docs-gem (which are typically not editable by untrusted users). Their code would also be limited by the relatively short length that's rendered in the corresponding search result. Nevertheless, the XSS would then be triggerable by visiting a pre-constructed URL (/search/index.html?q=some+search+term), which users could be tricked into clicking on through social engineering.

What’s changed

This commit sanitises the HTML before rendering it to the page. It does so whilst retaining the <mark data-markjs="true"> behaviour that highlights the search term in the result:

I've used jQuery's text() function for sanitisation, as that is the approach used elsewhere in the project (1).

I did consider using native JavaScript (using the same approach as in Mustache 2) to avoid the jQuery dependency, but this itself may contain bugs and would lead to having two sanitisation approaches to maintain, so I opted against it. For future reference, the code in this commit can be swapped out with:

var entityMap = {
  '&': '&amp;',
  '<': '&lt;',
  '>': '&gt;',
  '"': '&quot;',
  "'": '&#39;',
  '/': '&#x2F;',
  '`': '&#x60;',
  '=': '&#x3D;'
};
var sanitizedContent = String(content).replace(/[&<>"'`=\/]/g, function (s) {
  return entityMap[s];
});

Identifying a user need

The look and interactions of the gem are unchanged. This simply addresses a security issue.

CHANGELOG.md

lib/assets/javascripts/_modules/search.js

Pages that are indexed in search results have their entire contents indexed, including any HTML code snippets. These HTML snippets would appear in the search results unsanitised, so it was possible to render arbitrary HTML or run arbitrary scripts: > ![script being invoked](https://user-images.githubusercontent.com/5111927/230888935-0367b598-eda7-4f67-afb5-799b41684ee3.png) > ![HTML being rendered](https://user-images.githubusercontent.com/5111927/230888939-f0056edc-6955-4f10-8aee-c93414b1cb69.png) This is a largely theoretical security issue; to exploit it, an attacker would need to find a way of committing malicious code to a page indexed by a site that uses tech-docs-gem (which are typically not editable by untrusted users). Their code would also be limited by the relatively short length that's rendered in the corresponding search result. Nevertheless, the XSS would then be triggerable by visiting a pre-constructed URL (`/search/index.html?q=some+search+term`), which users could be tricked into clicking on through social engineering. This commit sanitises the HTML before rendering it to the page. It does so whilst retaining the `<mark data-markjs="true">` behaviour that highlights the search term in the result: > ![sanitised HTML with highlights](https://user-images.githubusercontent.com/5111927/230888944-9aaf4920-cddd-43f9-8ef5-17f15785af73.png) I've used jQuery's `text()` function for sanitisation, as that is the approach used elsewhere in the project ([1]). I did consider using native JavaScript (using the same approach as in Mustache [2]) to avoid the jQuery dependency, but this itself may contain bugs and would lead to having two sanitisation approaches to maintain, so I opted against it. For future reference, the code in this commit can be swapped out with: ```js var entityMap = { '&': '&', '<': '<', '>': '>', '"': '"', "'": ''', '/': '/', '`': '`', '=': '=' }; var sanitizedContent = String(content).replace(/[&<>"'`=\/]/g, function (s) { return entityMap[s]; }); ``` [1]: https://github.com/alphagov/tech-docs-gem/blob/66cc7ab0a06dc2f1fe89de8cba2270fcf46f6466/lib/assets/javascripts/_modules/search.js#L202-L204 [2]: https://github.com/janl/mustache.js/blob/972fd2b27a036888acfcb60d6119317744fac7ee/mustache.js#L60-L75

ChrisBAshton marked this pull request as ready for review April 11, 2023 07:44

ChrisBAshton requested a review from lfdebrux April 11, 2023 07:56

claireashworth reviewed Apr 11, 2023

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

ChrisBAshton force-pushed the fix-xss branch from e3f0b9f to 6140bcb Compare April 11, 2023 08:41

lfdebrux reviewed Apr 11, 2023

View reviewed changes

lib/assets/javascripts/_modules/search.js Outdated Show resolved Hide resolved

ChrisBAshton added 2 commits April 11, 2023 09:56

Add XSS note to changelog

60e6e2d

ChrisBAshton force-pushed the fix-xss branch from 6140bcb to 60e6e2d Compare April 11, 2023 08:56

claireashworth approved these changes Apr 11, 2023

View reviewed changes

lfdebrux approved these changes Apr 11, 2023

View reviewed changes

lfdebrux merged commit a51c705 into main Apr 11, 2023

lfdebrux deleted the fix-xss branch April 11, 2023 09:24

lfdebrux mentioned this pull request Apr 11, 2023

Release v3.3.1 #324

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix XSS vulnerability on search results page #323

Fix XSS vulnerability on search results page #323

ChrisBAshton commented Apr 11, 2023 •

edited

Loading

Fix XSS vulnerability on search results page #323

Fix XSS vulnerability on search results page #323

Conversation

ChrisBAshton commented Apr 11, 2023 • edited Loading

What’s changed

Identifying a user need

ChrisBAshton commented Apr 11, 2023 •

edited

Loading