HTML block: fix parsing #27268

ellatrix · 2020-11-25T11:57:26Z

Description

Fixes #24282, or at least partly.
The issue has been around since the beginning of Gutenberg.

The problem is that, when parsing the blocks, element.innerHTML encodes these characters.

What we're looking for is the raw block content, so there's no need to parse the HTML at all, it can be skipped.

How has this been tested?

Type 0 < 1 in an HTML block. Save and reload. There should be no error.

Screenshots

Types of changes

Checklist:

My code is tested.
My code follows the WordPress code style.
My code follows the accessibility standards.
My code has proper inline documentation.
I've included developer documentation if appropriate.
I've updated all React Native files affected by any refactorings/renamings in this PR.

github-actions · 2020-11-25T12:06:44Z

Size Change: +137 B (0%)

Total Size: 1.33 MB

Filename	Size	Change
`build/block-library/index.min.js`	196 kB	+119 B (0%)
`build/blocks/index.min.js`	50.4 kB	+18 B (0%)

ℹ️ View Unchanged

Filename	Size
`build/a11y/index.min.js`	993 B
`build/annotations/index.min.js`	2.78 kB
`build/api-fetch/index.min.js`	2.27 kB
`build/autop/index.min.js`	2.15 kB
`build/blob/index.min.js`	487 B
`build/block-directory/index.min.js`	7.16 kB
`build/block-directory/style-rtl.css`	1.03 kB
`build/block-directory/style.css`	1.04 kB
`build/block-editor/content-rtl.css`	2.71 kB
`build/block-editor/content.css`	2.71 kB
`build/block-editor/default-editor-styles-rtl.css`	401 B
`build/block-editor/default-editor-styles.css`	401 B
`build/block-editor/index.min.js`	181 kB
`build/block-editor/style-rtl.css`	14.5 kB
`build/block-editor/style.css`	14.5 kB
`build/block-library/blocks/archives/editor-rtl.css`	107 B
`build/block-library/blocks/archives/editor.css`	106 B
`build/block-library/blocks/archives/style-rtl.css`	129 B
`build/block-library/blocks/archives/style.css`	129 B
`build/block-library/blocks/audio/editor-rtl.css`	185 B
`build/block-library/blocks/audio/editor.css`	185 B
`build/block-library/blocks/audio/style-rtl.css`	158 B
`build/block-library/blocks/audio/style.css`	158 B
`build/block-library/blocks/audio/theme-rtl.css`	172 B
`build/block-library/blocks/audio/theme.css`	172 B
`build/block-library/blocks/avatar/editor-rtl.css`	154 B
`build/block-library/blocks/avatar/editor.css`	154 B
`build/block-library/blocks/avatar/style-rtl.css`	126 B
`build/block-library/blocks/avatar/style.css`	126 B
`build/block-library/blocks/block/editor-rtl.css`	338 B
`build/block-library/blocks/block/editor.css`	338 B
`build/block-library/blocks/button/editor-rtl.css`	517 B
`build/block-library/blocks/button/editor.css`	517 B
`build/block-library/blocks/button/style-rtl.css`	566 B
`build/block-library/blocks/button/style.css`	566 B
`build/block-library/blocks/buttons/editor-rtl.css`	373 B
`build/block-library/blocks/buttons/editor.css`	373 B
`build/block-library/blocks/buttons/style-rtl.css`	368 B
`build/block-library/blocks/buttons/style.css`	368 B
`build/block-library/blocks/calendar/style-rtl.css`	270 B
`build/block-library/blocks/calendar/style.css`	270 B
`build/block-library/blocks/categories/editor-rtl.css`	125 B
`build/block-library/blocks/categories/editor.css`	124 B
`build/block-library/blocks/categories/style-rtl.css`	138 B
`build/block-library/blocks/categories/style.css`	138 B
`build/block-library/blocks/code/editor-rtl.css`	102 B
`build/block-library/blocks/code/editor.css`	102 B
`build/block-library/blocks/code/style-rtl.css`	159 B
`build/block-library/blocks/code/style.css`	159 B
`build/block-library/blocks/code/theme-rtl.css`	160 B
`build/block-library/blocks/code/theme.css`	160 B
`build/block-library/blocks/columns/editor-rtl.css`	147 B
`build/block-library/blocks/columns/editor.css`	147 B
`build/block-library/blocks/columns/style-rtl.css`	442 B
`build/block-library/blocks/columns/style.css`	442 B
`build/block-library/blocks/comment-author-avatar/editor-rtl.css`	163 B
`build/block-library/blocks/comment-author-avatar/editor.css`	163 B
`build/block-library/blocks/comment-content/style-rtl.css`	134 B
`build/block-library/blocks/comment-content/style.css`	134 B
`build/block-library/blocks/comment-template/style-rtl.css`	237 B
`build/block-library/blocks/comment-template/style.css`	236 B
`build/block-library/blocks/comments-pagination-numbers/editor-rtl.css`	159 B
`build/block-library/blocks/comments-pagination-numbers/editor.css`	157 B
`build/block-library/blocks/comments-pagination/editor-rtl.css`	258 B
`build/block-library/blocks/comments-pagination/editor.css`	249 B
`build/block-library/blocks/comments-pagination/style-rtl.css`	272 B
`build/block-library/blocks/comments-pagination/style.css`	268 B
`build/block-library/blocks/comments-title/editor-rtl.css`	118 B
`build/block-library/blocks/comments-title/editor.css`	118 B
`build/block-library/blocks/comments/editor-rtl.css`	875 B
`build/block-library/blocks/comments/editor.css`	874 B
`build/block-library/blocks/comments/style-rtl.css`	672 B
`build/block-library/blocks/comments/style.css`	671 B
`build/block-library/blocks/cover/editor-rtl.css`	646 B
`build/block-library/blocks/cover/editor.css`	647 B
`build/block-library/blocks/cover/style-rtl.css`	1.61 kB
`build/block-library/blocks/cover/style.css`	1.6 kB
`build/block-library/blocks/embed/editor-rtl.css`	327 B
`build/block-library/blocks/embed/editor.css`	327 B
`build/block-library/blocks/embed/style-rtl.css`	446 B
`build/block-library/blocks/embed/style.css`	446 B
`build/block-library/blocks/embed/theme-rtl.css`	172 B
`build/block-library/blocks/embed/theme.css`	172 B
`build/block-library/blocks/file/editor-rtl.css`	335 B
`build/block-library/blocks/file/editor.css`	335 B
`build/block-library/blocks/file/style-rtl.css`	288 B
`build/block-library/blocks/file/style.css`	288 B
`build/block-library/blocks/file/view.min.js`	353 B
`build/block-library/blocks/freeform/editor-rtl.css`	2.47 kB
`build/block-library/blocks/freeform/editor.css`	2.47 kB
`build/block-library/blocks/gallery/editor-rtl.css`	1.01 kB
`build/block-library/blocks/gallery/editor.css`	1.02 kB
`build/block-library/blocks/gallery/style-rtl.css`	1.58 kB
`build/block-library/blocks/gallery/style.css`	1.58 kB
`build/block-library/blocks/gallery/theme-rtl.css`	157 B
`build/block-library/blocks/gallery/theme.css`	157 B
`build/block-library/blocks/group/editor-rtl.css`	687 B
`build/block-library/blocks/group/editor.css`	687 B
`build/block-library/blocks/group/style-rtl.css`	105 B
`build/block-library/blocks/group/style.css`	105 B
`build/block-library/blocks/group/theme-rtl.css`	125 B
`build/block-library/blocks/group/theme.css`	125 B
`build/block-library/blocks/heading/style-rtl.css`	128 B
`build/block-library/blocks/heading/style.css`	128 B
`build/block-library/blocks/html/editor-rtl.css`	365 B
`build/block-library/blocks/html/editor.css`	366 B
`build/block-library/blocks/image/editor-rtl.css`	861 B
`build/block-library/blocks/image/editor.css`	859 B
`build/block-library/blocks/image/style-rtl.css`	662 B
`build/block-library/blocks/image/style.css`	666 B
`build/block-library/blocks/image/theme-rtl.css`	172 B
`build/block-library/blocks/image/theme.css`	172 B
`build/block-library/blocks/latest-comments/style-rtl.css`	333 B
`build/block-library/blocks/latest-comments/style.css`	333 B
`build/block-library/blocks/latest-posts/editor-rtl.css`	250 B
`build/block-library/blocks/latest-posts/editor.css`	249 B
`build/block-library/blocks/latest-posts/style-rtl.css`	514 B
`build/block-library/blocks/latest-posts/style.css`	514 B
`build/block-library/blocks/list/style-rtl.css`	135 B
`build/block-library/blocks/list/style.css`	135 B
`build/block-library/blocks/media-text/editor-rtl.css`	300 B
`build/block-library/blocks/media-text/editor.css`	298 B
`build/block-library/blocks/media-text/style-rtl.css`	540 B
`build/block-library/blocks/media-text/style.css`	539 B
`build/block-library/blocks/more/editor-rtl.css`	465 B
`build/block-library/blocks/more/editor.css`	465 B
`build/block-library/blocks/navigation-link/editor-rtl.css`	746 B
`build/block-library/blocks/navigation-link/editor.css`	744 B
`build/block-library/blocks/navigation-link/style-rtl.css`	153 B
`build/block-library/blocks/navigation-link/style.css`	153 B
`build/block-library/blocks/navigation-submenu/editor-rtl.css`	333 B
`build/block-library/blocks/navigation-submenu/editor.css`	333 B
`build/block-library/blocks/navigation/editor-rtl.css`	2.19 kB
`build/block-library/blocks/navigation/editor.css`	2.19 kB
`build/block-library/blocks/navigation/style-rtl.css`	2.26 kB
`build/block-library/blocks/navigation/style.css`	2.25 kB
`build/block-library/blocks/navigation/view-modal.min.js`	2.81 kB
`build/block-library/blocks/navigation/view.min.js`	447 B
`build/block-library/blocks/nextpage/editor-rtl.css`	428 B
`build/block-library/blocks/nextpage/editor.css`	428 B
`build/block-library/blocks/page-list/editor-rtl.css`	397 B
`build/block-library/blocks/page-list/editor.css`	398 B
`build/block-library/blocks/page-list/style-rtl.css`	212 B
`build/block-library/blocks/page-list/style.css`	212 B
`build/block-library/blocks/paragraph/editor-rtl.css`	214 B
`build/block-library/blocks/paragraph/editor.css`	214 B
`build/block-library/blocks/paragraph/style-rtl.css`	321 B
`build/block-library/blocks/paragraph/style.css`	321 B
`build/block-library/blocks/post-author/style-rtl.css`	212 B
`build/block-library/blocks/post-author/style.css`	212 B
`build/block-library/blocks/post-comments-form/editor-rtl.css`	137 B
`build/block-library/blocks/post-comments-form/editor.css`	137 B
`build/block-library/blocks/post-comments-form/style-rtl.css`	536 B
`build/block-library/blocks/post-comments-form/style.css`	537 B
`build/block-library/blocks/post-date/style-rtl.css`	107 B
`build/block-library/blocks/post-date/style.css`	107 B
`build/block-library/blocks/post-excerpt/editor-rtl.css`	119 B
`build/block-library/blocks/post-excerpt/editor.css`	119 B
`build/block-library/blocks/post-excerpt/style-rtl.css`	116 B
`build/block-library/blocks/post-excerpt/style.css`	116 B
`build/block-library/blocks/post-featured-image/editor-rtl.css`	620 B
`build/block-library/blocks/post-featured-image/editor.css`	618 B
`build/block-library/blocks/post-featured-image/style-rtl.css`	349 B
`build/block-library/blocks/post-featured-image/style.css`	349 B
`build/block-library/blocks/post-navigation-link/style-rtl.css`	190 B
`build/block-library/blocks/post-navigation-link/style.css`	189 B
`build/block-library/blocks/post-template/editor-rtl.css`	140 B
`build/block-library/blocks/post-template/editor.css`	139 B
`build/block-library/blocks/post-template/style-rtl.css`	317 B
`build/block-library/blocks/post-template/style.css`	317 B
`build/block-library/blocks/post-terms/style-rtl.css`	136 B
`build/block-library/blocks/post-terms/style.css`	136 B
`build/block-library/blocks/post-title/style-rtl.css`	138 B
`build/block-library/blocks/post-title/style.css`	138 B
`build/block-library/blocks/preformatted/style-rtl.css`	139 B
`build/block-library/blocks/preformatted/style.css`	139 B
`build/block-library/blocks/pullquote/editor-rtl.css`	170 B
`build/block-library/blocks/pullquote/editor.css`	170 B
`build/block-library/blocks/pullquote/style-rtl.css`	357 B
`build/block-library/blocks/pullquote/style.css`	357 B
`build/block-library/blocks/pullquote/theme-rtl.css`	201 B
`build/block-library/blocks/pullquote/theme.css`	201 B
`build/block-library/blocks/query-pagination-numbers/editor-rtl.css`	158 B
`build/block-library/blocks/query-pagination-numbers/editor.css`	156 B
`build/block-library/blocks/query-pagination/editor-rtl.css`	258 B
`build/block-library/blocks/query-pagination/editor.css`	247 B
`build/block-library/blocks/query-pagination/style-rtl.css`	326 B
`build/block-library/blocks/query-pagination/style.css`	322 B
`build/block-library/blocks/query-title/style-rtl.css`	108 B
`build/block-library/blocks/query-title/style.css`	108 B
`build/block-library/blocks/query/editor-rtl.css`	475 B
`build/block-library/blocks/query/editor.css`	477 B
`build/block-library/blocks/quote/style-rtl.css`	253 B
`build/block-library/blocks/quote/style.css`	253 B
`build/block-library/blocks/quote/theme-rtl.css`	255 B
`build/block-library/blocks/quote/theme.css`	259 B
`build/block-library/blocks/read-more/style-rtl.css`	168 B
`build/block-library/blocks/read-more/style.css`	168 B
`build/block-library/blocks/rss/editor-rtl.css`	239 B
`build/block-library/blocks/rss/editor.css`	240 B
`build/block-library/blocks/rss/style-rtl.css`	323 B
`build/block-library/blocks/rss/style.css`	323 B
`build/block-library/blocks/search/editor-rtl.css`	205 B
`build/block-library/blocks/search/editor.css`	205 B
`build/block-library/blocks/search/style-rtl.css`	441 B
`build/block-library/blocks/search/style.css`	439 B
`build/block-library/blocks/search/theme-rtl.css`	149 B
`build/block-library/blocks/search/theme.css`	149 B
`build/block-library/blocks/separator/editor-rtl.css`	184 B
`build/block-library/blocks/separator/editor.css`	184 B
`build/block-library/blocks/separator/style-rtl.css`	269 B
`build/block-library/blocks/separator/style.css`	269 B
`build/block-library/blocks/separator/theme-rtl.css`	229 B
`build/block-library/blocks/separator/theme.css`	229 B
`build/block-library/blocks/shortcode/editor-rtl.css`	508 B
`build/block-library/blocks/shortcode/editor.css`	508 B
`build/block-library/blocks/site-logo/editor-rtl.css`	522 B
`build/block-library/blocks/site-logo/editor.css`	522 B
`build/block-library/blocks/site-logo/style-rtl.css`	238 B
`build/block-library/blocks/site-logo/style.css`	238 B
`build/block-library/blocks/site-tagline/editor-rtl.css`	129 B
`build/block-library/blocks/site-tagline/editor.css`	129 B
`build/block-library/blocks/site-title/editor-rtl.css`	155 B
`build/block-library/blocks/site-title/editor.css`	155 B
`build/block-library/blocks/site-title/style-rtl.css`	101 B
`build/block-library/blocks/site-title/style.css`	101 B
`build/block-library/blocks/social-link/editor-rtl.css`	219 B
`build/block-library/blocks/social-link/editor.css`	219 B
`build/block-library/blocks/social-links/editor-rtl.css`	709 B
`build/block-library/blocks/social-links/editor.css`	708 B
`build/block-library/blocks/social-links/style-rtl.css`	1.43 kB
`build/block-library/blocks/social-links/style.css`	1.43 kB
`build/block-library/blocks/spacer/editor-rtl.css`	372 B
`build/block-library/blocks/spacer/editor.css`	372 B
`build/block-library/blocks/spacer/style-rtl.css`	96 B
`build/block-library/blocks/spacer/style.css`	96 B
`build/block-library/blocks/table/editor-rtl.css`	491 B
`build/block-library/blocks/table/editor.css`	491 B
`build/block-library/blocks/table/style-rtl.css`	670 B
`build/block-library/blocks/table/style.css`	669 B
`build/block-library/blocks/table/theme-rtl.css`	220 B
`build/block-library/blocks/table/theme.css`	220 B
`build/block-library/blocks/tag-cloud/style-rtl.css`	287 B
`build/block-library/blocks/tag-cloud/style.css`	288 B
`build/block-library/blocks/template-part/editor-rtl.css`	436 B
`build/block-library/blocks/template-part/editor.css`	436 B
`build/block-library/blocks/template-part/theme-rtl.css`	139 B
`build/block-library/blocks/template-part/theme.css`	139 B
`build/block-library/blocks/text-columns/editor-rtl.css`	135 B
`build/block-library/blocks/text-columns/editor.css`	135 B
`build/block-library/blocks/text-columns/style-rtl.css`	198 B
`build/block-library/blocks/text-columns/style.css`	198 B
`build/block-library/blocks/verse/style-rtl.css`	130 B
`build/block-library/blocks/verse/style.css`	130 B
`build/block-library/blocks/video/editor-rtl.css`	720 B
`build/block-library/blocks/video/editor.css`	723 B
`build/block-library/blocks/video/style-rtl.css`	218 B
`build/block-library/blocks/video/style.css`	218 B
`build/block-library/blocks/video/theme-rtl.css`	171 B
`build/block-library/blocks/video/theme.css`	171 B
`build/block-library/classic-rtl.css`	193 B
`build/block-library/classic.css`	193 B
`build/block-library/common-rtl.css`	1.05 kB
`build/block-library/common.css`	1.05 kB
`build/block-library/editor-elements-rtl.css`	126 B
`build/block-library/editor-elements.css`	126 B
`build/block-library/editor-rtl.css`	11.7 kB
`build/block-library/editor.css`	11.7 kB
`build/block-library/elements-rtl.css`	105 B
`build/block-library/elements.css`	105 B
`build/block-library/reset-rtl.css`	514 B
`build/block-library/reset.css`	514 B
`build/block-library/style-rtl.css`	12.4 kB
`build/block-library/style.css`	12.4 kB
`build/block-library/theme-rtl.css`	749 B
`build/block-library/theme.css`	753 B
`build/block-serialization-default-parser/index.min.js`	1.13 kB
`build/block-serialization-spec-parser/index.min.js`	2.83 kB
`build/components/index.min.js`	204 kB
`build/components/style-rtl.css`	11.7 kB
`build/components/style.css`	11.7 kB
`build/compose/index.min.js`	12.3 kB
`build/core-data/index.min.js`	15.9 kB
`build/customize-widgets/index.min.js`	11.6 kB
`build/customize-widgets/style-rtl.css`	1.41 kB
`build/customize-widgets/style.css`	1.41 kB
`build/data-controls/index.min.js`	663 B
`build/data/index.min.js`	8.15 kB
`build/date/index.min.js`	32.1 kB
`build/deprecated/index.min.js`	518 B
`build/dom-ready/index.min.js`	336 B
`build/dom/index.min.js`	4.74 kB
`build/edit-navigation/index.min.js`	16.2 kB
`build/edit-navigation/style-rtl.css`	4.12 kB
`build/edit-navigation/style.css`	4.13 kB
`build/edit-post/classic-rtl.css`	569 B
`build/edit-post/classic.css`	570 B
`build/edit-post/index.min.js`	34.5 kB
`build/edit-post/style-rtl.css`	7.45 kB
`build/edit-post/style.css`	7.44 kB
`build/edit-site/index.min.js`	62.6 kB
`build/edit-site/style-rtl.css`	8.74 kB
`build/edit-site/style.css`	8.74 kB
`build/edit-widgets/index.min.js`	16.7 kB
`build/edit-widgets/style-rtl.css`	4.48 kB
`build/edit-widgets/style.css`	4.48 kB
`build/editor/index.min.js`	44 kB
`build/editor/style-rtl.css`	3.69 kB
`build/editor/style.css`	3.68 kB
`build/element/index.min.js`	4.72 kB
`build/escape-html/index.min.js`	548 B
`build/experiments/index.min.js`	882 B
`build/format-library/index.min.js`	6.96 kB
`build/format-library/style-rtl.css`	596 B
`build/format-library/style.css`	596 B
`build/hooks/index.min.js`	1.66 kB
`build/html-entities/index.min.js`	454 B
`build/i18n/index.min.js`	3.79 kB
`build/is-shallow-equal/index.min.js`	535 B
`build/keyboard-shortcuts/index.min.js`	1.79 kB
`build/keycodes/index.min.js`	1.86 kB
`build/list-reusable-blocks/index.min.js`	2.13 kB
`build/list-reusable-blocks/style-rtl.css`	858 B
`build/list-reusable-blocks/style.css`	857 B
`build/media-utils/index.min.js`	2.94 kB
`build/notices/index.min.js`	977 B
`build/nux/index.min.js`	2.07 kB
`build/nux/style-rtl.css`	772 B
`build/nux/style.css`	768 B
`build/plugins/index.min.js`	1.95 kB
`build/preferences-persistence/index.min.js`	2.23 kB
`build/preferences/index.min.js`	1.35 kB
`build/primitives/index.min.js`	960 B
`build/priority-queue/index.min.js`	1.59 kB
`build/react-i18n/index.min.js`	702 B
`build/react-refresh-entry/index.min.js`	8.44 kB
`build/react-refresh-runtime/index.min.js`	7.31 kB
`build/redux-routine/index.min.js`	2.75 kB
`build/reusable-blocks/index.min.js`	2.26 kB
`build/reusable-blocks/style-rtl.css`	281 B
`build/reusable-blocks/style.css`	281 B
`build/rich-text/index.min.js`	10.7 kB
`build/server-side-render/index.min.js`	2.19 kB
`build/shortcode/index.min.js`	1.52 kB
`build/style-engine/index.min.js`	1.51 kB
`build/token-list/index.min.js`	650 B
`build/url/index.min.js`	3.7 kB
`build/vendors/inert-polyfill.min.js`	2.48 kB
`build/vendors/react-dom.min.js`	41.8 kB
`build/vendors/react.min.js`	4.02 kB
`build/viewport/index.min.js`	1.09 kB
`build/warning/index.min.js`	280 B
`build/widgets/index.min.js`	7.23 kB
`build/widgets/style-rtl.css`	1.21 kB
`build/widgets/style.css`	1.21 kB
`build/wordcount/index.min.js`	1.06 kB

_{compressed-size-action}

ellatrix · 2020-11-25T14:02:05Z

Asking wider feedback because it adds a small piece to the existing block API.

youknowriad · 2020-11-25T14:02:32Z

packages/block-library/src/html/block.json

@@ -5,7 +5,7 @@
 	"attributes": {
 		"content": {
 			"type": "string",
-			"source": "html"
+			"source": "raw"


I'm pretty sure there's already a PR that does the same thing with lengthy discussion from @aduth

I didn't find it :(

Found it #25120. I was actually just talking with @getdave about this to try to find a solution.

I found it, this is the one I was talking about #10551

@youknowriad Nice, thanks, will read quick.

So the difference with that PR is that it still uses the html type instead of a new raw type, but the solution is essentially the same.

yes, and I believe @aduth's point is that it's the html type itself that is "entirely" broken and that we shouldn't fix just the "HTML block" case.

Although...if this fixes the immediate experience for users could we land this PR and then write a followup Issue detailing the problems with html and how we might look to resolve them long term? Not being able to put HTML in the HTML block feels like a bit of a poor UX 😄

I'm not sure we should add a new API (new "source" type)

getdave

I tested this using the examples provided by the contributor in the original Issue.

This PR does seem to resolve the issue in that chars such as < are not converted to entities. Here's the comparison from my testing.

Master	This PR

As you can see on master we get errors because the generated save content doesn't match the post content. On this PR however, there are no errors and everything is great 🎉

@ellatrix I'll defer to yourself and @youknowriad regarding whether to land this or to try and ship a more comprehensive fix to HTML in general.

One thing - we should probably add some tests to cover this scenario.

ellatrix · 2020-11-28T12:01:07Z

Definitely needs some tests. This is just a proposal so far. I'd like some feedback from at least some more people like @youknowriad @mtias @mcsf.

I don't see a good alternative here. We could try to fix the html sourcing (which relies on Element.innerHTML right now), but that's not easy to do (I think) and it would be a breaking change on an existing API vs an addition to the API. The simplest way is to allow a block to access the raw block content.

getdave · 2020-11-30T16:11:24Z

We could try to fix the html sourcing (which relies on Element.innerHTML right now)

I believe this is why the entities are encoded. See this note from innerHTML from MDN:

Note: If a <div>, <span>, or <noembed> node has a child text node that 
includes the characters (&), (<), or (>), innerHTML returns these characters 
as the HTML entities "&amp;", "&lt;" and "&gt;" respectively. Use 
Node.textContent to get a raw copy of these text nodes' contents.

ellatrix · 2020-12-01T13:27:00Z

Yes, because of this, the HTML you SET with Element.innerHTML may not be equal to the HTML you GET with Element.innerHTML.

One alternative solution here is to always encode characters just like innerHTML does. I think it's just the opening bracket that's causing the problem.

getdave · 2020-12-08T10:13:08Z

One alternative solution here is to always encode characters just like innerHTML does.

This sounds similar to what my original PR did but I was decoding.

Correct me if I'm wrong but the key would be to ensure that the Gutenberg annotations are 1:1 with whatever the user types into the Block? If the database needs to have that data encoded for persistence then that's ok so long as when it's parsed back it returns back 1:1 with what the user originally entered?

aristath · 2021-03-05T10:48:40Z

The parser has changed since this PR was created, do we still want to do this? Or should we close this?

ellatrix · 2022-07-18T16:03:22Z

@aristath Unfortunately, this is still a problem.

Would be good to agree on the solution.

getdave · 2022-07-21T09:08:23Z

I've rebased the branch. As @aristath said the parser changed and the original change was in a file that no longer existed.

I've moved to what seems like the new location and reinstated @ellatrix's original change.

getdave · 2022-07-21T09:11:15Z

So to recap...

The problem is that when HTML code it put into the HTML block certain characters are encoded into HTML entities.

The reason for this is that the html source matcher used by the core/html block relies on Element.innerHTML which will encode certain characters.

@ellatrix's proposed fix (here) is to introduce a new raw source type which simply returns the block content as is. This avoids/bypasses the innerHTML issue and we get a 1:1 with what the user entered.

@youknowriad was not in favour of introducing a new API however. He referenced a previous PR by @johngodley where they were looking to address a similar issue by either

fixing upstream in hpq
bypassing hpq entirely based on the presence of a selector in the attribute schema

The outcome of #10551 is that neither @youknowriad nor @aduth were happy to commit to merging the PR. However, the goal of that PR seems to have been around normalizing the HTML whereas our problem is relating to the HTML being fundamentally transformed.

There is also #25120 which attempts to "fix" the html parser at the point of Element.innerHTML by converting the HTML entities back into their string form. The description of that PR also includes a lot of detail about why this bug occurs so it's worth reading for context.

dmsnell

This won't impact any other blocks besides the HTML block. We should probably try and keep an eye out in case anyone else wants to use raw for another block. As discussed in person, I think there are still peculiarities about what the role of the HTML block even is or what proper behavior here is.

So practically I think this is a fair tactical change and addresses the issue at hand without adding much baggage. If we ever get around to reworking the HTML block we can change this stuff here, or if we overhaul the attribute sourcing and validation.

1 > 0

getdave · 2022-08-22T13:39:14Z

I'm in favour of merging this one. The impact seems limited to the HTML block and it is "opt in" rather than default behaviour.

Current unit test failures seem related and will need to be resolved prior to merge.

In #39424 when we added an optimization to only parse a block's innerHTML once we also changed the behavior that the innerHTML propety represented the raw HTML loaded by the parser. Instead, what we have since that change is the DOM of the parsed HTML. In this patch we're adding yet-another parameter to the bag of arguments in `getBlockAttribute()` so that the new `raw` type can read and reproduce that original source, e.g. when reading the string `1 < 0` the parsed value's `innerHTML` will be `1 < 0` even though the block's raw content was `1 < 0`.

getdave · 2022-12-07T10:37:14Z

I'm very pleased to see this get merged 🥳

ellatrix requested review from ajitbohra and talldan as code owners November 25, 2020 11:57

ellatrix force-pushed the try/html-fix-parse branch from 9747a02 to 19ee60a Compare November 25, 2020 11:58

ellatrix requested a review from getdave November 25, 2020 12:02

ellatrix added [Type] Bug An existing feature does not function as intended [Block] HTML Affects the the HTML Block labels Nov 25, 2020

ellatrix requested a review from youknowriad November 25, 2020 12:03

ellatrix mentioned this pull request Nov 25, 2020

Code block: paste plain text #27236

Merged

6 tasks

ellatrix added the [Feature] Block API API that allows to express the block paradigm. label Nov 25, 2020

ellatrix requested a review from a team November 25, 2020 14:01

youknowriad reviewed Nov 25, 2020

View reviewed changes

getdave approved these changes Nov 27, 2020

View reviewed changes

Base automatically changed from master to trunk March 1, 2021 15:44

youknowriad mentioned this pull request Mar 8, 2021

Improve loading method for block Javascript #29606

Open

getdave force-pushed the try/html-fix-parse branch from 19ee60a to e265ebf Compare July 21, 2022 09:07

dmsnell approved these changes Aug 12, 2022

View reviewed changes

ellatrix force-pushed the try/html-fix-parse branch from b04b62e to d87cfe5 Compare August 12, 2022 22:24

getdave added 2 commits December 6, 2022 15:14

HTML block: fix parsing

ff91cda

Reinstate the change in the new file

b513cb7

dmsnell and others added 2 commits December 6, 2022 15:14

Add e2e test

bc59be6

ellatrix force-pushed the try/html-fix-parse branch from d87cfe5 to bc59be6 Compare December 6, 2022 13:14

Add to schema

67c96a2

ellatrix requested a review from ajlende as a code owner December 6, 2022 14:25

ellatrix merged commit 97eb4a4 into trunk Dec 7, 2022

ellatrix deleted the try/html-fix-parse branch December 7, 2022 10:19

github-actions bot added this to the Gutenberg 14.8 milestone Dec 7, 2022

This was referenced Dec 7, 2022

HTML block content gets transformed into entities #24282

Closed

Markdown substitutes > for > inside codefence Automattic/wp-calypso#51666

Open

mpkelly pushed a commit to mpkelly/gutenberg that referenced this pull request Dec 7, 2022

HTML block: fix parsing (WordPress#27268)

f7addb9

dmsnell mentioned this pull request Mar 6, 2023

feature: add TypeScript types to the blocks package #48604

Open

ellatrix mentioned this pull request Nov 9, 2023

Missing block: use raw source for originalContent #56014

Merged

kuuak mentioned this pull request Aug 22, 2024

fix: 🐛 core/html missing content attribute pristas-peter/wp-graphql-gutenberg#208

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HTML block: fix parsing #27268

HTML block: fix parsing #27268

ellatrix commented Nov 25, 2020 •

edited

Loading

github-actions bot commented Nov 25, 2020 •

edited

Loading

ellatrix commented Nov 25, 2020 •

edited

Loading

youknowriad Nov 25, 2020

ellatrix Nov 25, 2020

youknowriad Nov 25, 2020

ellatrix Nov 25, 2020

youknowriad Nov 25, 2020

ellatrix Nov 25, 2020

ellatrix Nov 25, 2020

youknowriad Nov 25, 2020 •

edited

Loading

getdave Nov 27, 2020

youknowriad Nov 27, 2020

getdave left a comment •

edited

Loading

ellatrix commented Nov 28, 2020

getdave commented Nov 30, 2020

ellatrix commented Dec 1, 2020

getdave commented Dec 8, 2020

aristath commented Mar 5, 2021

ellatrix commented Jul 18, 2022

getdave commented Jul 21, 2022

getdave commented Jul 21, 2022 •

edited

Loading

dmsnell left a comment

getdave commented Aug 22, 2022

getdave commented Dec 7, 2022

HTML block: fix parsing #27268

HTML block: fix parsing #27268

Conversation

ellatrix commented Nov 25, 2020 • edited Loading

Description

How has this been tested?

Screenshots

Types of changes

Checklist:

github-actions bot commented Nov 25, 2020 • edited Loading

ellatrix commented Nov 25, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

youknowriad Nov 25, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

getdave left a comment • edited Loading

Choose a reason for hiding this comment

ellatrix commented Nov 28, 2020

getdave commented Nov 30, 2020

ellatrix commented Dec 1, 2020

getdave commented Dec 8, 2020

aristath commented Mar 5, 2021

ellatrix commented Jul 18, 2022

getdave commented Jul 21, 2022

getdave commented Jul 21, 2022 • edited Loading

dmsnell left a comment

Choose a reason for hiding this comment

getdave commented Aug 22, 2022

getdave commented Dec 7, 2022

ellatrix commented Nov 25, 2020 •

edited

Loading

github-actions bot commented Nov 25, 2020 •

edited

Loading

ellatrix commented Nov 25, 2020 •

edited

Loading

youknowriad Nov 25, 2020 •

edited

Loading

getdave left a comment •

edited

Loading

getdave commented Jul 21, 2022 •

edited

Loading