add \ae, \AE, \oe, \OE, \o, \O, \ss with unicode support #1030

kevinbarabash · 2017-12-22T19:10:22Z

edemaine

Great, I didn't realize that these are already in the fonts! (I confirmed that they render using Computer Modern fonts, and look just like LaTeX's output.)

LaTeX throws warnings (LaTeX Warning: Command \ae invalid in math mode on input line 7.) if we use these commands in math mode, but technically it does support them (generating textords). Should we? This PR currently only supports the characters in text mode.

Technically, some lines in fontMetrics.js such as this one should be removed. You can also feel free to leave this to #992 if you'd like.

edemaine · 2017-12-22T19:19:21Z

test/katex-spec.js

+    it("should render ligature commands like their unicode characters", () => {
+        const commands = getBuilt("\\text{\\ae\\AE\\oe\\OE\\o\\O\\ss}");
+        const unicode = getBuilt("\\text{æÆœŒøØß}");
+        expect(commands).toEqual(unicode);


You can use toParseLike instead of this: expect("\\text{...}").toParseLike("\\text{...}")

I tried that, but they don't actually parse the same.

Ah, I see, getBuilt renders to HTML. I didn't realize that -- it suggests some other tests we could do!

kevinbarabash · 2017-12-22T19:27:01Z

Technically, some lines in fontMetrics.js such as this one should be removed. You can also feel free to leave this to #992 if you'd like.

I can remove those.

…aracters in text mode

kevinbarabash · 2017-12-22T19:30:08Z

src/fontMetrics.js

@@ -103,7 +103,6 @@ const extraCharacterMap = {
    'Ã': 'A',
    'Ä': 'A',
    'Å': 'A',
-    'Æ': 'A',


There weren't any entries for Œ or œ.

edemaine

LGTM!

edemaine · 2017-12-22T20:17:04Z

@kevinbarabash What are your thoughts about supporting these commands in ~~text~~math mode?

kevinbarabash · 2017-12-22T21:21:42Z

@edemaine I assume you mean "math mode". I didn't realize that they appear in both. I'll put up another PR for that.

edemaine · 2017-12-22T21:34:40Z

@kevinbarabash "LaTeX throws warnings (LaTeX Warning: Command \ae invalid in math mode on input line 7.) if we use these commands in math mode, but technically it does support them (generating textords)."

kevinbarabash · 2017-12-23T00:23:27Z

@edemaine I was testing using http://quicklatex.com. I tried it again with pdflatex locally and am seeing the same issue you are. In that case let's not bother. The change was non trivial to b/c while we have the glyphs in KaTeX_Main-Italic they need to either be added to KaTeX_Math-Italic or we would've had to special case things in the code.

edemaine · 2017-12-23T22:06:17Z

If they're specified as textords, I thought they'd be rendered in text mode using text fonts. I could be wrong though. (That is also what LaTeX does: it renders them in Roman font by default.)

kevinbarabash · 2017-12-24T00:02:58Z

For text mode they are textords and they do get rendered using KaTeX_Main-* which are the text fonts.

* Unicode accents * Lexer now looks for combining dicritical marks and adds them to the same character * Parser's `parseSymbol` now recognizes both combined and uncombined forms of Unicode accents, and builds accent objects just like the accent functions * Added CJK support to math mode (not just text mode) * Add invalid combining character test * Add MathML test * Add weak support for other Latin-1 characters This maintains backwards compatibility, but it uses the wrong font. There's a TODO to fix this later. Also refactor symbol code to use for..of * Update Unicode screenshot * Remove dot from accented i and j (in math mode) Also add dotless Unicode characters to support some accented i's and j's * Fix \imath, \jmath, \pounds, and more tests * Switch from for..of to .split().forEach() Save around 800 bytes in minified code * Fix split * normalize() detection * Convert back to vanilla for loops * Fix merge * Move normalize dependency to unicodeMake.js * Make unicodeSymbols into a lookup table instead of macros This is important for multi-accented characters. * Add comments about when to run * Move symbols definition into unicodeMake/Symbols.js * Remove CJK support in text mode * Add missing semicolon * Refactor unicodeAccents to its own file * Dotless i/j support in text mode * Remove excess character mappings * Fix Åå in math mode (still via Times) * Update to support #1030 * Add accented Greek letter support (for supported Greek symbols) * Update screenshot * remove Æ, æ, Ø, ø, and ß from math mode test

kevinbarabash force-pushed the ligatures branch from 6f7500f to 0bac9eb Compare December 22, 2017 19:13

kevinbarabash mentioned this pull request Dec 22, 2017

\ddot\imath is too wide #1028

Closed

kevinbarabash requested a review from edemaine December 22, 2017 19:21

edemaine reviewed Dec 22, 2017

View reviewed changes

kevinbarabash mentioned this pull request Dec 22, 2017

Unicode accents #992

Merged

add \ae, \AE, \oe, \OE, \o, \O, \ss with unicode support for those ch…

0960a22

…aracters in text mode

kevinbarabash force-pushed the ligatures branch from 0bac9eb to 0960a22 Compare December 22, 2017 19:29

kevinbarabash commented Dec 22, 2017

View reviewed changes

edemaine approved these changes Dec 22, 2017

View reviewed changes

kevinbarabash merged commit 522b238 into master Dec 22, 2017

edemaine added a commit to edemaine/KaTeX that referenced this pull request Dec 22, 2017

Update to support KaTeX#1030

d36b10b

edemaine added a commit to edemaine/KaTeX that referenced this pull request Dec 28, 2017

Update to support KaTeX#1030

45a901f

kevinbarabash deleted the ligatures branch December 29, 2017 17:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add \ae, \AE, \oe, \OE, \o, \O, \ss with unicode support #1030

add \ae, \AE, \oe, \OE, \o, \O, \ss with unicode support #1030

kevinbarabash commented Dec 22, 2017

edemaine left a comment •

edited

Loading

edemaine Dec 22, 2017

kevinbarabash Dec 22, 2017

edemaine Dec 22, 2017

kevinbarabash commented Dec 22, 2017

kevinbarabash Dec 22, 2017

edemaine left a comment

edemaine commented Dec 22, 2017 •

edited

Loading

kevinbarabash commented Dec 22, 2017

edemaine commented Dec 22, 2017

kevinbarabash commented Dec 23, 2017

edemaine commented Dec 23, 2017

kevinbarabash commented Dec 24, 2017

add \ae, \AE, \oe, \OE, \o, \O, \ss with unicode support #1030

add \ae, \AE, \oe, \OE, \o, \O, \ss with unicode support #1030

Conversation

kevinbarabash commented Dec 22, 2017

edemaine left a comment • edited Loading

Choose a reason for hiding this comment

edemaine Dec 22, 2017

Choose a reason for hiding this comment

kevinbarabash Dec 22, 2017

Choose a reason for hiding this comment

edemaine Dec 22, 2017

Choose a reason for hiding this comment

kevinbarabash commented Dec 22, 2017

kevinbarabash Dec 22, 2017

Choose a reason for hiding this comment

edemaine left a comment

Choose a reason for hiding this comment

edemaine commented Dec 22, 2017 • edited Loading

kevinbarabash commented Dec 22, 2017

edemaine commented Dec 22, 2017

kevinbarabash commented Dec 23, 2017

edemaine commented Dec 23, 2017

kevinbarabash commented Dec 24, 2017

edemaine left a comment •

edited

Loading

edemaine commented Dec 22, 2017 •

edited

Loading