Accented Letters In HTML, TeX, And Microsoft Word
Có thể bạn quan tâm
This page explains the patterns behind how HTML, TeX, and Microsoft Word represent accented letters. The page begins with explanations for HTML, LaTeX, and Word and concludes with a chart summarizing how accents are handled across languages.
HTML
HTML uses “entities”, escape sequences, to represent accented letters (diacritics) with only ASCII characters. For example, à is encoded as à. The general pattern for constructing an HTML entity for an accented letter is
& + letter + accent code + ;
The letters that can be accented this way are a, c, e, i, n, o, u, and y, in both their lower-case and upper-case forms.
The possible accent codes are grave, acute, circ (circumflex), tilde, and uml (umlaut).
Not all combinations of letters and accents are possible. For example, the entity à places a grave accent on the letter “a”, but there is no entity &ngrave; to put such an accent on top of a letter “n.”
There are three additional accent codes that can only be applied to a single letter. The code ring can only be applied to “a.” å produces å and Å produces Å. The cedil (cedilla) code can only be applied to “c.” ç produces ç and Ç produces Ç. Finally, slash only applies to “o.” ø produces ø and Ø produces Ø.
The æ ligature follows a pattern similar to accents: æ produces æ and Æ produces Æ.
Note that most HTML entities are not legal in XHTML. See Greek letters in HTML, XML, TeX, and Unicode for an explanation of how to denote Unicode symbols in ASCII text in XML.
TeX and LaTeX
TeX uses a consistent pattern for adding diacritical marks to letters:
\ + code + letter
The code is a single character indicating the kind of diacritical mark to add. Unlike HTML entities, any mark can be added to any letter. Letters may be upper-case or lower-case.
TeX supports 14 diacritical mark codes. The ones corresponding to HTML entities listed above are ` (grave), ' (acute), ^ (circumflex), ~ (tilde), " (umlaut), and c (cedilla).
Examples: jalapeño is written jalape\~no in TeX. garçon is written gar\ccon.
TeX uses \ae and \AE for æ and Æ, \o and \O for ø and Ø, and \aa and \AA for å and Å.
Note that the above remarks only apply to text mode. LaTeX handles accents differently in math mode.
Microsoft Word
Microsoft Word is nearly identical to HTML in the accent-letter combinations it supports, but similar to TeX in the key strokes that it uses.
To add a grave accent to a vowel in Word, hold down the control key and press ` (sometimes call the back tick or back quote symbol, often in the top left of a US keyboard) then type the letter to accent. In short, CTRL + ` is the escape sequence.
Use CTRL + ' for an acute accent, CTRL + ^ for circumflex, CTRL + SHIFT + ~ for tilde, CTRL + SHIFT + : for umlaut, and CTRL + , for cedilla.
Word uses CTRL + SHIFT + & for æ and Æ, CTRL + / for ø and Ø, and CTRL + SHIFT + @ for å and Å.
Combined chart
Accent | HTML | TeX | Word |
---|---|---|---|
grave | grave | \` | CTRL + ` |
acute | acute | \' | CTRL + ' |
circumflex | circ | \^ | CTRL + ^ |
tilde | tidle | \~ | CTRL + SHIFT + ~ |
umlaut | uml | \" | CTRL + SHIFT + : |
cedilla | cedil | \c | CTRL + , |
æ, Æ | æ, Æ | \ae, \AE | CTRL + SHIFT + & + a or A |
ø, Ø | ø, Ø | \o, \O | CTRL + / + o or O |
å, Å | å, Å | \aa, \AA | CTRL + SHIFT + @ + a or A |
See also
- Common math symbols in HTML, XML, TeX, and Unicode
- Convert between Unicode and LaTeX commands
Từ khóa » Html Accent Codes
-
Letters With Accents
-
Symbol Codes | Entity Codes For HTML - Sites At Penn State
-
Acute Accent HTML Symbol, Character And Entity Codes - Toptal
-
Html Codes For Foreign Language Accents. Keyboard Help. - Starr.Net
-
Diacritical Marks - HTML Unicode UTF-8
-
HTML Entities - W3Schools
-
HTML Codes And HTML Special Characters: The Complete List
-
How Can I Show Special Characters Like "e" With Accent Acute Over It ...
-
ALT & HTML Codes :: International Letters For PC & MAC
-
HTML Codes For French Language Characters - ThoughtCo
-
HTML Code For Common Symbols - ThoughtCo
-
Hypertext Markup Language - 2.0 - The HTML Coded Character Set
-
HTML Character Codes »