HTML Character Entity Encoding Mercifully short character code overview. An apparently simple subject which turns out to be brutally complicated - well to our modest brains. This stuff (and more) is contained in Chapter 24 of the current HTML 4.01 spec (the last ever version of HTML - its all XHTML now) which you can get (and lots of other great stuff) from the W3C site including their fantastic page validation services. If you find it useful - print it, stick it on your wall - and save our bandwidth!! The reason you can never find this stuff is because the most popular values confusingly belong to a number of character sets. There are two encoding forms supported by most browsers a character name format e.g. © = © a numeric format e.g. © = © Both formats are shown in the tables below. If you are using anything obscure or have to deal with old browsers (< v3 ish) you should stick to the numeric format only. For convenience we show some popular values (from a number of character sets) in a single table then the full Latin-1 (ISO8859-1) table. If you are into the greek alphabet for obscure mathematical symbols - you are out of luck. Go the the W3C click on HTML then go for Chapter 24 of the latest spec - they are all there. This lot should all display on most browsers. However in the tables below the column DN shows the result of using the name format encoding in your browser and the column D# shows the result using the numeric format. If there is no character in one or other of these column (except the space, dummy!) then your browser does not handle that format correctly. Commonly used character entity references DN Name format
D# Numeric format
Description
Char set
non-breaking space
ISO8559-1
©
©
©
©
copyright sign
ISO8559-1
®
®
®
®
registered trade mark sign
ISO8559-1
²
²
²
²
superscript 2 (squared)
ISO8559-1
³
³
³
³
superscript 3 (cubed)
ISO8559-1
"
"
"
"
quotation mark
ISO10646
&
&
&
&
ampersand sign
ISO10646
<
<
<
<
less than sign
ISO10646
>
>
>
>
greater than sign
ISO10646
–
–
–
–
en dash
ISO10646
—
—
—
—
em dash
ISO10646
‘
‘
‘
‘
left single quote
ISO10646
1 of 5
’
’
’
’
right single quote, apostrophe
ISO10646
“
“
“
“
left double quotation mark
ISO10646
”
”
”
”
right double quotation mark
ISO10646
•
•
•
•
small black circle, bullet
ISO10646
†
†
†
†
dagger sign
ISO10646
‡
‡
‡
‡
double dagger sign
ISO10646
′
′
′
′
prime = minutes = feet
ISO10646
″
″
″
″
double prime = seconds = inches
ISO10646
‹
‹
‹
‹
single left pointing angle quote
ISO10646
›
›
›
›
single right pointing angle quote
ISO10646
€
€
€
€
euro sign
ISO10646
™
™
™
™
Registered Trademark sign
ISO10646
˜
˜
˜
˜
tilde sign
ISO10646
ˆ
ˆ
ˆ
ˆ
circumflex (or caret) sign
ISO10646
♠
♠
♠
♠
black spade suit
ISO10646
♣
♣
♣
♣
black clubs suit
ISO10646
♥
♥
♥
♥
black heart suit
ISO10646
♦
♦
♦
♦
black diamonds suit
ISO10646
◊
◊
◊
◊
lozenge
ISO10646
←
←
←
←
left arrow
ISO10646
→
→
→
→
right arrow
ISO10646
↑
↑
↑
↑
up arrow
ISO10646
↓
↓
↓
↓
down arrow
ISO10646
↔
↔
↔
↔
right-left arrow
ISO10646
¬
¬
¬
¬
NOT sign
ISO8859-1
Here is the full list of character entities for accented characters and miscellaneous symbols in the Latin-1 (ISO 8859-1) character set. Values in the range 00 to 7F ( the ASCII/IA5 set in this table) are typically used as raw characters with the exceptions of the special characters used for HTML encoding as shown above (<, >, & and ") but they can be represented as HTML entities by using the ASCII/IA5 decimal number, for example ',' (comma) has a decimal vale of 44 (in the ASCII/IA5 table and may be represented as a HTML entity by encoding as ,. Latin-1 (ISO8859-1) character entity references DN Name format
D# Numeric format
Description
|
|
|
vertical bar
non-breaking space
¡
¡
¡
¡
inverted exclamation mark
¢
¢
¢
¢
cent sign
£
£
£
£
pound sign
2 of 5
¤
¤
¤
¤
currency sign
¥
¥
¥
¥
yen sign = yuan sign
¦
¦
¦
¦
broken vertical bar
§
§
§
§
section sign
¨
¨
¨
¨
diaeresis = spacing diaeresis
©
©
©
©
copyright sign
ª
ª
ª
ª
feminine ordinal indicator
«
«
«
«
left-pointing double angle quotes (left pointing quillemet)
¬
¬
¬
¬
not sign
soft hyphen
®
®
®
®
registered sign
‾
¯
‾
¯
macron = spacing macron
°
°
°
°
degree sign
±
±
±
±
plus-minus sign
²
²
²
²
superscript two (squared)
³
³
³
³
superscript three (cubed)
´
´
´
´
acute accent
µ
µ
µ
µ
micro sign
¶
¶
¶
¶
paragraph sign = pilcrow sign
·
·
middle dot = georgian comma
¸
¸
¸
¸
cedilla sign
¹
¹
¹
¹
superscript one
º
º
º
º
masculine ordinal indicator
»
»
»
»
right-pointing double angle quotes (right pointing quillemet)
¼
¼
¼
¼
vulgar fraction one quarter
½
½
½
½
vulgar fraction one half
¾
¾
¾
¾
vulgar fraction three quarters
¿
¿
¿
¿
inverted question mark
À
À
À
À
latin capital A with grave accent
Á
Á
Á
Á
latin capital A with acute accent
Â
Â
Â
Â
latin capital A with circumflex
Ã
Ã
Ã
Ã
latin capital A with tilde
Ä
Ä
Ä
Ä
latin capital A with diaeresis
Å
Å
Å
Å
latin capital A with ring
Æ
Æ
Æ
Æ
latin capital AE
Ç
Ç
Ç
Ç
latin capital C with cedilla
È
È
È
È
latin capital E with grave accent
É
É
É
É
latin capital E with acute accent
3 of 5
Ê
Ê
Ê
Ê
latin capital E with circumflex
Ë
Ë
Ë
Ë
latin capital E with diaeresis
Ì
Ì
Ì
Ì
latin capital I with grave accent
Í
Í
Í
Í
latin capital I with acute accent
Î
Î
Î
Î
latin capital I with circumflex
Ï
Ï
Ï
Ï
latin capital I with diaeresis
Ð
Ð
Ð
Ð
latin capital letter ETH
Ñ
Ñ
Ñ
Ñ
latin capital N with tilde
Ò
Ò
Ò
Ò
latin capital O with grave accent
Ó
Ó
Ó
Ó
latin capital O with acute accent
Ô
Ô
Ô
Ô
latin capital O with circumflex
Õ
Õ
Õ
Õ
latin capital O with tilde
Ö
Ö
Ö
Ö
latin capital O with diaeresis
×
×
×
×
multiplication sign
Ø
Ø
Ø
Ø
latin capital O with stroke
Ù
Ù
Ù
Ù
latin capital U with grave accent
Ú
Ú
Ú
Ú
latin capital U with acute accent
Û
Û
Û
Û
latin capital U with circumflex
Ü
&Uml;
Ü
Ü
latin capital U with diaeresis
Ý
Ý
Ý
Ý
latin capital Y with acute accent
Þ
Þ
Þ
Þ
latin capital THORN
ß
ß
ß
ß
latin small letter sharp s
à
à
à
à
latin small letter a with grave accent
á
á
á
á
latin small letter a with acute accent
â
â
â
â
latin small letter a with circumflex
ã
ã
ã
ã
latin small letter a with tilde
ä
ä
ä
ä
latin small letter a with diaeresis
å
å
å
å
latin small letter a with ring
æ
æ
æ
æ
latin small letter ae
ç
ç
ç
ç
latin small letter c with cedilla
è
è
è
è
latin small letter e with grave accent
é
é
é
é
latin small letter e with acute accent
ê
ê
ê
ê
latin small letter e with circumflex
ë
ë
ë
ë
latin small letter e with diaeresis
ì
ì
ì
ì
latin small letter i with grave accent
í
í
í
í
latin small letter i with acute accent
î
î
î
î
latin small letter i with circumflex
ï
ï
ï
ï
latin small letter i with diaeresis
ð
ð
ð
ð
latin small letter eth
4 of 5
ñ
ñ
ñ
ñ
latin small letter n with tilde
ò
ò
ò
ò
latin small letter 0 with grave accent
ó
ó
ó
ó
latin small letter 0 with acute accent
ô
ô
ô
ô
latin small letter 0 with circumflex
õ
õ
õ
õ
latin small letter 0 with tilde
ö
ö
ö
ö
latin small letter 0 with diaeresis
÷
÷
÷
÷
division sign
ø
ø
ø
ø
latin small letter 0 with stroke
ù
ù
ù
ù
latin small letter u with grave accent
ú
ú
ú
ú
latin small letter u with acute accent
û
û
û
û
latin small letter u with circumflex
ü
ü
ü
ü
latin small letter u with diareresis
ý
ý
ý
ý
latin small letter y with acute accent
þ
þ
þ
þ
latin small letter thorn
ÿ
ÿ
ÿ
ÿ
latin small letter y with diaeresis
Problems, comments, suggestions, corrections (including broken links) or something to add? Please take the time from a busy life to 'mail us' (at top of screen), the webmaster (below) or info-support at zytrax. You will have a warm inner glow for the rest of the day. Copyright © 1994 - 2008 ZyTrax, Inc. All rights reserved. Legal and Privacy
site by zytrax
5 of 5
web-master at zytrax Page modified: November 03 2007.