The HTML charset Attribute
The character set is defined within the <meta> tag.
Example
<meta charset=”UTF-8″> |
The HTML5 specification promotes the adoption of the UTF-8 character encoding.
UTF-8 encompasses nearly all characters and symbols found worldwide!
ASCII, the initial character encoding standard for the web, established a set of 128 distinct characters for internet usage:
The original Windows character set, ANSI (Windows-1252), comprised:
<meta charset=”Windows-1252″> |
ISO-8859-1 served as the default character set for HTML 4, offering support for 256 distinct character codes. HTML 4 also embraced UTF-8.
<meta http-equiv=”Content-Type” content=”text/html;charset=ISO-8859-1″> |
<meta charset=”ISO-8859-1″> |
<meta charset=”UTF-8″> |
The table below illustrates the distinctions among the aforementioned character sets:
Numb |
ASCII |
ANSI |
8859 |
UTF‑8 |
Description |
32 |
|
|
|
|
space |
33 |
! |
! |
! |
! |
exclamation mark |
34 |
“ |
“ |
“ |
“ |
quotation mark |
35 |
# |
# |
# |
# |
number sign |
36 |
$ |
$ |
$ |
$ |
dollar sign |
37 |
% |
% |
% |
% |
percent sign |
38 |
& |
& |
& |
& |
ampersand |
39 |
‘ |
‘ |
‘ |
‘ |
apostrophe |
40 |
( |
( |
( |
( |
left parenthesis |
41 |
) |
) |
) |
) |
right parenthesis |
42 |
* |
* |
* |
* |
asterisk |
43 |
+ |
+ |
+ |
+ |
plus sign |
44 |
, |
, |
, |
, |
comma |
45 |
– |
– |
– |
– |
hyphen-minus |
46 |
. |
. |
. |
. |
full stop |
47 |
/ |
/ |
/ |
/ |
solidus |
48 |
0 |
0 |
0 |
0 |
digit zero |
49 |
1 |
1 |
1 |
1 |
digit one |
50 |
2 |
2 |
2 |
2 |
digit two |
51 |
3 |
3 |
3 |
3 |
digit three |
52 |
4 |
4 |
4 |
4 |
digit four |
53 |
5 |
5 |
5 |
5 |
digit five |
54 |
6 |
6 |
6 |
6 |
digit six |
55 |
7 |
7 |
7 |
7 |
digit seven |
56 |
8 |
8 |
8 |
8 |
digit eight |
57 |
9 |
9 |
9 |
9 |
digit nine |
58 |
: |
: |
: |
: |
colon |
59 |
; |
; |
; |
; |
semicolon |
60 |
< |
< |
< |
< |
less than |
61 |
= |
= |
= |
= |
equals sign |
62 |
> |
> |
> |
> |
greater than |
63 |
? |
? |
? |
? |
question mark |
64 |
@ |
@ |
@ |
@ |
commercial at |
65 |
A |
A |
A |
A |
Latin A |
66 |
B |
B |
B |
B |
Latin B |
67 |
C |
C |
C |
C |
Latin C |
68 |
D |
D |
D |
D |
Latin D |
69 |
E |
E |
E |
E |
Latin E |
70 |
F |
F |
F |
F |
Latin F |
71 |
G |
G |
G |
G |
Latin G |
72 |
H |
H |
H |
H |
Latin H |
73 |
I |
I |
I |
I |
Latin I |
74 |
J |
J |
J |
J |
Latin J |
75 |
K |
K |
K |
K |
Latin K |
76 |
L |
L |
L |
L |
Latin L |
77 |
M |
M |
M |
M |
Latin M |
78 |
N |
N |
N |
N |
Latin N |
79 |
O |
O |
O |
O |
Latin O |
80 |
P |
P |
P |
P |
Latin P |
81 |
Q |
Q |
Q |
Q |
Latin Q |
82 |
R |
R |
R |
R |
Latin R |
83 |
S |
S |
S |
S |
Latin S |
84 |
T |
T |
T |
T |
Latin T |
85 |
U |
U |
U |
U |
Latin U |
86 |
V |
V |
V |
V |
Latin V |
87 |
W |
W |
W |
W |
Latin W |
88 |
X |
X |
X |
X |
Latin X |
89 |
Y |
Y |
Y |
Y |
Latin Y |
90 |
Z |
Z |
Z |
Z |
Latin Z |
91 |
[ |
[ |
[ |
[ |
left square bracket |
92 |
\ |
\ |
\ |
\ |
reverse solidus |
93 |
] |
] |
] |
] |
right square bracket |
94 |
^ |
^ |
^ |
^ |
circumflex accent |
95 |
_ |
_ |
_ |
_ |
low line |
96 |
` |
` |
` |
` |
grave accent |
97 |
a |
a |
a |
a |
Latin small a |
98 |
b |
b |
b |
b |
Latin small b |
99 |
c |
c |
c |
c |
Latin small c |
100 |
d |
d |
d |
d |
Latin small d |
101 |
e |
e |
e |
e |
Latin small e |
102 |
f |
f |
f |
f |
Latin small f |
103 |
g |
g |
g |
g |
Latin small g |
104 |
h |
h |
h |
h |
Latin small h |
105 |
i |
i |
i |
i |
Latin small i |
106 |
j |
j |
j |
j |
Latin small j |
107 |
k |
k |
k |
k |
Latin small k |
108 |
l |
l |
l |
l |
Latin small l |
109 |
m |
m |
m |
m |
Latin small m |
110 |
n |
n |
n |
n |
Latin small n |
111 |
o |
o |
o |
o |
Latin small o |
112 |
p |
p |
p |
p |
Latin small p |
113 |
q |
q |
q |
q |
Latin small q |
114 |
r |
r |
r |
r |
Latin small r |
115 |
s |
s |
s |
s |
Latin small s |
116 |
t |
t |
t |
t |
Latin small t |
117 |
u |
u |
u |
u |
Latin small u |
118 |
v |
v |
v |
v |
Latin small v |
119 |
w |
w |
w |
w |
Latin small w |
120 |
x |
x |
x |
x |
Latin small x |
121 |
y |
y |
y |
y |
Latin small y |
122 |
z |
z |
z |
z |
Latin small z |
123 |
{ |
{ |
{ |
{ |
left curly bracket |
124 |
| |
| |
| |
| |
vertical line |
125 |
} |
} |
} |
} |
right curly bracket |
126 |
~ |
~ |
~ |
~ |
tilde |
127 |
DEL |
|
|
|
|
128 |
|
€ |
|
|
euro sign |
129 |
|
|
|
|
NOT USED |
130 |
|
‚ |
|
|
single low-9 quotation mark |
131 |
|
ƒ |
|
|
Latin small f with hook |
132 |
|
„ |
|
|
double low-9 quotation mark |
133 |
|
… |
|
|
horizontal ellipsis |
134 |
|
† |
|
|
dagger |
135 |
|
‡ |
|
|
double dagger |
136 |
|
ˆ |
|
|
modifier letter circumflex accent |
137 |
|
‰ |
|
|
per mille sign |
138 |
|
Š |
|
|
Latin S with caron |
139 |
|
‹ |
|
|
single left-pointing angle quotation mark |
140 |
|
Œ |
|
|
Latin capital ligature OE |
141 |
|
|
|
|
NOT USED |
142 |
|
Ž |
|
|
Latin Z with caron |
143 |
|
|
|
|
NOT USED |
144 |
|
|
|
|
NOT USED |
145 |
|
‘ |
|
|
left single quotation mark |
146 |
|
’ |
|
|
right single quotation mark |
147 |
|
“ |
|
|
left double quotation mark |
148 |
|
” |
|
|
right double quotation mark |
149 |
|
• |
|
|
bullet |
150 |
|
– |
|
|
en dash |
151 |
|
— |
|
|
em dash |
152 |
|
˜ |
|
|
small tilde |
153 |
|
™ |
|
|
trade mark sign |
154 |
|
š |
|
|
Latin small s with caron |
155 |
|
› |
|
|
single right-pointing angle quotation mark |
156 |
|
œ |
|
|
Latin small ligature oe |
157 |
|
|
|
|
NOT USED |
158 |
|
ž |
|
|
Latin small z with caron |
159 |
|
Ÿ |
|
|
Latin Y with diaeresis |
160 |
|
|
|
|
no-break space |
161 |
|
¡ |
¡ |
¡ |
inverted exclamation mark |
162 |
|
¢ |
¢ |
¢ |
cent sign |
163 |
|
£ |
£ |
£ |
pound sign |
164 |
|
¤ |
¤ |
¤ |
currency sign |
165 |
|
¥ |
¥ |
¥ |
yen sign |
166 |
|
¦ |
¦ |
¦ |
broken bar |
167 |
|
§ |
§ |
§ |
section sign |
168 |
|
¨ |
¨ |
¨ |
diaeresis |
169 |
|
© |
© |
© |
copyright sign |
170 |
|
ª |
ª |
ª |
feminine ordinal indicator |
171 |
|
« |
« |
« |
left-pointing double angle quotation mark |
172 |
|
¬ |
¬ |
¬ |
not sign |
173 |
|
|
|
|
soft hyphen |
174 |
|
® |
® |
® |
registered sign |
175 |
|
¯ |
¯ |
¯ |
macron |
176 |
|
° |
° |
° |
degree sign |
177 |
|
± |
± |
± |
plus-minus sign |
178 |
|
² |
² |
² |
superscript two |
179 |
|
³ |
³ |
³ |
superscript three |
180 |
|
´ |
´ |
´ |
acute accent |
181 |
|
µ |
µ |
µ |
micro sign |
182 |
|
¶ |
¶ |
¶ |
pilcrow sign |
183 |
|
· |
· |
· |
middle dot |
184 |
|
¸ |
¸ |
¸ |
cedilla |
185 |
|
¹ |
¹ |
¹ |
superscript one |
186 |
|
º |
º |
º |
masculine ordinal indicator |
187 |
|
» |
» |
» |
right-pointing double angle quotation mark |
188 |
|
¼ |
¼ |
¼ |
vulgar fraction one quarter |
189 |
|
½ |
½ |
½ |
vulgar fraction one half |
190 |
|
¾ |
¾ |
¾ |
vulgar fraction three quarters |
191 |
|
¿ |
¿ |
¿ |
inverted question mark |
192 |
|
À |
À |
À |
Latin A with grave |
193 |
|
Á |
Á |
Á |
Latin A with acute |
194 |
|
 |
 |
 |
Latin A with circumflex |
195 |
|
à |
à |
à |
Latin A with tilde |
196 |
|
Ä |
Ä |
Ä |
Latin A with diaeresis |
197 |
|
Å |
Å |
Å |
Latin A with ring above |
198 |
|
Æ |
Æ |
Æ |
Latin AE |
199 |
|
Ç |
Ç |
Ç |
Latin C with cedilla |
200 |
|
È |
È |
È |
Latin E with grave |
201 |
|
É |
É |
É |
Latin E with acute |
202 |
|
Ê |
Ê |
Ê |
Latin E with circumflex |
203 |
|
Ë |
Ë |
Ë |
Latin E with diaeresis |
204 |
|
Ì |
Ì |
Ì |
Latin I with grave |
205 |
|
Í |
Í |
Í |
Latin I with acute |
206 |
|
Î |
Î |
Î |
Latin I with circumflex |
207 |
|
Ï |
Ï |
Ï |
Latin I with diaeresis |
208 |
|
Ð |
Ð |
Ð |
Latin Eth |
209 |
|
Ñ |
Ñ |
Ñ |
Latin N with tilde |
210 |
|
Ò |
Ò |
Ò |
Latin O with grave |
211 |
|
Ó |
Ó |
Ó |
Latin O with acute |
212 |
|
Ô |
Ô |
Ô |
Latin O with circumflex |
213 |
|
Õ |
Õ |
Õ |
Latin O with tilde |
214 |
|
Ö |
Ö |
Ö |
Latin O with diaeresis |
215 |
|
× |
× |
× |
multiplication sign |
216 |
|
Ø |
Ø |
Ø |
Latin O with stroke |
217 |
|
Ù |
Ù |
Ù |
Latin U with grave |
218 |
|
Ú |
Ú |
Ú |
Latin U with acute |
219 |
|
Û |
Û |
Û |
Latin U with circumflex |
220 |
|
Ü |
Ü |
Ü |
Latin U with diaeresis |
221 |
|
Ý |
Ý |
Ý |
Latin Y with acute |
222 |
|
Þ |
Þ |
Þ |
Latin Thorn |
223 |
|
ß |
ß |
ß |
Latin small sharp s |
224 |
|
à |
à |
à |
Latin small a with grave |
225 |
|
á |
á |
á |
Latin small a with acute |
226 |
|
â |
â |
â |
Latin small a with circumflex |
227 |
|
ã |
ã |
ã |
Latin small a with tilde |
228 |
|
ä |
ä |
ä |
Latin small a with diaeresis |
229 |
|
å |
å |
å |
Latin small a with ring above |
230 |
|
æ |
æ |
æ |
Latin small ae |
231 |
|
ç |
ç |
ç |
Latin small c with cedilla |
232 |
|
è |
è |
è |
Latin small e with grave |
233 |
|
é |
é |
é |
Latin small e with acute |
234 |
|
ê |
ê |
ê |
Latin small e with circumflex |
235 |
|
ë |
ë |
ë |
Latin small e with diaeresis |
236 |
|
ì |
ì |
ì |
Latin small i with grave |
237 |
|
í |
í |
í |
Latin small i with acute |
238 |
|
î |
î |
î |
Latin small i with circumflex |
239 |
|
ï |
ï |
ï |
Latin small i with diaeresis |
240 |
|
ð |
ð |
ð |
Latin small eth |
241 |
|
ñ |
ñ |
ñ |
Latin small n with tilde |
242 |
|
ò |
ò |
ò |
Latin small o with grave |
243 |
|
ó |
ó |
ó |
Latin small o with acute |
244 |
|
ô |
ô |
ô |
Latin small o with circumflex |
245 |
|
õ |
õ |
õ |
Latin small o with tilde |
246 |
|
ö |
ö |
ö |
Latin small o with diaeresis |
247 |
|
÷ |
÷ |
÷ |
division sign |
248 |
|
ø |
ø |
ø |
Latin small o with stroke |
249 |
|
ù |
ù |
ù |
Latin small u with grave |
250 |
|
ú |
ú |
ú |
Latin small u with acute |
251 |
|
û |
û |
û |
Latin small with circumflex |
252 |
|
ü |
ü |
ü |
Latin small u with diaeresis |
253 |
|
ý |
ý |
ý |
Latin small y with acute |
254 |
|
þ |
þ |
þ |
Latin small thorn |
255 |
|
ÿ |
ÿ |
ÿ |
Latin small y with diaeresis |