Character encodings

Last reviewed May 28, 2026 Content v20260528

Track mode

iframe_html

Means

HTML preview sandbox

Reading

~2 min

Level

advanced

This lesson

This lesson teaches Character encodings—the ideas, syntax, and habits you need before moving on in HTML.

Without a solid grasp of Character encodings, you will repeat mistakes in HTML exercises and on real pages or scripts.

You will apply Character encodings in contexts like: Websites, hybrid apps, email templates, design systems, and CMS-driven content.

Read the lesson, edit HTML/CSS in the playground, press Run to preview, then answer the lesson MCQs. Also use the HTML reference desk when you need tag or attribute lookup.

When intermediate lessons feel comfortable and you are ready for production-style trade-offs.

UTF-8 encodes Unicode code points using one to four bytes. ASCII characters occupy one byte—backward compatible with legacy ASCII tooling.

Declaring UTF-8

Early <meta charset="utf-8"> in head.
HTTP Content-Type headers aligned with bytes.
Save files without conflicting BOM expectations unless tooling demands BOM.

Legacy encodings

Windows code pages and ISO-8859-* appear in older systems—transcode to UTF-8 when migrating.

Detection pitfalls

Mojibake occurs when bytes interpreted under wrong mapping—fix declarations rather than chasing symptoms.

Email + feeds

HTML mail often strips or rewrites encodings—test with real providers; RSS readers may be stricter than browsers.

Example — correct declaration early in `head`

<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="utf-8">
  <title>Encoding demo</title>
</head>

What mojibake looks like

If Café is stored UTF-8 but read as ISO-8859-1 you may see CafÃ©—fix bytes + headers rather than patching individual strings forever.

Important interview questions and answers

Q: What is the safest default character encoding for modern HTML?
A: UTF-8, declared early with `` and matched by server `Content-Type` headers.
Q: When are HTML entities still useful in UTF-8 pages?
A: For reserved characters (`&`, `<`) and contexts where explicit escaping avoids parser ambiguity.
Q: What is the key difference between HTML5 parsing and XHTML parsing?
A: HTML5 recovers from many errors; XHTML (XML) treats many parse errors as fatal.

Pitfall: Charset meta must appear early in <head>.

Playground

Runs in your browser in a sandboxed frame. Backend runners appear when this track’s profile allows them.

Discussion

Past discussion is visible to everyone. Only logged-in users can post comments and replies.

Starter discussion topics

What confused you about this lesson?
How would you explain this to a teammate in 30 seconds?

No discussion yet. Be the first to ask a question.

Character encodings

This lesson

Declaring UTF-8

Legacy encodings

Detection pitfalls

Email + feeds

Example — correct declaration early in `head`

What mojibake looks like

Important interview questions and answers

Interview tip Lesson completion confidence

Playground

Check yourself

Discussion

This lesson

Declaring UTF-8

Legacy encodings

Detection pitfalls

Email + feeds

Example — correct declaration early in head

What mojibake looks like

Important interview questions and answers

Interview tip Lesson completion confidence

Playground

Check yourself

Discussion

Example — correct declaration early in `head`