Skip to content
Learn Netverks

Lesson

Step 36/58 62% through track

charsets

Character encodings

Last reviewed May 28, 2026 Content v20260528
Track mode
iframe_html
Means
HTML preview sandbox
Reading
~2 min
Level
advanced

This lesson

This lesson teaches Character encodings—the ideas, syntax, and habits you need before moving on in HTML.

Without a solid grasp of Character encodings, you will repeat mistakes in HTML exercises and on real pages or scripts.

You will apply Character encodings in contexts like: Websites, hybrid apps, email templates, design systems, and CMS-driven content.

Read the lesson, edit HTML/CSS in the playground, press Run to preview, then answer the lesson MCQs. Also use the HTML reference desk when you need tag or attribute lookup.

When intermediate lessons feel comfortable and you are ready for production-style trade-offs.

UTF-8 encodes Unicode code points using one to four bytes. ASCII characters occupy one byte—backward compatible with legacy ASCII tooling.

Declaring UTF-8

  • Early <meta charset="utf-8"> in head.
  • HTTP Content-Type headers aligned with bytes.
  • Save files without conflicting BOM expectations unless tooling demands BOM.

Legacy encodings

Windows code pages and ISO-8859-* appear in older systems—transcode to UTF-8 when migrating.

Detection pitfalls

Mojibake occurs when bytes interpreted under wrong mapping—fix declarations rather than chasing symptoms.

Email + feeds

HTML mail often strips or rewrites encodings—test with real providers; RSS readers may be stricter than browsers.

Example — correct declaration early in head

<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="utf-8">
  <title>Encoding demo</title>
</head>

What mojibake looks like

If Café is stored UTF-8 but read as ISO-8859-1 you may see Café—fix bytes + headers rather than patching individual strings forever.

Important interview questions and answers

  1. Q: What is the safest default character encoding for modern HTML?
    A: UTF-8, declared early with `` and matched by server `Content-Type` headers.
  2. Q: When are HTML entities still useful in UTF-8 pages?
    A: For reserved characters (`&`, `<`) and contexts where explicit escaping avoids parser ambiguity.
  3. Q: What is the key difference between HTML5 parsing and XHTML parsing?
    A: HTML5 recovers from many errors; XHTML (XML) treats many parse errors as fatal.

Pitfall: Charset meta must appear early in <head>.

Interview tip Lesson completion confidence

Can you explain this lesson in 30 seconds without reading notes?

Not saved yet.

Playground

Runs in your browser in a sandboxed frame. Backend runners appear when this track’s profile allows them.

Check yourself

Multiple choice — immediate feedback.

Discussion

Past discussion is visible to everyone. Only logged-in users can post comments and replies.

Starter discussion topics

  • What confused you about this lesson?
  • How would you explain this to a teammate in 30 seconds?

Sign up or log in to post comments and sync lesson progress across devices.

No discussion yet. Be the first to ask a question.

Jump