What is database normalization and why does it matter?

Reported in Zalando European engineering loops. DBMS fundamentals covering 1NF through 3NF and when to denormalize.

Location

Paris, France

Study track

SQL

How to frame this at Zalando: Connect your answer to measurable impact, clarity of thought, and trade-offs the team cares about. Below is a strong baseline response you can adapt with your own project examples.

Normalization is the process of organizing tables to reduce redundancy and update anomalies. Each fact should live in one place so inserts, updates, and deletes stay consistent.

1NF: atomic columns, no repeating groups. 2NF: no partial dependency on a composite key—all non-key columns depend on the entire primary key. 3NF: no transitive dependency—non-key columns depend only on the key, not on other non-key columns.

Example: storing department name on every employee row violates 3NF if department name depends on dept_id alone—split into departments and reference by foreign key.

Denormalization is intentional duplication for read performance (analytics, caching). Interviewers want you to normalize first, then justify controlled denormalization with measured query patterns.

What is database normalization and why does it matter?

Interview scenario

Try answering aloud first

Comments (0)