String methods

Last reviewed May 28, 2026 Content v20260528

Track mode: server_script
Means: Server runner
Reading: ~1 min
Level: intermediate

This lesson

This lesson teaches String methods: Pandas tabular manipulation—indexing, dtypes, reshaping, and analysis habits for real-world tables.

Teams apply String methods in every serious Pandas project—skipping it leaves blind spots in analysis and reviews.

You will apply String methods in contexts like: CSV/Parquet analysis, ETL notebooks, and ad hoc reporting.

Read the narrative, run `import pandas as pd` snippets with in-memory DataFrames (install pandas and numpy with pip if needed), inspect `.head()`, `.dtypes`, and complete MCQs.

When you can explain the previous lesson's ideas in your own words.

Text columns use the .str accessor for vectorized string operations: lower/upper, strip, contains, split, and extract—without slow Python loops.

Common .str methods

s.str.lower(), s.str.upper(), s.str.strip()
s.str.contains('pattern') — boolean mask
s.str.replace('old', 'new')
s.str.split(',', expand=True) — split into columns
s.str.extract(r'(\d+)') — regex capture groups

Example

import pandas as pd
emails = pd.Series([' Ana@Mail.com ', 'bob@test.org'])
clean = emails.str.strip().str.lower()
domains = emails.str.split('@').str[1]
print(clean, domains, sep='\n')

Nullable string dtype

StringDtype ('string') supports pd.NA missing strings—prefer over object for text columns in new code.

Important interview questions and answers

Q: Why .str?
A: Dispatches vectorized string ops; NaN propagates safely without errors.
Q: contains regex?
A: Pass regex=True and use raw strings for pattern matching.

Self-check

Clean emails to lowercase stripped form.
Extract domain after @ with str.split.

Tip: Chain .str.strip().str.lower() on messy CSV text columns early in cleaning.

Interview prep

.str accessor?: Vectorized string ops; NaN propagates safely.
contains regex?: Pass regex=True for pattern matching in str.contains.

Interview tip Lesson completion confidence

Can you explain this lesson in 30 seconds without reading notes?

Self-reflection (saved on this device)

Not saved yet.

Playground

Runs on the configured server runner (dev: npm run runner with LEARNING_RUNNER_ENABLED=true). Output appears below the editor.

Code runner not available

Server runner is disabled. Set LEARNING_RUNNER_ENABLED=true and LEARNING_RUNNER_URL in .env (see .env.example).

Check yourself

Multiple choice — immediate feedback.

Discussion

Past discussion is visible to everyone. Only logged-in users can post comments and replies.

Starter discussion topics

str accessor?
contains regex?

No discussion yet. Be the first to ask a question.