the office
Interactive Data Visualization

Who's Actually
in the Room?

9,986 scenes · 55,130 lines of dialogue · 201 episodes · 9 seasons
Every character interaction from The Office, mapped.
55,130
Lines of Dialogue
9,986
Scenes Analyzed
201
Episodes
9
Seasons
"Would I rather be feared or loved? Easy. Both. I want people to be afraid of how much they love me."
— Michael Scott, Regional Manager

The Dunder Mifflin Social Network

Who shares scenes with who? The thicker the connection, the more scenes two characters appear in together. Toggle between eras to see what happened when Michael left.

The Center Couldn't Hold

Michael Scott was the connective tissue of the entire show. When Steve Carell left after Season 7, the social fabric of the office fragmented.

Seasons 1 – 7

8.38
Average IMDB Rating
Michael connects to every character

Seasons 8 – 9

7.69
Average IMDB Rating
No one could replace the hub

Lines of Dialogue, Ranked

Michael Scott spoke nearly twice as much as anyone else — despite leaving two seasons early.

201 Episodes. 1.3 Million Votes.

Each cell is one episode, colored by its IMDB rating. Hover to explore. Notice the dip in Season 8.

6.0
9.8

Data Points Only True Fans Know

Every running gag, quantified.

40
"That's what she said"
Peak: Season 4 (10 times). Zero in Season 8. Michael said 90% of them.
27
Times Andy mentions Cornell
"I went to Cornell. Ever heard of it?" Averages 3x per season from S3–9.
75
Kevin's food references
Chili. M&Ms. Cookie Monster. Kevin talked about food more than work.
25
Meredith's drinking references
From vodka in the water bottle to the intervention. Consistent across all 9 seasons.
57B
Minutes streamed in 2020
Most-streamed show on ANY platform. Seven years after it ended. Netflix alone.
22.9M
Viewers: "Stress Relief"
Post-Super Bowl episode. Dwight's fire drill. The highest-rated cold open ever.
"I wish there was a way to know you're in the good old days before you've actually left them."
— Andy Bernard, S9E23 "Finale"

Friends just dropped. Next one's almost done.

9,000+ people get these before they hit Reddit. Join them.

See more data viz · We build these for companies too

· Rating: · votes

Sources & Methodology

schrute R Package Lindblad, B. Complete transcript from The Office (US). 55,130 lines, 12 variables. github.com/bradlindblad/schrute
TidyTuesday (2020-03-17) Episode ratings from IMDB. github.com/rfordatascience/tidytuesday
Kaggle Transcripts Complete dialogue/transcript, 55K+ lines. kaggle.com
OfficeTally Nielsen ratings archives, all 9 seasons. officetally.com
Dunderpedia Character data, episode details, catchphrase counts. theoffice.fandom.com
Methodology: Scene co-occurrence is defined as two characters having dialogue within the same scene of the same episode. Data sourced from the schrute R package (55,130 lines across 9,986 scenes). Character interaction counts represent the number of distinct scenes where both characters speak. IMDB ratings as of the TidyTuesday 2020-03-17 dataset (1.3M+ total votes). Streaming data from Nielsen 2020 annual report.
Buy me a coffee