Modern WisdomHow Much Does Google Know About Me? | Seth Stephens-Davidowitz | Modern Wisdom 134
CHAPTERS
- 0:00 – 1:59
From ad-click optimization to “data for social good”
Seth and Chris open by discussing Seth’s day-to-day work as a data scientist and author, and why many people enter data science for money before seeking meaning. Seth frames big data as a tool that can be redirected from ads and finance toward socially valuable insights.
- 1:59 – 5:43
The “How Big Is My Penis?” story: what private searches reveal
The conversation turns to the provocative early insight that men search Google about penis size more than any other body part. Seth uses it to illustrate how online behavior exposes insecurity and absurdity that people hide in public.
- 5:43 – 8:41
Why the book is called Everybody Lies: surveys vs. behavioral data
Seth explains the central thesis: people lie to others and even to anonymous surveys, skewing traditional research methods. Internet data—especially Google searches—often provides a more accurate window into what people actually think and do.
- 8:41 – 10:34
The art of data science: creativity, pattern-finding, and cultural oddities
Seth argues that data science is not purely technical—it requires creativity to identify meaningful “nuggets” in massive datasets. He shares surprising examples (like India-specific searches) that traditional research would likely miss.
- 10:34 – 15:01
Getting Pornhub data—and why academia underuses it
They discuss Pornhub’s annual stats and Seth’s experience obtaining and analyzing Pornhub datasets for his book. Seth notes the methodological conservatism of academia and the missed opportunity to study large-scale, revealed-preference sexual behavior.
- 15:01 – 19:41
What porn data shows: women’s preferences, universality, and “why” questions
Seth shares findings that surprised audiences: certain violent/rape-themed porn is disproportionately popular among women, and this pattern appears broadly across cultures. The discussion expands into how difficult it is to explain “why,” not just report correlations.
- 19:41 – 22:58
Sexual orientation signals: lesbian porn among straight women and closeted men
Seth describes the popularity of lesbian porn among women who identify as straight, contrasting it with much lower rates of gay porn consumption among men. He then explains how regional patterns suggest closeted homosexuality in areas where it’s harder to be openly gay.
- 22:58 – 25:24
Search strings as narratives: suicide, stigma, and the herpes insight
Chris asks about other revealing ‘search sequences,’ and Seth describes work on suicide-related searches. A key discovery: herpes diagnoses can trigger suicidal ideation among young people, driven largely by stigma and fear rather than physical severity.
- 25:24 – 29:43
Role-model searches: “celebrities with herpes” and what people are really seeking
Seth explains that people often search for celebrities who share their condition as a way to reduce shame and find hope. The tragedy: for herpes, search results often surface celebrity denials rather than supportive disclosures—potentially worsening stigma.
- 29:43 – 34:01
Elections and prediction: what search behavior can reveal about voting
The discussion shifts to the 2020 U.S. election and how internet data might help forecast outcomes. Seth shares a subtle indicator: the order people type candidate names (e.g., “Trump Clinton” vs. “Clinton Trump”) correlates with preference and may reveal subconscious leanings.
- 34:01 – 38:27
Dating insights from recorded speed dates: what predicts a second date
Chris pulls a practical example: what language increases second-date probability. Seth summarizes research that mined recorded speed-date conversations, revealing cues like supportive phrases, laughter, hedge words, and how much a woman talks about herself.
- 38:27 – 53:16
The next book: data-driven life decisions (happiness, parenting, neighborhoods)
Seth outlines his upcoming book focused on using data to make better choices across life domains. He shares counterintuitive happiness findings about alcohol timing and major parenting insights suggesting neighborhood and role models matter more than most in-home tactics.
- 53:16 – 59:23
Other datasets and controversies: abortions, racism, Wikipedia births, Facebook fandom
They survey additional platforms Seth has analyzed beyond Google: DIY abortion searches mapping to access restrictions, Stormfront activity, Wikipedia birthplace patterns, and Facebook fandom in the NBA. Seth notes that some problems are easier to find insights in than the highly-optimized stock market.
- 59:23 – 1:02:01
Wrap-up: anonymity, personal tech habits, and where to find Seth
Chris closes by reflecting on the power—and limits—of anonymous aggregated data: it reveals patterns without identifying individuals. Seth jokes that the biggest behavioral change is Googling himself more, then shares how listeners can find his work online.