Skip to content
Modern WisdomModern Wisdom

How Much Does Google Know About Me? | Seth Stephens-Davidowitz | Modern Wisdom 134

Seth Stephens-Davidowitz is a former Data Scientist at Google and a writer. There are things which you write into Google which you have never told another person. Our search history is a window into the deepest recesses of our mind which has never before been available. Time for the big data analysts like Seth to step in and look at what we can discover from these information. Why do people commit suicide? How many Americans are racist? What is the most popular type of pornography in India? And what is the biggest determining factor in a child's development? Extra Stuff: Buy Everybody Lies - https://amzn.to/2QZqHH0 Follow Stephen on Twitter - https://twitter.com/SethS_D Take a break from alcohol and upgrade your life - https://6monthssober.com/podcast Check out everything I recommend from books to products - https://www.amazon.co.uk/shop/modernwisdom #bigdata #google #datascience - Listen to all episodes online. Search "Modern Wisdom" on any Podcast App or click here: iTunes: https://apple.co/2MNqIgw Spotify: https://spoti.fi/2LSimPn Stitcher: https://www.stitcher.com/podcast/modern-wisdom - Get in touch in the comments below or head to... Instagram: https://www.instagram.com/chriswillx Twitter: https://www.twitter.com/chriswillx Email: modernwisdompodcast@gmail.com

Seth Stephens-DavidowitzguestChris Williamsonhost
Jan 16, 20201h 2mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 1:59

    From ad-click optimization to “data for social good”

    Seth and Chris open by discussing Seth’s day-to-day work as a data scientist and author, and why many people enter data science for money before seeking meaning. Seth frames big data as a tool that can be redirected from ads and finance toward socially valuable insights.

  2. 1:59 – 5:43

    The “How Big Is My Penis?” story: what private searches reveal

    The conversation turns to the provocative early insight that men search Google about penis size more than any other body part. Seth uses it to illustrate how online behavior exposes insecurity and absurdity that people hide in public.

  3. 5:43 – 8:41

    Why the book is called Everybody Lies: surveys vs. behavioral data

    Seth explains the central thesis: people lie to others and even to anonymous surveys, skewing traditional research methods. Internet data—especially Google searches—often provides a more accurate window into what people actually think and do.

  4. 8:41 – 10:34

    The art of data science: creativity, pattern-finding, and cultural oddities

    Seth argues that data science is not purely technical—it requires creativity to identify meaningful “nuggets” in massive datasets. He shares surprising examples (like India-specific searches) that traditional research would likely miss.

  5. 10:34 – 15:01

    Getting Pornhub data—and why academia underuses it

    They discuss Pornhub’s annual stats and Seth’s experience obtaining and analyzing Pornhub datasets for his book. Seth notes the methodological conservatism of academia and the missed opportunity to study large-scale, revealed-preference sexual behavior.

  6. 15:01 – 19:41

    What porn data shows: women’s preferences, universality, and “why” questions

    Seth shares findings that surprised audiences: certain violent/rape-themed porn is disproportionately popular among women, and this pattern appears broadly across cultures. The discussion expands into how difficult it is to explain “why,” not just report correlations.

  7. 19:41 – 22:58

    Sexual orientation signals: lesbian porn among straight women and closeted men

    Seth describes the popularity of lesbian porn among women who identify as straight, contrasting it with much lower rates of gay porn consumption among men. He then explains how regional patterns suggest closeted homosexuality in areas where it’s harder to be openly gay.

  8. 22:58 – 25:24

    Search strings as narratives: suicide, stigma, and the herpes insight

    Chris asks about other revealing ‘search sequences,’ and Seth describes work on suicide-related searches. A key discovery: herpes diagnoses can trigger suicidal ideation among young people, driven largely by stigma and fear rather than physical severity.

  9. 25:24 – 29:43

    Role-model searches: “celebrities with herpes” and what people are really seeking

    Seth explains that people often search for celebrities who share their condition as a way to reduce shame and find hope. The tragedy: for herpes, search results often surface celebrity denials rather than supportive disclosures—potentially worsening stigma.

  10. 29:43 – 34:01

    Elections and prediction: what search behavior can reveal about voting

    The discussion shifts to the 2020 U.S. election and how internet data might help forecast outcomes. Seth shares a subtle indicator: the order people type candidate names (e.g., “Trump Clinton” vs. “Clinton Trump”) correlates with preference and may reveal subconscious leanings.

  11. 34:01 – 38:27

    Dating insights from recorded speed dates: what predicts a second date

    Chris pulls a practical example: what language increases second-date probability. Seth summarizes research that mined recorded speed-date conversations, revealing cues like supportive phrases, laughter, hedge words, and how much a woman talks about herself.

  12. 38:27 – 53:16

    The next book: data-driven life decisions (happiness, parenting, neighborhoods)

    Seth outlines his upcoming book focused on using data to make better choices across life domains. He shares counterintuitive happiness findings about alcohol timing and major parenting insights suggesting neighborhood and role models matter more than most in-home tactics.

  13. 53:16 – 59:23

    Other datasets and controversies: abortions, racism, Wikipedia births, Facebook fandom

    They survey additional platforms Seth has analyzed beyond Google: DIY abortion searches mapping to access restrictions, Stormfront activity, Wikipedia birthplace patterns, and Facebook fandom in the NBA. Seth notes that some problems are easier to find insights in than the highly-optimized stock market.

  14. 59:23 – 1:02:01

    Wrap-up: anonymity, personal tech habits, and where to find Seth

    Chris closes by reflecting on the power—and limits—of anonymous aggregated data: it reveals patterns without identifying individuals. Seth jokes that the biggest behavioral change is Googling himself more, then shares how listeners can find his work online.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.