Skip to content
Lex Fridman PodcastLex Fridman Podcast

Andrew Ng: Deep Learning, Education, and Real-World AI | Lex Fridman Podcast #73

Andrew Ng is one of the most impactful educators, researchers, innovators, and leaders in artificial intelligence and technology space in general. He co-founded Coursera and Google Brain, launched deeplearning.ai, Landing.ai, and the AI fund, and was the Chief Scientist at Baidu. As a Stanford professor, and with Coursera and deeplearning.ai, he has helped educate and inspire millions of students including me. This episode is presented by Cash App. Download it & use code "LexPodcast": Cash App (App Store): https://apple.co/2sPrUHe Cash App (Google Play): https://bit.ly/2MlvP5w PODCAST INFO: Podcast website: https://lexfridman.com/podcast Apple Podcasts: https://apple.co/2lwqZIr Spotify: https://spoti.fi/2nEwCF8 RSS: https://lexfridman.com/feed/podcast/ Full episodes playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4 Clips playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41 EPISODE LINKS: Andrew Twitter: https://twitter.com/AndrewYNg Andrew Facebook: https://www.facebook.com/andrew.ng.96 Andrew LinkedIn: https://www.linkedin.com/in/andrewyng/ deeplearning.ai: https://www.deeplearning.ai landing.ai: https://landing.ai AI Fund: https://aifund.ai/ AI for Everyone: https://www.coursera.org/learn/ai-for-everyone The Batch newsletter: https://www.deeplearning.ai/thebatch/ OUTLINE: 0:00 - Introduction 2:23 - First few steps in AI 5:05 - Early days of online education 16:07 - Teaching on a whiteboard 17:46 - Pieter Abbeel and early research at Stanford 23:17 - Early days of deep learning 32:55 - Quick preview: deeplearning.ai, landing.ai, and AI fund 33:23 - deeplearning.ai: how to get started in deep learning 45:55 - Unsupervised learning 49:40 - deeplearning.ai (continued) 56:12 - Career in deep learning 58:56 - Should you get a PhD? 1:03:28 - AI fund - building startups 1:11:14 - Landing.ai - growing AI efforts in established companies 1:20:44 - Artificial general intelligence CONNECT: - Subscribe to this YouTube channel - Twitter: https://twitter.com/lexfridman - LinkedIn: https://www.linkedin.com/in/lexfridman - Facebook: https://www.facebook.com/LexFridmanPage - Instagram: https://www.instagram.com/lexfridman - Medium: https://medium.com/@lexfridman - Support on Patreon: https://www.patreon.com/lexfridman

Lex FridmanhostAndrew Ngguest
Feb 20, 20201h 29mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 2:30

    Show setup: Andrew Ng’s impact, sponsor message, and the opening question

    Lex introduces Andrew Ng’s career highlights (Coursera, Google Brain, Baudu, DeepLearning.AI, Landing AI, AI Fund) and sets the tone for a wide-ranging conversation about education and real-world AI. A sponsor segment precedes Lex’s first question about what inspired Andrew to enter computer science and machine learning.

    • Andrew Ng’s roles across academia, industry, and entrepreneurship
    • Podcast format and sponsor/ads up front
    • Transition into Andrew’s origin story in AI
  2. 2:30 – 4:25

    Early inspirations: coding as a kid and a lifelong theme of automation

    Andrew describes learning to program at age 5–6, the joy of making simple games, and later being influenced by his father’s interest in expert systems and neural networks. He connects a high-school internship full of repetitive office work to a central motivation: automating tedious human tasks.

    • Early exposure to BASIC programming and “type from the book” learning
    • Reading about expert systems/neural networks as a teenager
    • Internship boredom as catalyst for thinking about automation
    • Automation as a through-line in his later work (ML and education)
  3. 4:25 – 8:30

    The birth of MOOCs: scaling a teacher’s reach and iterating toward Coursera

    Andrew explains how re-recording the same Stanford lectures pushed him to reuse content and invest time in deeper student interaction. He recounts the scrappy early days of MOOCs—filming late at night under pressure—and the iterative product lessons that led to Coursera’s eventual success.

    • Reusing lectures to scale teaching impact and free time for mentorship
    • Late-night recording workflow (webcam, tablet, mic) under tight deadlines
    • “Do what’s best for learners” as the core design principle
    • Iteration through failed/unused features (e.g., shared-login group watching)
  4. 8:30 – 16:03

    AI as a new kind of literacy: who learns ML and why it keeps expanding

    They discuss how MOOCs revealed a far larger global appetite for AI than the research community expected. Andrew frames ML and data science as potentially becoming as universal as literacy, enabling more people—including non-programmers—to communicate with computers through data-driven tools.

    • ML interest grew via word-of-mouth and visible usefulness
    • Future: many (possibly most) developers become “AI developers” broadly defined
    • Programming/ML as literacy analogy for human-to-computer communication
    • Data science as a practical entry point for many professions
  5. 16:03 – 17:37

    Teaching style: why the whiteboard still wins for certain ideas

    Lex asks why Andrew often prefers marker-and-whiteboard instruction over slides. Andrew argues that building equations and concepts incrementally can improve understanding, even though writing is slower—sometimes that slowness is a feature, not a bug.

    • Whiteboard supports step-by-step construction of mathematical ideas
    • Trade-off: clarity and pacing vs speed and verbosity
    • Slides vs writing depends on the concept being taught
    • Minimalism and focusing on fundamentals
  6. 17:37 – 23:17

    Early Stanford research with Pieter Abbeel: helicopter reinforcement learning and hard-earned lessons

    Andrew reflects on working with his first PhD student, Pieter Abbeel, on reinforcement learning for autonomous helicopter flight. He emphasizes how difficult real systems research is, highlighting failed localization approaches before settling on a camera-based solution that enabled the RL breakthroughs.

    • Helicopter RL as a rare practical RL application of that era
    • Localization challenges (GPS issues, hardware experiments) and repeated failures
    • Breakthrough via ground cameras for localization
    • Motivation: preference for work that functions in the real world and helps people
  7. 23:17 – 27:43

    Early deep learning convictions: what Google Brain got wrong—and what it got right

    Andrew recounts a key conversation with Geoff Hinton that pushed him toward unsupervised learning, and explains why that focus was “wrong for the time” compared to supervised learning’s immediate payoff. What proved right was the importance of scale—bigger models and more data driving better performance—an idea that fueled the pitch for Google Brain.

    • Hinton’s napkin argument: humans learn far more than labels can explain
    • Early overemphasis on unsupervised learning vs supervised learning’s near-term impact
    • Adam Coates’ scaling curve as decisive evidence for going bigger
    • Scale as a then-controversial bet that became foundational to modern deep learning
  8. 27:43 – 32:55

    Scale vs architecture, and the underrated problem: managing messy data in the real world

    They explore whether progress comes more from bigger datasets or better learning mechanisms, concluding it depends on the domain and headroom to base error. Andrew then shifts to a practical frontier: immature tooling and processes for dataset management, label noise, and small-data regimes common outside consumer internet.

    • Both scale and architectural innovation matter (transformers + scale)
    • Need for better data tooling analogous to evolution of code version control
    • Label disagreement and inconsistency as a core production ML challenge
    • Small-data settings amplify labeling errors and require new workflows
  9. 32:55 – 46:11

    DeepLearning.AI roadmap: prerequisites, what to learn first, and how to debug ML

    Andrew outlines how DeepLearning.AI’s specialization helps learners progress from basic neural nets to practical techniques for making models work. He emphasizes “debugging” ML as a distinct skill: systematically deciding whether to change data, architecture, regularization, or optimization rather than wasting months on the wrong lever.

    • Prerequisites: Python + very basic linear algebra; calculus optional
    • Core curriculum: networks, activations, CNNs, RNNs, attention models
    • Practical know-how: overfitting diagnosis, when more data helps vs doesn’t
    • ML debugging as a high-leverage skill that can make engineers 10–100× faster
  10. 46:11 – 56:08

    Unsupervised and self-supervised learning: why it’s still the most beautiful long-term idea

    Andrew argues that if he could focus purely on long-term research, he’d invest heavily in unsupervised learning—especially self-supervised methods that manufacture labels from unlabeled data. He explains concrete examples (rotation prediction, masked words, jigsaw permutations) and how learned representations transfer to downstream tasks.

    • Self-supervised learning as a practical slice of unsupervised learning
    • Generating “free labels” via transformations (rotate images, mask words, jigsaw)
    • Transfer learning via hidden representations as the payoff
    • Why supervised learning’s success delayed broader unsupervised exploration
  11. 56:08 – 58:39

    Building a deep learning career: habits, notes, and efficient learning systems

    They discuss how to turn initial interest into long-term progress: start now, use structured coursework early, then move to projects and continuous reading. Andrew stresses habit formation (regular learning schedules) and the learning science behind handwritten notes—slower capture forces recoding and improves retention.

    • Coursework is efficient early; projects and papers become essential later
    • Start small (MNIST-scale) to build momentum and intuition
    • Consistency beats bursts: sustained learning compounds over years
    • Handwritten notes improve retention by forcing summarization and recoding
  12. 58:39 – 1:03:25

    Should you get a PhD? Choosing paths by people, mentorship, and daily environment

    Andrew frames the PhD as one of several strong options, especially for those aiming at top academic roles, but not required for industry impact or startups. His strongest career advice is to optimize for the people you’ll work with—managers and peers matter more than the company logo or institution brand.

    • PhD is necessary for certain academic career goals, optional for many others
    • Compare opportunities: top labs vs top industry teams vs entrepreneurship
    • Daily collaborators are the biggest determinant of growth and happiness
    • Ask explicitly who you’ll work with; avoid vague rotation promises
  13. 1:03:25 – 1:11:14

    AI Fund and the startup studio model: customer obsession, repeatable company-building, and social good

    Andrew explains AI Fund as a “startup studio” designed to systematically create new AI companies, borrowing from his Baidu experience of launching new business lines. He emphasizes that many startups fail by building products no one wants, and adds a strong filter: he wants ventures that genuinely help people, not just monetize attention.

    • Customer validation as the core discipline for startup success
    • Startup studio as a structured, less-lonely way to found companies
    • Transferable playbooks: sales, hiring, marketing, ML product decisions
    • Intentional focus on social good; killing ideas that don’t meaningfully help
  14. 1:11:14 – 1:20:44

    Landing AI and AI transformation in established companies: start small, then deploy for real

    Andrew argues AI will transform every industry, with the next wave happening outside consumer internet—manufacturing, agriculture, healthcare, logistics, and more. He outlines the ‘start small’ approach to build internal trust (as Google Brain did with speech and Maps) and explains why production deployment is vastly harder than a Jupyter notebook demo.

    • AI’s biggest remaining economic impact is outside software/internet
    • Manufacturing visual inspection as a high-value, messy real-world problem
    • Start with small wins to build organizational belief and capability
    • Deployment gaps: distribution shift, robustness, workflow redesign, MLOps constraints
  15. 1:20:44 – 1:25:55

    AGI and alignment: why near-term harms (bias, inequality, misuse) deserve more focus

    Lex asks about human-level AI and existential risk. Andrew says AGI may happen but timeline estimates are highly uncertain, and he critiques “paperclip problem” style debates as distracting from urgent present-day issues like bias, deepfakes, and wealth inequality driven by winner-take-most dynamics.

    • AGI as plausible but with uncertain timelines (100–5000 years framing)
    • Trolley-problem focus is less practical than core safety/performance failures
    • Immediate priorities: bias, misuse (deepfakes), and concentration of power
    • AI + internet economics can amplify inequality and disrupt labor markets
  16. 1:25:55 – 1:29:09

    Reflections on mistakes, pride, and meaning: family, helping others, and a closing principle

    Andrew discusses how hindsight makes many lessons feel obvious and shares a lighthearted story about wishing he’d read certain books earlier. He highlights his daughter as a profound source of joy, and defines meaning as helping others reach their dreams—ending with a personal heuristic for choosing worthwhile work.

    • Learning is iterative; “obvious in hindsight” is a recurring experience
    • Personal fulfillment from family and from enabling others’ success
    • Meaning framed as moving the world forward and helping people
    • Closing maxim: pursue work that, if wildly successful, significantly helps others

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.