Skip to content
Michael I. Jordan: Machine Learning, Recommender Systems, and Future of AI | Lex Fridman Podcast #74
This video isn’t embeddableWatch on YouTube →
Lex Fridman PodcastLex Fridman Podcast

Michael I. Jordan: Machine Learning, Recommender Systems, and Future of AI | Lex Fridman Podcast #74

Michael I Jordan is a professor at Berkeley, and one of the most influential people in the history of machine learning, statistics, and artificial intelligence. He has been cited over 170,000 times and has mentored many of the world-class researchers defining the field of AI today, including Andrew Ng, Zoubin Ghahramani, Ben Taskar, and Yoshua Bengio. This episode is presented by Cash App. Download it & use code "LexPodcast": Cash App (App Store): https://apple.co/2sPrUHe Cash App (Google Play): https://bit.ly/2MlvP5w PODCAST INFO: Podcast website: https://lexfridman.com/podcast Apple Podcasts: https://apple.co/2lwqZIr Spotify: https://spoti.fi/2nEwCF8 RSS: https://lexfridman.com/feed/podcast/ Full episodes playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4 Clips playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41 EPISODE LINKS: (Blog post) Artificial Intelligence -- The Revolution Hasn’t Happened Yet: https://hdsr.mitpress.mit.edu/pub/wot7mkc1 OUTLINE: 0:00 - Introduction 3:02 - How far are we in development of AI? 8:25 - Neuralink and brain-computer interfaces 14:49 - The term "artificial intelligence" 19:00 - Does science progress by ideas or personalities? 19:55 - Disagreement with Yann LeCun 23:53 - Recommender systems and distributed decision-making at scale 43:34 - Facebook, privacy, and trust 1:01:11 - Are human beings fundamentally good? 1:02:32 - Can a human life and society be modeled as an optimization problem? 1:04:27 - Is the world deterministic? 1:04:59 - Role of optimization in multi-agent systems 1:09:52 - Optimization of neural networks 1:16:08 - Beautiful idea in optimization: Nesterov acceleration 1:19:02 - What is statistics? 1:29:21 - What is intelligence? 1:37:01 - Advice for students 1:39:57 - Which language is more beautiful: English or French? CONNECT: - Subscribe to this YouTube channel - Twitter: https://twitter.com/lexfridman - LinkedIn: https://www.linkedin.com/in/lexfridman - Facebook: https://www.facebook.com/LexFridmanPage - Instagram: https://www.instagram.com/lexfridman - Medium: https://medium.com/@lexfridman - Support on Patreon: https://www.patreon.com/lexfridman

Lex FridmanhostMichael I. Jordanguest
Feb 24, 20201h 45mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 6:12

    From “AI” to a new engineering discipline: Jordan’s historical lens

    Michael I. Jordan reframes today’s AI boom as the early formation of an engineering discipline built from statistics and computer science, analogous to chemical or electrical engineering. He argues the current era is about building large-scale decision systems, not achieving human-like intelligence.

    • Machine learning as an emerging engineering field, not “true AI”
    • Analogy to chemical/electrical engineering: principles + scalable systems
    • Statistics/CS as proto-foundations; real-world systems still ad hoc
    • Aspirational “human-level AI” is far off and conceptually different
  2. 6:12 – 8:25

    How far are we from understanding the brain (and why hype misleads)

    Jordan emphasizes that we lack fundamental scientific understanding of brain computation, down to synapses and biochemical complexity. He warns that metaphors about the brain often get mistaken for science, and that overselling near-term breakthroughs misguides students and funding narratives.

    • Neuroscience is centuries-long; synapses are extraordinarily complex
    • Popular metaphors (electrical/economic/immune) aren’t explanations
    • Hype and PR demos aren’t scientific understanding
    • Young researchers can be misled by claims of being “close”
  3. 8:25 – 14:49

    Neuralink and brain–computer interfaces: useful engineering, not deep understanding

    Discussing Neuralink-style BCIs, Jordan allows that pattern-finding over neural signals can yield practical medical or assistive tools. But he rejects the idea that this implies near-term deep integration with cognition or true brain-level understanding.

    • BCIs can enable practical control (cursor, assistive tech)
    • Signal-processing without concepts is like remote-sensing a city’s economy
    • Potential medical value (e.g., brain disease) is more plausible near-term
    • Strong skepticism about Musk-level claims of near-term deep integration
  4. 14:49 – 18:59

    Why “Artificial Intelligence” is the wrong name (and what ML should include)

    Jordan critiques the term AI as historically contingent branding that sets unrealistic expectations. He prefers framing the field as large-scale decision-making under uncertainty with feedback loops—going beyond pattern recognition to consequential decisions in the world.

    • AI as a philosophical aspiration (McCarthy) vs today’s systems work
    • Machine learning should include decisions, uncertainty, and feedback loops
    • No perfect label exists; naming shapes public promises and misconceptions
    • Goal: reliable planetary-scale systems, not “intelligence” per se
  5. 18:59 – 23:53

    Ideas vs personalities in science, and a “friendly disagreement” with Yann LeCun

    Jordan argues progress needs diverse personalities but cautions against exuberance that promises too much. His main difference in emphasis with LeCun is prediction versus decision-making: prediction is powerful, but decisions demand uncertainty, risk, and consequences.

    • Science advances via both personalities and ideas; too much hype today
    • Prediction alone is limited; perfect prediction is impossible in practice
    • Decision-making needs error bars, risk, counterfactuals, and dialogue
    • Real-world consequential domains (medicine) highlight prediction’s limits
  6. 23:53 – 31:49

    Decision-making at scale: markets, ecosystems, and the missing “creator economy” infrastructure

    Jordan introduces distributed decision systems as market-like structures connecting producers and consumers at scale. Using music as a case study, he argues current platforms fail to create transparent, trustworthy markets that let long-tail creators build sustainable careers.

    • Core framing: large collections of agents making decisions under uncertainty
    • Music example: streaming creates consumption but not fair creator markets
    • Transparency dashboards and verified demand signals could enable careers
    • New markets can create jobs and increase welfare, but need principled design
  7. 31:49 – 34:18

    Recommender systems: powerful but incomplete—must pair with economics and incentives

    Recommender systems matter greatly, but Jordan stresses they aren’t magic and must be integrated with market design, matching, and incentives. He praises early Amazon recommendations for discovery while noting many domains (restaurants, apparel) are harder and require richer mechanisms.

    • Good recommenders are much better than bad ones; still a major industry
    • Amazon as a positive example of discovery and peer-induced learning
    • Harder domains need blended tools: recommenders + matching/market design
    • Research remains open for scalable, robust recommendation ecosystems
  8. 34:18 – 44:47

    Facebook, advertising, and why the business model breaks trust (plus micropayment-style alternatives)

    Jordan argues many social/news platform failures trace to ad-driven monetization, which optimizes for clicks and engagement rather than true value exchange. He proposes shifting toward direct producer–consumer markets (not necessarily micro) where users pay for high-value contributions and platforms take a modest cut.

    • Click-through optimization creates perverse incentives (fake news, outrage)
    • Fixing algorithms alone is “hopeless” without changing the business model
    • Direct value transactions (e.g., paying for tailored Mumbai advice)
    • Amazon vs Google vs Facebook: differing proximity to real markets and logistics
  9. 44:47 – 1:00:03

    Creepiness, privacy, and agency: what “control” should mean

    Jordan explains why targeted advertising and pervasive inference feel creepy: they lack an auditable, relationship-based reason for knowing personal information. Privacy is contextual and social; the core demand is agency and transparency without burdening users with endless checkboxes.

    • Trust depends on why data is known and how it’s used—auditable relationships
    • Privacy isn’t binary; villages had no privacy but had mutual support
    • Need layered institutions/standards analogous to electrical safety ecosystems
    • A sustainable future requires user agency, not opaque algorithmic control
  10. 1:00:03 – 1:01:11

    Anonymity, online darkness, and designing “worlds” worth entering

    The discussion turns to how anonymity and platform design amplify harmful behavior and waste attention. Jordan sketches possible directions—less anonymity, more locality, and bounded communities with norms—while noting the problem is deeply socio-technical.

    • Comment sections revealed unexpected social “darkness” amplified by tech
    • Anonymity changes behavior in psychologically complex ways
    • Possible remedies: locality, reduced anonymity, trusted community boundaries
    • Need forums that support sober, constructive discourse (podcasts as an example)
  11. 1:01:11 – 1:02:31

    Are humans fundamentally good? Good—but limited (and technology should help)

    Jordan states humans are fundamentally good but constrained by limited perspective and empathy. He hopes technology can broaden viewpoints and reduce ignorance-driven conflict, though he hasn’t yet seen it succeed at scale.

    • Humans are good but blinkered; empathy is hard
    • Ignorance and misperception drive conflict and escalation
    • Technology could, in principle, expand perspectives and understanding
    • Current systems often fail by amplifying the worst dynamics
  12. 1:02:31 – 1:09:51

    Optimization, uncertainty, and multi-agent thinking: where math helps (and where it doesn’t)

    Jordan resists framing human life or society as a simple optimization problem, emphasizing complexity and uncertainty. He contrasts optimization with sampling, then connects multi-agent systems to game-theoretic equilibria (Nash, Stackelberg) and highlights open problems in stochastic, decentralized settings.

    • Optimization vs sampling: point solutions vs distributions
    • World is massively uncertain; probabilistic reasoning is essential
    • Game theory as equilibrium analysis; Nash as a saddle point; Stackelberg as leader–follower
    • Strategic data collection and exploration in multi-agent environments remain open
  13. 1:09:51 – 1:16:08

    Deep learning optimization landscapes and why stochastic gradient works so well

    Jordan describes neural network loss surfaces as relatively smooth and navigable in over-parameterized regimes, though why some optima generalize better remains unclear. He expects architectures and optimization algorithms to co-evolve, with stochastic gradient methods staying central for now.

    • Over-parameterization yields many paths to good optima; descent often easier than expected
    • Generalization differences between optima still need explanation
    • Architectures and optimizers should be co-designed over time
    • Stochasticity can avoid pathological surface features and scales well in high dimensions
  14. 1:16:08 – 1:19:03

    Optimization beauty: Nesterov acceleration and the nontrivial mystery of gradients

    Jordan highlights Nesterov acceleration as a deep idea that achieves optimal convergence rates for smooth convex problems by using momentum-like structure—even moving uphill at times. He uses the discussion to emphasize that gradients are subtler than intuition suggests.

    • Gradient direction is mathematically optimal under constraints, but unintuitive
    • Nesterov acceleration achieves 1/k² rate vs gradient descent’s 1/k
    • Momentum intuition helps but doesn’t fully explain the method
    • Optimization theory remains rich, with ongoing attempts to understand acceleration
  15. 1:19:03 – 1:29:21

    What is statistics (and why decision theory, Bayes vs frequentist, and FDR matter)

    Jordan defines statistics as principled inference and decision-making under uncertainty, rooted in inverse problems and state policy. He explains Bayes vs frequentist as different ways of handling uncertainty in loss functions, then points to empirical Bayes and false discovery rate as especially elegant ideas.

    • Statistics bridges math, science, and technology; aims at reliable inference/decisions
    • Historical roots: inverse probability, Laplace, and governance needs
    • Frequentist guarantees (software reliability) vs Bayesian conditioning (domain-focused inference)
    • Empirical Bayes and false discovery rate as powerful middle-ground frameworks
  16. 1:29:21 – 1:45:48

    What is intelligence (markets as intelligence), advice to students, and the value of languages

    Jordan broadens “intelligence” beyond individuals, arguing markets and infrastructures are intelligent in being robust and adaptive. He closes with career advice emphasizing apprenticeship, breadth, cooperation, and internationalism—then a personal coda on learning French/Italian and the importance of humanities.

    • Markets as decentralized intelligent systems distinct from human cognition
    • Skepticism toward AGI forecasting in public; focus on present challenges
    • Student advice: apprenticeship, breadth, humility, and cooperative communities
    • Languages/humanities cultivate empathy and thinking; Jordan’s path to French/Italian

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.