Skip to content
Judea Pearl: Causal Reasoning, Counterfactuals, and the Path to AGI | Lex Fridman Podcast #56
This video isn’t embeddableWatch on YouTube →
Lex Fridman PodcastLex Fridman Podcast

Judea Pearl: Causal Reasoning, Counterfactuals, and the Path to AGI | Lex Fridman Podcast #56

Lex Fridman and Judea Pearl on judea Pearl explains causal reasoning as missing key to true AI.

Lex FridmanhostJudea Pearlguest
Dec 11, 20191h 23mWatch on YouTube ↗

EVERY SPOKEN WORD

  1. 0:005:58

    Judea Pearl’s origins: analytic geometry and the power of translating “languages”

    1. LF

      The following is a conversation with Judea Pearl, professor at UCLA, and the winner of the Turing Award that's generally recognized as the Nobel Prize of computing. He's one of the seminal figures in the field of artificial intelligence, computer science, and statistics. He has developed and championed probabilistic approaches to AI, including Beijing Networks, and profound ideas in causality in general. These ideas are important not just to AI, but to our understanding and practice of science. But in the field of AI, the idea of causality, cause and effect, to many, lie at the core of what is currently missing and what must be developed in order to build truly intelligent systems. For this reason, and many others, his work is worth returning to often. I recommend his most recent book, called Book of Why, that presents key ideas from a lifetime of work in a way that is accessible to the general public. This is the Artificial Intelligence podcast. If you enjoy it, subscribe on YouTube, give it five stars on Apple Podcast, support it on Patreon, or simply connect with me on Twitter, @lexfridman, spelled F-R-I-D-M-A-N. If you leave a review on Apple Podcast especially, but also Castbox, or comment on YouTube, consider mentioning topics, people, ideas, questions, quotes in science, tech, and philosophy that you find interesting, and I'll read them on this podcast. I won't call out names, but I love comments with kindness and thoughtfulness in them, so I thought I'd share them with you. Someone on YouTube highlighted a quote from the conversation with Noam Chomsky, where he said that, "The significance of your life is something you create." I like this line as well. On most days, the existentialist approach to life is one I find liberating and fulfilling. I recently started doing ads at the end of the introduction. I'll do one or two minutes after introducing the episode, and never any ads in the middle that break the flow of the conversation. I hope that works for you and doesn't hurt the listening experience. This show is presented by Cash App, the number one finance app in the App Store. I personally use Cash App to send money to friends, but you can also use it to buy, sell, and deposit Bitcoin in just seconds. Cash App also has a new investing feature. You can buy fractions of a stock, say $1 worth, no matter what the stock price is. Brokerage services are provided by Cash App Investing, a subsidiary of Square and member SIPC. I'm excited to be working with Cash App to support one of my favorite organizations called FIRST, best known for their FIRST robotics and LEGO competitions. They educate and inspire hundreds of thousands of students in over 110 countries, and have a perfect rating on Charity Navigator, which means the donated money is used to the maximum effectiveness. When you get Cash App from the App Store or Google Play, and use code LEXPODCAST, you'll get $10, and Cash App will also donate $10 to FIRST, which again is an organization that I've personally seen inspire girls and boys to dream of engineering a better world. And now, here's my conversation with Judea Pearl. You mentioned in an interview that science is not a collection of facts, but a constant human struggle with the mysteries of nature. What was the first mystery that you can recall that hooked you, that captivated your curiosity?

    2. JP

      Oh, the first mystery. That's a good one. Yeah, I remember that.

    3. LF

      What was it?

    4. JP

      I had a fever for three days, uh, when I learned about Descartes' analytic geometry, and I found out that you can do all the construction in geometry using algebra. And I couldn't get over it. I simply couldn't get out of bed. (laughs)

    5. LF

      So, what- what kind of world does analytic geometry unlock?

    6. JP

      Well, it connects algebra with geo- geometry. Okay, so Descartes had the idea that, um, (clears throat) geometrical construction and geometrical theorems and the assumptions can be articulated in the language of algebra, which means that all the proof that we did in high school in trying to prove that the three bisectors meet at one point, and that... (laughs) Okay, uh, all this can be proven by just-

    7. LF

      Through algebra.

    8. JP

      ... shuffling around notation. Uh, that was a-

    9. LF

      The connection-

    10. JP

      ... traumatic experience.

    11. LF

      (laughs) The tr- traumatic experience.

    12. JP

      For me, it was. I'm telling you, right?

    13. LF

      So, it's the connection between the different mathematical disciplines, that they all-

    14. JP

      No, in between two diff- two different languages.

    15. LF

      Just even... Languages?

    16. JP

      Yeah.

    17. LF

      So, which mathematic discipline is the most beautiful? Is geometry it for you?

    18. JP

      Both are beautiful. They have, uh, almost the same power.

    19. LF

      But there's a visual element to geometry, being a-

    20. JP

      Visually, (laughs) it's more transparent, but, uh, once you get over to algebra, then, uh, y- a linear equation is a straight line. This translation is easily absorbed, uh, and, um, the f- to pass a tangent to a circle, uh, you know, (laughs) you have the basic theorems and you can do it with algebra. So, but, uh, the transition from one to another was really ... I thought that Descartes was the greatest mathematician of all times. (laughs)

    21. LF

      So, you have been at the ... if you think of engineering and mathematics as a spectrum-

    22. JP

      Yes.

    23. LF

      ... uh, you have been ... you have walked casually along this spectrum throughout your- throughout your life. You know, you had a little bit of engineering, and then, you know, uh, you have b- done a little bit of mathematics here and there. (laughs)

  2. 5:587:09

    Learning math “chronologically”: teachers, history, and the people behind theorems

    1. JP

      Well, not a little bit. I mean, we got a very solid background in mathematics because our teachers were geniuses.

    2. LF

      Yeah.

    3. JP

      Our teachers came from Germany in the 1930s, running away from Hitler. Uh, they left their careers in Heidelberg and Berlin, and came to teach high school in Israel. And we were the beneficiary of that experiment. So, I al- when they taught us math, the good way.

    4. LF

      What's the good way to teach math?

    5. JP

      Chronologically.

    6. LF

      The people?

    7. JP

      The people behind the theorems, yeah. Their cousins, and their nieces (laughs) and their faces, and how they jumped from the bathtub when they scream, "Eureka!" (laughs) and ran naked in town. (laughs)

    8. LF

      So, you were almost educated as a historian of math.

    9. JP

      No, we just got a glimpse of that history together with the theorem. So, every, um, exercise in math was connected with a person-

    10. LF

      S-

    11. JP

      ... and the time of the person, the period.

    12. LF

      The period also mathematically speaking?

    13. JP

      Mathematically speaking, yes. Not the politics.

    14. LF

      Yeah.

    15. JP

      No.

  3. 7:099:14

    From engineering and physics to AI: superconductivity and the “Pearl vortex”

    1. LF

      So, and then in, uh, in university, you have, you have gone on to do engineering.

    2. JP

      Yeah. I got a BS in engineering at Technion.

    3. LF

      Mm-hmm.

    4. JP

      Right? And then, uh, I moved here for graduate work, and I got to, I did engineering, uh, in addition to physics in Rutgers.

    5. LF

      Mm-hmm.

    6. JP

      And it would combine very nicely with my thesis, which I did in RCA Laboratories in superconductivity.

    7. LF

      And then somehow thought to switch to, uh, almost computer science software even, even the, the, uh, not switch, but long to become, uh, to get into software engineering a little bit-

    8. JP

      Yes.

    9. LF

      ... almost even programming, if you can call it that in the '70s. So, though there's all these disciplines.

    10. JP

      Yeah.

    11. LF

      If you were to pick a favorite, what, uh, in terms of engineering and mathematics, which path do you think has more beauty? Which path has more power?

    12. JP

      It's hard to choose now. I enjoy doing physics, and even have a vortex named on my name. So, I have a (laughs) investment in immortality. (laughs)

    13. LF

      Do you, uh, so what- what is a vortex?

    14. JP

      Vortex is in superconductivity.

    15. LF

      In the superconductivity, yeah.

    16. JP

      You have permanent current, uh, swirling around. One way or the other, you can have it store one or zero for computer. That was we worked on in, in the 1960 in RCA. And, uh, I discovered a few nice phenomena with the vortices.

    17. LF

      (laughs)

    18. JP

      You push current-

    19. LF

      So, there's a pearl vortex.

    20. JP

      ... and they move. Pearl vortex, right? You can Google it.

    21. LF

      (laughs)

    22. JP

      Right? I, I didn't know about it, but the physicists, they picked up on my thesis, on my, uh, PhD thesis. And, uh, they, um, it becomes popular when thin film superconductors became important for high temperature superconductors. So, they called it, uh, pearl vortex without my knowledge. (laughs) I discovered it only about 15 years ago.

  4. 9:1411:59

    Determinism, quantum mechanics, and free will as an AI-solvable illusion

    1. LF

      You have footprints in all of the sciences. So, let's talk about the universe a little bit. Is the universe at the lowest level deterministic or stochastic, in your amateur philosophy view? Put another way, does God play dice?

    2. JP

      Well, we, we know it is stochastic, right? Because-

    3. LF

      Today. Today, we think it is stochastic.

    4. JP

      Yes. We think, because we have the Heisenberg uncertainty principle, and we have some, uh, experiments to, um, confirm that.

    5. LF

      All we have is experiments to confirm it. We don't understand why.

    6. JP

      Why is already-

    7. LF

      You wrote a book about why. (laughs)

    8. JP

      Yes. (laughs) Yeah. It- it's a puzzle. It's a puzzle that you have the, uh, dice-flipping machine or god, and the, and the, uh, result of the flipping propagate with a speed faster than the speed of light (laughs) .

    9. LF

      Right.

    10. JP

      We can't explain it, okay? So, um, but it's, it only governs microscopic phenomena.

    11. LF

      So, you don't think of quantum mechanics as useful-

    12. JP

      No.

    13. LF

      ... uh, for understanding-

    14. JP

      It's just a diversion.

    15. LF

      ... the nature of reality?

    16. JP

      No. Diversionary.

    17. LF

      So, in your thinking, the world might as well be deterministic?

    18. JP

      The world is deterministic. And as far as the neural fir- neuron firing is concerned, it, it's a deterministic to first approximation.

    19. LF

      What about free will?

    20. JP

      Free will is also a nice e- exercise. Free will is an illusion-

    21. LF

      Illusion.

    22. JP

      ... that we AI people are going to solve.

    23. LF

      So, what do you think once we solve it, that solution will look like? Once we put it in the page.

    24. JP

      The solution will look like... First of all, it will look like a machine, a machine that act as though it had free will. It communicates with other machines as though they have free will. And you wouldn't be able to tell the difference between a machine that does and machine that doesn't have free will. Okay?

    25. LF

      So, the illusion, it propagates the illusion of free will amongst the other machines.

    26. JP

      And, and faking it is having it. Okay? That's what Turing test is all about.

    27. LF

      Yeah.

    28. JP

      Faking intelligence is intelligent, because it's not easy to fake. It's very hard to fake, and you can only fake if you have it.

    29. LF

      (laughs) Yeah, that-

    30. JP

      So-

  5. 11:5914:48

    What probability and correlation really mean (and why we reach for causality anyway)

    1. LF

      So, uh, let- let's begin at the beginning with probability, uh, both philosophically and mathematically. What- what does it mean to say the probability of something happening is 50%? What is probability?

    2. JP

      It's the degree of uncertainty that an agent has about the world.

    3. LF

      You're still expressing some knowledge in that statement.

    4. JP

      Of course. If the probability is 90%, it's absolutely different kind of knowledge than if it is 10%. (clears throat)

    5. LF

      But it's still not solid knowledge. It's-

    6. JP

      It is solid knowledge, boy. If you tell me that (laughs) the 90% assurance smoking will, um, give you lung cancer in five years, versus 10%, it's a piece of useful knowledge.

    7. LF

      So, this statistical view of the universe-

    8. JP

      Yeah.

    9. LF

      ... why is it useful? So, we're swimming in complete uncertainty.

    10. JP

      Yeah.

    11. LF

      Most of everything around us is-

    12. JP

      It allows you to predict things with a certain probability, and computing those probabilities are very useful. That's, uh, the whole idea of, uh, (sighs) of prediction, and you need prediction to be able to survive. If you cannot predict the future, then you're just a ... crossing the street will be extremely, uh, fearful.

    13. LF

      And so you've done a lot of work in causation, and so let's, let's think about correlation.

    14. JP

      I started with the probability.

    15. LF

      You started with probability.

    16. JP

      Yes.

    17. LF

      You've invented the Bayesian networks.

    18. JP

      Yeah.

    19. LF

      And so, you know, we'll, we'll, we'll, we'll dance back and forth between these levels of, uh-

    20. JP

      Okay.

    21. LF

      ... uh, uncertainty. But what, what is correlation? What is it? So, probability of something happening is something, but then there's a bunch of things happening, and, uh, sometimes they happen together, sometimes not. They're independent or not. So, how do you think about correlation of things?

    22. JP

      Correlation occurs when two things vary together over a very long time, is one way of measuring it, or when you have a bunch of variables that they all vary cohesively. Um, then we call we have a correlation here. And usually, when we think about correlation, we really think causally. Things can not be correlated unless there is a reason for them to vary together. Why should they vary together if they don't see each other? Why should they vary together?

    23. LF

      So, underlying it somewhere is causation.

    24. JP

      Yes. Hidden in our intuition, there is a notion of causation, because we cannot grasp any other logic except causation.

  6. 14:4823:11

    Conditioning pitfalls: selection effects, Simpson’s paradox, and observational studies

    1. LF

      And how does conditional probability differ from causation? So, what is conditional probability?

    2. JP

      Conditional probability, how things vary when one of them, uh, stays the same. Now, staying the same means that I have chosen to look only at those incidents where the guy has the same value as previous one. It's my choice as an experimenter. So, things that are not correlated before could become correlated. Like, for instance, if I have two coins which are uncorrelated, okay, and I choose only those flippings experiments in which a bell rings, and the bell rings when at least one of them is a tail, okay, then suddenly, I see correlation between the two coins, because I only look at the cases where the bell rang. You see, it's my design with my ignorance, essentially, with my, uh, audacity to ignore certain incidents, I suddenly create a correlation where it doesn't exist physically.

    3. LF

      Right. So that's ... You just outlined one of the flaws of observing the world and, and trying to infer something fundamental about the world from looking at the correlation.

    4. JP

      I don't look at it as a flaw. The world works like that. Which mean- but the flaws comes if you try to impose-

    5. LF

      Hmm.

    6. JP

      ... um, causal logic on correlation. It doesn't work too well.

    7. LF

      I mean, but that's exactly what we do. That's what ... That has been the majority of science, is you're r-

    8. JP

      No, no, no. Majority of, of naive science. Statisticians know it. The statisticians know that if you condition on a third variable, then you can destroy or create correlations among two other variables.

    9. LF

      Right.

    10. JP

      They know it. It's in the data.

    11. LF

      Right.

    12. JP

      There's nothing to surprise them. That's why they all dismiss the Simpson paradox. "Ah, we know it." They don't know anything about it. (laughs)

    13. LF

      Well, there's, uh, there's disciplines, like psychology, where all the variables are hard to ge- to account for. And so, uh, oftentimes, there is a leap between correlation to causation. You're, you're imposing-

    14. JP

      What do you mean a leap?

    15. LF

      Uh-

    16. JP

      Who, who is trying to get causation from correlation? No one.

    17. LF

      Not, not ... You're not proving causation-

    18. JP

      (laughs)

    19. LF

      ... but you're sort of, uh, um, discussing it, implying, sort of hypothesizing without ability to prove.

    20. JP

      Well, which discipline we have in mind? I'll tell you if they are obsolete-

    21. LF

      (laughs)

    22. JP

      ... or if they are outdated, or they're about to get outdated.

    23. LF

      Yes.

    24. JP

      Or (laughs) -

    25. LF

      Yes.

    26. JP

      Yeah, tell me which one you have in mind.

    27. LF

      Well, psychology, you know? Uh-

    28. JP

      Psychology, what is it? SEM? Structural equation modeling?

    29. LF

      No, no. I was thinking of applied psychology, studying, um ... for example, we work with human behavior in semi-autonomous vehicles, how people behave. And you have to conduct these studies of people driving cars.

    30. JP

      Everything starts with a question. What is a research question?

  7. 23:1129:23

    Causality needs a language: models first, discovery second

    1. LF

      Let's, let's, let's even take a step back. You invented, uh, Bayesian networks ... that look awfully a lot like they express something like causation, but they don't, not necessarily. So how do we turn Bayesian networks into expressing causation? How do we build causal networks? This A causes B, B causes C, how do we start to infer that kind of thing?

    2. JP

      We start asking ourself question, what are the factors that would determine the value of X? X could be blood pressure, uh, death, um, hungry, hunger.

    3. LF

      But these are hypotheses that we propose-

    4. JP

      Hypotheses, everything which has to do with causality comes from a theory, okay? The difference is only what kind ... how you interrogate the theory that you have in your mind.

    5. LF

      So, it still needs the human expert to propose-

    6. JP

      Right. You need the human expert to specify-

    7. LF

      Yeah.

    8. JP

      ... the initial model.... initial model could be very qualitative, just who listens to whom, by whom listen to, I mean one variable listen to the other. So, I say, "Okay, the tide is listening to the moon, and, and not to the rooster crow." Okay? And so far, this is our understanding of the world in which we live, scientific understanding of reality. We have to start there, because if we don't know how to handle cause and effect relationship when we do have a model, then we certainly do not know how to handle it when we don't have a model. So, let's start first, an AI slogan is, "Representation first, discovery second."

    9. LF

      Mm-hmm.

    10. JP

      Right? If I give you all the information that you need, can you do anything useful with it? That is the first, representation. How do you represent it? I give you all the knowledge in the world. How do you represent it? When you represent it, I ask you, "Can you infer X or Y or Z? Can you answer certain queries? Is it, uh, complex? Is it polynomial?" It... All the computer science exercises-

    11. LF

      Mm-hmm.

    12. JP

      ... we do, once you give me a representation for my knowledge, then you can ask me, "Now I understand how to represent things, how do I discover them?" It's a secondary thing.

    13. LF

      First of all, we... I- I should echo the statement that mathematics and the current m- m- much of the machine learning world has not considered causation, that A causes B.

    14. JP

      Correct.

    15. LF

      Just in anything. So that- that seems like a s- uh, that seems like a non-obvious thing that you- you'd think we would've really acknowledged it, but we haven't. So, we- we have to put that on the table. So, uh, knowledge. How hard is it to create a knowledge from which t- to work?

    16. JP

      In certain area-

    17. LF

      S-

    18. JP

      ... it's easy, because we have only four or five major variables, okay, and- and an epidemiologist or an economist can put them down, um, what, the- the minimum wage, uh, un- unemployment, p- policy, X, Y, Z, and- and start collecting data, and quantify the parameter that were left unquantified with the initial knowledge.

    19. LF

      Yeah.

    20. JP

      Okay? That's the routine work that you find in experimental psychology-

    21. LF

      Yes.

    22. JP

      ... in economics, ev- everywhere, any- any health science. That's a routine things. But I should emphasize, you should start with a research question: What do you want to estimate? Once you have that, you have to have a language of expressing what you want to estimate. You think it's easy? No.

    23. LF

      So, we can talk about two things. I think, b- one is, um, how the science of causation is, uh, very useful for a- uh, answering certain questions, and then the other is, how do we create intelligent systems, uh, that need to reason with causation? So, if my research question is, "How do I pick up this water bottle from the table?" (laughs) Uh, the w- all the knowledge that is required to be able to do that, how do we construct that knowledge base? Does it- does it- do we return back to the problem that we didn't solve in the '80s with expert systems? Do we have to solve that problem of automated construction of knowledge?

    24. JP

      Hmm. You're talking about the, uh, task of eliciting knowledge from an expert.

    25. LF

      Task of eliciting knowledge from an expert, or just self-discovery of more knowledge, uh, more and more knowledge, so automating the building of knowledge as much as possible.

    26. JP

      It's a different game in the causal domain, because... It- it's essentially the same thing. You have to start with some knowledge, and you're trying to enrich it.

    27. LF

      Mm-hmm.

    28. JP

      But you don't enrich it-

    29. LF

      Mm-hmm.

    30. JP

      ... by asking for more rules. You enrich it by asking for the data, for them to look at the data and quantifying, and ask queries that you couldn't answer when you started. You couldn't, because it- it- it's... the- the question is quite complex, and it's not within the, um, capability of ordinary cognition, of ordinary person, or ordinary expert even, to answer.

  8. 29:2334:47

    Interventions and the do-operator: asking “what if we do X?”

    1. LF

      So, what kind of questions do you think we can s- start to answer? What's-

    2. JP

      Even a simple one. Suppose... uh, yeah (laughs) . I start with easy one.

    3. LF

      Let's do it.

    4. JP

      Okay. What's the effect of a drug on my recovery? Uh, was it the aspirin that caused my headache to be cured, or was it the television program, or the good news I received? Um, this is already... you see, it's a difficult question, because it's find the cause from effect. The easy one is find effect from cause.

    5. LF

      That's right. So, first you construct a model saying that this is an important research question.

    6. JP

      Yeah.

    7. LF

      This is an important question. Then you- you d-

    8. JP

      No, no, I didn't construct a model yet. I just said it's important question.

    9. LF

      It's important question.

    10. JP

      And the first exercise is express it mathematically. What do you want to prove? Like, if I tell you, "What's the eff- what will be the effect of taking this drug?" Okay? You have to say that in mathematics. How do you say that?

    11. LF

      Yes.

    12. JP

      Can you write down the question? Not the answer.I want to find the effect of the drug on my headache.

    13. LF

      Right.

    14. JP

      W- write down, right? Write it down.

    15. LF

      That's where the do calculus comes in. (laughs)

    16. JP

      Yes. Do operator, what is do operator?

    17. LF

      Do operator, yeah.

    18. JP

      Yeah. You have to have a-

    19. LF

      Which is nice. It's the difference between association and intervention.

    20. JP

      Correct.

    21. LF

      Very beautifully, sort of constructed.

    22. JP

      Yeah. So, we co- we have a do operator, so the do calculus connected on the do operator itself, connects the operation of doing to something that we can see.

    23. LF

      Right. So, as opposed to the purely observing, you're making the choice to change a variable.

    24. JP

      Yeah. That's what it, it expresses.

    25. LF

      Hm. Gotcha.

    26. JP

      And then, the way that we interpret it, and the mechan- mechanism by which we take your query, and we translate it into something that we can work with, is by giving it semantics. Saying that you have a model of the world, and you cut off all the incoming arrow into X, and you're looking now in the modified, mutilated model, you ask for the probability of Y. That is interpretation of doing X, because by doing things, you've liberated them from all influences that acted upon them earlier, and you subject them to the tyranny of your muscles.

    27. LF

      So you... (laughs) You remove all the questions about causality by doing them.

    28. JP

      So, because-

    29. LF

      You're now-

    30. JP

      ... it's one level of questions.

  9. 34:4737:09

    Why adding arrows can hurt: assumptions, identifiability, and when experiments are necessary

    1. JP

      I construct a model, I can still cannot answer it.

    2. LF

      Right.

    3. JP

      I have to see if I have enough information in the model that would allow me to find out the effects of intervention from a non-interventional study. From observation- hands-off study.

    4. LF

      Right. So, what's needed to make that-

    5. JP

      You need to have assumptions about who affects whom. If the, if the graph had a certain property, the answer is yes, you can get it from observational study.

    6. LF

      Mm-hmm.

    7. JP

      If the graph is too meshy, bushy, bushy, the answer is no, you cannot. Then, you need to find either different kind of observation that you haven't considered or one experiment.

    8. LF

      So basically, does that put... That puts a lot of pressure on you to encode wisdom into that graph.

    9. JP

      Correct. But you don't have to encode more than what you know. God forbid. (laughs) If you put... like economists are doing that, they call it identifying assumption. They put assumptions, even they don't prevail in the world, they put assumptions so they can identify things. Let no-

    10. LF

      But the problem is... Yes, beautifully put, but the problem is you don't know what you don't know. So...

    11. JP

      You know what you don't know, because if you don't know, you say, "It's possible. It's possible that X affect the, uh, traffic tomorrow." And I'm...

    12. LF

      Yeah.

    13. JP

      It's possible. You put down a arrow which says it's pos- every arrow in the graph says it's possible.

    14. LF

      So, there's not a significant cost to adding arrows that-

    15. JP

      The more arrow you add-

    16. LF

      The better.

    17. JP

      ... the less likely you are to identify things from purely observational data.So, if the whole world is bushy, and every- everybody ac- affect everybody else, the answer is, you can answer it ahead of time. I cannot answer my query from observational data. I have to go to experiments.

    18. LF

      Right. So, you talk about machine learning as essentially learning by s- association, or reasoning by association. And this new calculus is allowing for intervention. I like that word. Uh, but-

    19. JP

      Intervention.

  10. 37:0940:46

    Counterfactuals as explanations: responsibility, regret, and the limits of today’s ML

    1. LF

      ... action. So, you also talk about counterfactuals.

    2. JP

      Yeah.

    3. LF

      And I'm trying to sort of understand the difference in counterfactuals and in- in- intervention. Uh, what's the... well, first of all, what is counterfactuals, and why are they useful? Wh- wh- why are they, eh, eh, especially, uh, useful as opposed to just reasoning what- what effect actions have?

    4. JP

      Well, conf- counterfactual contains what we normally call explanations.

    5. LF

      Can you give an example of a counterfactual?

    6. JP

      If I tell you that acting one way affects something else, I didn't explain anything yet. But if I, if I ask you, uh, "Was it the aspirin that cure my headache?" I'm asking for explanation. What cure my headache?

    7. LF

      Mm-hmm.

    8. JP

      And putting a finger on aspirin provide information. It was aspirin. It was responsible for your headache going away. If y- if you didn't take the aspirin, you will still have a headache.

    9. LF

      So by s- by saying, "If I didn't take aspirin, I would have a headache." You're thereby saying that aspirin is the thing that removes the headache.

    10. JP

      Yeah, but you have to have another important information. "I took the aspirin, and my headache is gone." Okay? It's very important information. Now I'm reasoning backward, and I said, "Was it the aspirin?"

    11. LF

      Yeah.

    12. JP

      Okay.

    13. LF

      By considering what would have happened if everything else is the same, but I didn't take aspirin.

    14. JP

      That's right.

    15. LF

      Counterfactual.

    16. JP

      So, in order, things took place, you know. Joe killed Schmo, okay?

    17. LF

      Yeah.

    18. JP

      And Schmo would, uh, would be alive had Joe not used his gun.

    19. LF

      Right.

    20. JP

      Okay? So, that is the counterfactual. It, it, it conf- it had a confliction, it had a conflict here or clash between observed fact that he, he did shoot, okay, and the hypothetical predicate, which says, "Had he not shot." You have a clash, a logical clash. They cannot exist together. That's a counterfactual, and that is the source of our explanation of our idea of responsibility, regret, and free will.

    21. LF

      Yeah, so it certainly seems, uh, that's the highest level of reasoning, right, is counterfactual?

    22. JP

      Yes, and physicists do it all the time.

    23. LF

      Who does it all the time?

    24. JP

      Physicists.

    25. LF

      Physicists.

    26. JP

      In every equation of physics, let's say you have a Hooke's law, and you put, uh, one kilogram on the spring, and the spring is one meter, and you say, "Had this weight been two kilogram, the spring would have been twice as long." It's no problem in, for a physicist-

    27. LF

      Hmm.

    28. JP

      ... to say that. Accept that mathematics is only i- i- is in the form of equation. Okay? Equating the weight, proportionality constant, and the length of the string. So, you don't have the asymmetry in the equation of physics, although every physicist thinks counterfactually. Ask the high school kids.

    29. LF

      Mm-hmm.

    30. JP

      Had the weight been three kilograms, what would be the length of the spring? They can answer it immediately, because they do the counterfactual processing in their mind, and then they put it into equation, algebra equation, and they solve it. Okay? But a robot cannot do that.

  11. 40:4644:37

    How could machines learn causality? Manipulation, noise, and inferring the “strings behind the facts”

    1. LF

      How do you make a robot learn these relationships?

    2. JP

      Uh, well, and, and why you would learn?

    3. LF

      Or not learn.

    4. JP

      Suppose you tell him. Can you do it? See, before you go learning-

    5. LF

      Yeah.

    6. JP

      ... you have to ask yourself, suppose I give him all the information, okay? How... can the, can the robot perform a task that I ask him to perform? Can he reason and say, "No, it wasn't the aspirin. It was the good news you received on the phone."

    7. LF

      Right, because, well, unless the robot had a model, uh, a, a causal, um, model of the world.

    8. JP

      Right, right. That's right.

    9. LF

      Look, I'm sorry I have to linger on this.

    10. JP

      But now we have to linger, and we have to say how do we, how do we do it?

    11. LF

      How do we build it?

    12. JP

      Yes.

    13. LF

      How do we build a causal model without a h- a team of human experts-

    14. JP

      No, put- eh- eh-

    15. LF

      ... running around?

    16. JP

      ... why don't, why don't you go to learning right away? You're too much involved with learning.

    17. LF

      'Cause I like babies. Babies learn fast.

    18. JP

      Oh, yeah.

    19. LF

      I'm trying to figure out how they do it.

    20. JP

      Oh, yeah. Okay, good.

    21. LF

      So, yeah.

    22. JP

      That's another question. How do the babies come out with a counterfactual model of the world?

    23. LF

      Yeah.

    24. JP

      And babies do that.

    25. LF

      Yeah.

    26. JP

      They know how to play with the... in the crib. They know which balls hits another one.

    27. LF

      Mm-hmm.

    28. JP

      And, but they learn it by, um, playful manipulation of the world.

    29. LF

      Yes.

    30. JP

      In a simple world involve only toys, and balls, and chimes (laughs) and bells. But it's a v- if you think about it, it's a complex world.

  12. 44:3759:13

    Metaphor as intelligence: mapping the unfamiliar to the familiar (and why curve-fitting isn’t enough)

    1. JP

      I think it's a matter of combining simple models from many, many sources-

    2. LF

      Mm-hmm.

    3. JP

      ... from many, many disciplines, and many metaphors. Metaphors are the basics of human intelligence, basis.

    4. LF

      Yeah. So h- how do you think of, about a metaphor, in terms of its use in human intelligence?

    5. JP

      Metaphors is an expert system.

    6. LF

      Mm-hmm.

    7. JP

      An expert... it's, it's mapping problem with which you are not familiar to a problem with which you are familiar.

    8. LF

      Mm-hmm.

    9. JP

      Like, I'll give you a good example. The Greek believed that the sky is an opaque shell. It's not really outsp- in infinite space. It's an opaque shell, and the stars are holes-

    10. LF

      Yeah.

    11. JP

      ... poked in the shell through which you see the eternal light. Okay? That was a metaphor. Why? Because they u- they understand how you poke holes in shells, okay? They're not f- they were not familiar with infinite space, okay? And it's, so, a- and, and we are walking on a shell of a turtle, and if you get too close to the edge, you're going to fall down to Hades, or wherever.

    12. LF

      Yeah.

    13. JP

      Yeah? Um, that's a metaphor. It's not true. But this kind of metaphor enabled Aristarchus to measure the radius of the Earth.

    14. LF

      Hm.

    15. JP

      Because he said, "Come on. If the w- we are walking on a turtle shell, then the ray of light coming through this angle will be different, um, this place, will be a different angle that's coming to this place. I know the distance. I'll measure the two, uh, angles, and then I have the radius of the shell of the, of the turtle."

    16. LF

      Mm-hmm.

    17. JP

      Okay? And he did. And he found his measurement very close to the measurements we have today through the, uh, what, 6,000 and 700, 700 kilometers (laughs) of the Earth. That's something that would not h- occur to Babylonian astronomer, even though the Babylonian experiments were the machine learning people of the time. They fit curves, and they could predict the, um, eclipse of the moon much more accurately than the Greek, because they fit curve. Okay? Uh, so that's the difference in metaphor.

    18. LF

      Mm-hmm.

    19. JP

      Something that you're familiar with. Again, a turtle shell. Okay? What does it mean if you are familiar? Familiar means that answers to certain questions are explicit. You don't have to derive them.

    20. LF

      And they were made explicit, because somewhere in the past, you've constructed a model of that. Uh-

    21. JP

      Before, yeah, you- you're familiar with-

    22. LF

      Yeah.

    23. JP

      So the child is familiar with billiard balls.

    24. LF

      Yes.

    25. JP

      So the child could predict that if you let loose of one ball, the other one will bounce off. These are... You, you obtain that by, um, familiarity. Familiarity is answering questions, and you store the answer explicitly. You don't have to derive them. So, this is ideal for metaphor. All our life, all our intelligence is built around metaphors. Mapping from the unfamiliar to the familiar, but the, um, marriage between the two is a tough thing, which I, which we haven't yet been able to algorithmatize.

    26. LF

      So, you think of that process of cas- of using metaphor to leap from one place to another as, we can call it reasoning? Is it a kind of reasoning?

    27. JP

      It is, uh, reasoning by metaphor, metaphorical reason-

    28. LF

      Reasoning by metaphor.

    29. JP

      ... yeah.

    30. LF

      Do you think of that as learning?So, learning is a popular terminology today in a narrow sense.

  13. 59:131:04:31

    Toward human-level AI: communication, ethics, self-models, and consciousness as a ‘software blueprint’

    1. LF

      Got it. So, I know you're not a futurist, but are you excited? Have you, when you look back at your life, longed for the idea of creating a human level intelligence system?

    2. JP

      Yeah. I'm, uh, I'm driven by that. All my life, I'm driven just by one thing (laughs) . But I go slowly. I go from what I know, to the next step, incrementally.

    3. LF

      So, without imagining what the end goal looks like, do you imagine what an-

    4. JP

      Yes, end- the end goal is going to be a machine that can answer sophisticated questions-

    5. LF

      Questions.

    6. JP

      ... counterfactuals, regret, compassion, um, responsibility, and free will.

    7. LF

      So, what is a good test? Is a Turing test a reasonable test?

    8. JP

      If there is test of free will, doesn't exist yet, uh, there's no-

    9. LF

      How would you test free will? Uh, that's a-

    10. JP

      So far, we know only one thing. I mean (laughs) , if robots can communicate with reward and punishment among themselves, and hitting each other on the wrists and say, "You shouldn't have done that."

    11. LF

      Mm-hmm.

    12. JP

      Okay? Um, playing better soccer because they can do that.

    13. LF

      What do you mean, because they can do that?

    14. JP

      Because they can communicate among themselves. It's ƒ-

    15. LF

      Because of the communication, they can do the soccer?

    16. JP

      Because they communicate like us.

    17. LF

      Yeah.

    18. JP

      Reward and punishment. Yes, you didn't pass the ball the right, the right time and so forth, therefore you're going to sit on the bench for the next two... If they start communicating like that, the question is, will they play better soccer?As opposed to what? As opposed to what they do now, without this ability to reason about, uh, uh, reward and punishment, responsibility.

    19. LF

      And a lot of factions.

    20. JP

      So far, I can only think about communication.

    21. LF

      Communication is... And, and, and not necessarily natural language, but just communication.

    22. JP

      Just communication. And that's important to have a quick and effective means of communicating knowledge. If the coach tells you, "You should have passed the ball." Ping. He conveys so much knowledge to you, as opposed to, what? Go down and change your software. Right. That's the alternative. But the coach doesn't know your software.

    23. LF

      Right.

    24. JP

      So, how can the coach tell you, "You should have passed the ball?" But that... (laughs) Our language is very effective. "You should have passed the ball." You know your software, you tweak the right module, okay? And next time, you don't do it.

    25. LF

      Now, that's for playing soccer where the rules are well defined.

    26. JP

      No, no, no. Well, they're not well defined. When you should pass the ball and when you should stop-

    27. LF

      Is not well defined.

    28. JP

      No, it's a-

    29. LF

      Yeah.

    30. JP

      It's very soft world, very noisy.

  14. 1:04:311:19:08

    Risk, society, and personal tragedy: AI as a new species; Israel, religion, and the story of Daniel Pearl

    1. LF

      Do you have concerns about the future of AI, all the different trajectories of all of our research?

    2. JP

      Yes.

    3. LF

      Um, where's your hope, where the movement heads? Where are your concerns?

    4. JP

      I- I'm concerned, because I know we are building a new species that has the capability of exceeding our... Exceeding us, uh, uh, exceeding our capabilities, and can breed itself and take over the world. Absolutely. It's a new species. It's uncontrolled. We don't know the degree to which we control it. We don't even understand what it means to be able to control this new species. Uh, so, I'm concerned. I don't have anything to add to that, because it's such a gray area, an unknown. It never happened in history.

    5. LF

      Yeah.

    6. JP

      Okay? The only, the only time it happened in history was evolution with a human being.

    7. LF

      Right.

    8. JP

      And it wasn't very successful, was it? (laughs)

    9. LF

      Uh-

    10. JP

      Some people says it was a great success.

    11. LF

      For us, it was, but a few people along the way, uh, or a few creatures along the way would not agree. So, uh, so, it's just because it's such a gray area, there's nothing else to say.

    12. JP

      Uh, we have a sample of one.

    13. LF

      Sample of one.

    14. JP

      That's us.

    15. LF

      But-

    16. JP

      We dominantly

    17. NA

      (murmuring)

    18. LF

      ... some people would look at you and say, "Yeah, but we were looking to you to help us make sure that-

    19. JP

      Correct.

    20. LF

      "... sample two works out okay."

    21. JP

      Actually, we have more than a sample of one. We have theory of, theories.

    22. LF

      Yeah.

    23. JP

      And that's a good... And s- we don't need to be statisticians. So, sample of one doesn't mean any, uh, uh, poverty of knowledge.

    24. LF

      Yeah.

    25. JP

      It's not. Sample of one plus theory, conjectural theory of what could happen.

    26. LF

      Yeah.

    27. JP

      That, we do have. But I- I really feel helpless in contributing to this argument, because I know so little, and, uh, and my imagination is limited. And, uh, I know how much I don't know, and, uh-

    28. LF

      Hmm.

    29. JP

      ... I... But I'm concerned.

    30. LF

      You were born and raised in Israel.

  15. 1:19:081:23:00

    Advice, rebellion, and legacy: ask questions you can’t yet name

    1. LF

      (laughs) Do you have advice for young minds today dreaming about creating, as you have dreamt, creating intelligent systems? What is the best way to arrive at new breakthrough ideas and carry them through the fire of criticism and, uh, and past conventional ideas?

    2. JP

      Ask your questions (laughs) freely. Your questions are never dumb.

    3. LF

      (laughs)

    4. JP

      And solve them your own way, (laughs) okay? And don't take no for an answer. Look, if they are really dumb, you will find out quickly by trying an arrow to see that they're not leading any place. But follow them and try to understand things your way. That is my, uh, advice. I don't know if it- it's going to help anyone.

    5. LF

      No, that brilliantly put.

    6. JP

      But I-- there is a lot of, uh, I would say, inertia in science, in academia. It is slowing down science.

    7. LF

      Yeah, those two words: "your way." That's a powerful thing.

    8. JP

      Um, yes.

    9. LF

      It's against inertia, potentially, against the flow.

    10. JP

      Against your professor.

    11. LF

      (laughs) Against your professor.

    12. JP

      It is, uh... I- I wrote The Book of Why-

    13. LF

      Yeah.

    14. JP

      ... in order to democratize common sense, yeah. (laughs)

    15. LF

      (laughs)

    16. JP

      In order to instill rebellious spirits in students so they wouldn't wait until the professor get things right.

    17. LF

      (laughs) Thus, you wrote the manifesto of the rebellion against the professor. (laughs)

    18. JP

      Against the professor, yes.

    19. LF

      So, looking back at your life of research, what ideas do you hope ripple through the next many decades? What, what do you hope your legacy will be?

    20. JP

      I already have a tombstone, uh, carved. (laughs)

    21. LF

      (laughs) Oh boy, yeah.

    22. JP

      The fundamen- the fundamental law of counterfactuals, that what, uh, it said. (laughs)

    23. LF

      (laughs)

    24. JP

      It's a simple equation. What a counterfactual in terms of a model surgery.

    25. LF

      Mm-hmm.

    26. JP

      That's it, because everything follows from that. If you get that, all the rest, I can die in peace, and my student can derive all my knowledge (laughs) by mathematical means.

    27. LF

      The rest follows.

    28. JP

      Yeah.

    29. LF

      Judea, thank you so much for talking today. I really appreciate it.

    30. JP

      Well, thank you for being so attentive and, uh, instigating. (laughs)

Episode duration: 1:23:01

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Transcript of episode pEBI0vF45ic

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.