Skip to content
Lex Fridman PodcastLex Fridman Podcast

Vijay Kumar: Flying Robots | Lex Fridman Podcast #37

Lex Fridman and Vijay Kumar on vijay Kumar reveals future of agile flying robots and swarms.

Lex FridmanhostVijay Kumarguest
Sep 8, 201956mWatch on YouTube ↗

EVERY SPOKEN WORD

  1. 0:0015:00

    The following is a…

    1. LF

      The following is a conversation with Vijay Kumar. He's one of the top roboticists in the world, a professor at the University of Pennsylvania, a dean of Penn Engineering, former director of GRASP Lab, or the General Robotics Automation Sensing and Perception Laboratory at Penn that was established back in 1979. That's 40 years ago. Vijay is perhaps best known for his work in multi-robot systems, robot swarms, and micro aerial vehicles, robots that elegantly cooperate in flight under all the uncertainty and challenges that the real-world conditions present. This is the Artificial Intelligence Podcast. If you enjoy it, subscribe on YouTube, give it five stars on iTunes, support it on Patreon, or simply connect with me on Twitter at Lex Fridman, spelled F-R-I-D-M-A-N. And now, here's my conversation with Vijay Kumar. What is the first robot you've ever built or were a part of building?

    2. VK

      Way back when I was in graduate school, I was part of a fairly big project that involved building a very large hexapod. This weighed close to 7,000 pounds, and it was powered by hydraulic actuation, or was actuated by hydraulics with 18 motors, hydraulic motors, each controlled by an Intel 8085, uh, processor and an Inte- 8086 co-processor. And so imagine this huge, uh, monster that had 18 joints, each controlled by an independent computer, and there was a 19th computer that actually did the coordination between these 18 joints. So I was part of this project and my thesis work was how do you coordinate the 18 legs, and in particular, the-the pressures in the hydraulic cylinders to get efficient locomotion.

    3. LF

      It sounds like a giant mess. So how difficult is it to make all the motors communicate? Presumably, you have to send signals hundreds of times a second, or at least-

    4. VK

      Yeah, so this was not my work, but the-the folks who worked on this wrote what I believe to be the first multiprocessor operating system. This was-

    5. LF

      Mm-hmm.

    6. VK

      ... in the '80s. And you have to make sure that, uh, obviously messages got across from one joint to another. You have to remember, the-the clock speeds on those computers were about half a megahertz.

    7. LF

      (laughs) Right.

    8. VK

      So.

    9. LF

      The '80s. So not to romanticize the notion, but how did it make you feel to make-to see that robot move?

    10. VK

      It was amazing. In hindsight, it looks like, well, we built this thing which really should have been much smaller. And of course, today's robots are much smaller. You look at, you know, Boston Dynamics or Ghost Robotics, a spinoff from-from Penn. But back then, you were stuck with the substrate you had, the compute you had, so things were unnecessarily big. But at the same time, uh, and this is just human psychology, somehow bigger means grander. You know, people never have the same appreciation for nanotechnology or nano devices as they do for the space shuttle or the Boeing 747.

    11. LF

      Yeah, you've actually done quite a good job at illustrating that small is beautiful.

    12. VK

      Yeah.

    13. LF

      In terms of robotics. (laughs)

    14. VK

      Yeah.

    15. LF

      So what is, uh, on that topic is the most beautiful or elegant robot in motion that you've ever seen? Not to pick favorites or whatever, but something that just inspires you, that you remember.

    16. VK

      Well, I think thing that I'm-I'm most proud of that my students have done is really think about, uh, small UAVs that can maneuver in constrained spaces, and, uh, p- in particular, their ability to coordinate with each other and form three-dimensional patterns. So once you can do that, uh, you can, uh, essentially create 3D objects in the sky, and you can deform these objects on the fly. So in some sense, your toolbox of what you can create is suddenly got enhanced.

    17. LF

      Mm-hmm.

    18. VK

      And before that, we did the two-dimensional version of this, so we had, uh, ground robots forming patterns and-and so on. So that-that was not as impressive, that was not as beautiful. But if you do it in 3D, suspended in midair, and you've got to go back to 2011 when we did this, um, now it's actually pretty standard to do these things eight years later, um, but back then, it was a big accomplishment.

    19. LF

      So the distributed cooperation is where-is where beauty emerges, in your eyes?

    20. VK

      Well, I think beauty to an engineer is very different from-from beauty to, you know, someone who's looking at robots from the outside, if you will.

    21. LF

      Yeah.

    22. VK

      But what I meant there, so before we said that grand is associated with size, and, um, another way of thinking about this is just the physical shape, and the idea that you can create physical shapes in midair and have them deform, uh, that's beautiful.

    23. LF

      But the individual components, the agility is beautiful too, right?

    24. VK

      That is true too.

    25. LF

      So the-the aggressive move.

    26. VK

      So then how-how quickly can you actually manipulate these three-dimensional shapes and the individual components? Yes, you're right.

    27. LF

      Uh, by the way, you said UAV, unmanned aerial vehicle. Well, what's a good term for drones, UAVs, quadcopters? Is there a term that's stan- being standardized?

    28. VK

      I don't know if there is. Everybody wants to use the word drones. And I've often said this, drones to me is a pejorative word. It-it signifies something that's, uh, that's dumb-

    29. LF

      Hmm.

    30. VK

      ... that's pre-programmed, that does one little thing. And robots are anything but drones. So I actually don't like that word, but that's what everybody uses.You could call it unpiloted.

  2. 15:0030:00

    Yeah. So let's step…

    1. VK

      the cost of individuals would really become insignificant.

    2. LF

      Yeah. So let's step back at a high-level view, the impossible question of what kind of, and as an overview, what kind of autonomous flying vehicles are there, in general?

    3. VK

      I think the ones that, uh, receive a lot of notoriety are obviously the military vehicles. Military vehicles are controlled by a base station, but have, uh, uh, a lot of human supervision, but they have limited in- autonomy, which is the ability to go from point A to point B, and even the more sophisticated now, f- sophisticated vehicles can do autonomous, uh, takeoff and landing.

    4. LF

      And those usually have wings, and they're heavy, and they-

    5. VK

      Usually, they're wings, but then there's nothing preventing us from doing this for helicopters as well. There are, um, I mean, there are many military organizations that have autonomous helicopters in the same vein. And by the way, you, you look at autopilots in airplanes, and it's, it's actually very similar. In fact, I can... One interesting question we can ask is, uh, if you look at all the air safety violations or all the crashes that occurred-

    6. LF

      Yeah.

    7. VK

      ... would they have happened if the plane were truly autonomous? And I think you'll find that, in many of the cases, you know, because of pilot error, we make silly decisions. And so, in some sense, even in air traffic, commercial air traffic, there's a lot of applications, uh, although we only see, um, autonomy being enabled at very high altitudes, uh, when, when the, the, the pilot, the, the, the, the plane is on autopilot.

    8. LF

      There's still a role for the human, and tho- that kind of, uh, autonomy is, uh, you're kind of implying, I don't know what the right word is, but it's a little dumb- dumber than it could be.

    9. VK

      Uh, right.

    10. LF

      It's simpler and naive.

    11. VK

      So, so in the lab, of course, we can, we can, we can afford to be a lot more aggressive.

    12. LF

      Right.

    13. VK

      And the question we try to ask is, um, can we make robots that would be able to make decisions without any kind of external infrastructure?

    14. LF

      Right.

    15. VK

      So what does that mean? So the most common piece of infrastructure that airplanes use today is GPS. Uh, GPS is also the most brittle form of information. Um, if you have driven in a city, tried to use GPS navigation, you know, in tall buildings, you immediately lose GPS. Um, and so that's not a very sophisticated way of building autonomy. I think the second piece of infrastructure they rely on is communications. Again, it's very easy to jam communications. In fact, if you use wifi, you know that wifi signals drop out, cell signals drop out, so to rely on something like that is not, is not good. The third form of infrastructure we, we use, and I hate to call it infrastructure, but, but it is that in the sense of robots, is people.

    16. LF

      (laughs)

    17. VK

      So you could rely on somebody to pilot you. (laughs)

    18. LF

      Right.

    19. VK

      Uh, and so the question you wanna ask is if there are no pilots, if there's no communications with any base station-

    20. LF

      Yeah.

    21. VK

      ... if there's no knowledge of position-... and if there's no a priori map, a priori knowledge-

    22. LF

      Yeah.

    23. VK

      ... of what the environment looks like, a priori model of what might happen in the future, can robots navigate? So that is true autonomy.

    24. LF

      Right. So that's, that's true autonomies. And we're talking about, you mentioned, like military applications of drones. Okay, so what else is there? You talk about agile, autonomous flying robots, aerial robots. So that's a different kind of... It's not winged, it's not big. At least, it's small.

    25. VK

      So I use the word agility mostly, or at least we're motivated to do agile robots, mostly because robots can operate and should be operating in constrained environments.

    26. LF

      Mm-hmm.

    27. VK

      And if you want to operate the way a Global Hawk o- operates, I mean, the kinds of conditions in which you operate are very, very restrictive. If you go, want to go inside a building, for example, for search and rescue, or to locate an active shooter, or you want to navigate under the canopy in an orchard to look at health of plants or to, to look for, count, to count fruits-

    28. LF

      Mm-hmm.

    29. VK

      ... to measure the tree, the tree trunks, uh, these are things we do, by the way. Uh-

    30. LF

      Yeah. Yeah, some cool agriculture stuff you've shown in, in the past. It's really awesome.

  3. 30:0045:00

    Mm-hmm. …

    1. VK

      to fly through a constrained space, the first time you try it, you'll invariably... you might get it wrong-

    2. LF

      Mm-hmm.

    3. VK

      ... if, if the task is challenging. And the reason is, to get it perfectly right, you have to model everything in the environment.

    4. LF

      Right.

    5. VK

      And flying is notoriously hard to model. There are aerodynamic effects that we constantly discover, uh-... even just before I was talking to you, I was talking to a student about how blades flap when they fly.

    6. LF

      Huh. Wow.

    7. VK

      And that ends up changing how a rotorcraft is accelerated in the angular direction.

    8. LF

      So these are like micro flaps or something? It's mo- it's-

    9. VK

      It's not micro flaps. So we assume that each blade is, is rigid but actually it flaps a little bit.

    10. LF

      Oh.

    11. VK

      It bends.

    12. LF

      Interesting. Yeah.

    13. VK

      And so the models rely on the fact, on the, on, on an assumption that they're, they're actually rigid. But that's not true. If you're flying really quickly, these effects become significant. If you're flying close to the ground, you get pushed off by the ground, right? Something which every pilot knows when, when he tries to land or she tries to land.

    14. LF

      Mm-hmm.

    15. VK

      This is, this is called the ground effect. Something very few pilots think about is what happens when you go close to a ceiling or you get sucked into a ceiling. There are very few aircrafts that fly close to any kind of ceiling. Likewise, when you go close to, close to a wall, uh, there are these wall effects. Um, and if you've gone on a train and you pass another train that's traveling the opposite direction, you feel the buffeting. And so these kinds of micro climates-

    16. LF

      Mm-hmm.

    17. VK

      ... affect our UAVs significantly. So if you want-

    18. LF

      And they're impossible to model, essentially (overlapping)

    19. VK

      I wouldn't say they're-

    20. LF

      ... explicitly.

    21. VK

      ... impossible to model, but the level of sophistication you would need in the model and the software would be tremendous. Uh, plus to get everything right would be awfully tedious. So the way we do this is over time, we figure out how to adapt to these conditions. Um, so we've, early on, we used a form of learning that we call iterative learning. Uh, so this idea if you wanna perform a task, um, there are a few things that you need to change-

    22. LF

      Mm-hmm.

    23. VK

      ... uh, and iterate over, few parameters, that over time, you can, you can, you can figure out. So I could call it policy gradient reinforcement learning.

    24. LF

      (laughs)

    25. VK

      But actually it was just iterative learning.

    26. LF

      Iterative learning.

    27. VK

      Um, and so this was there way back. I think what's interesting is, if, if you look at autonomous vehicles today, learning occurs, could occur in two pieces. One is perception. Understanding the world. Second is action, taking actions. Everything that I've seen that is successful is in the perception side of things. So in computer vision, we've made amazing strides in the last 10 years. So recognizing objects, actually detecting objects, classifying them, uh, uh, uh, and, and tagging them in some sense, annotating them. This is all done through machine learning. On the action side on the other hand, I don't know of any examples where there are fielded systems where we actually learn the right behavior.

    28. LF

      Outside a single demonstration of success that you know-

    29. VK

      Oh, in the laboratory, this is the Holy Grail. Can you do end-to-end learning? Can you go from pixels to motorbo- motor currents? This is really, really hard. And I think if you loo- go forward, the right way to think about these things is, uh, data-driven approaches, learning-based approaches, in concert with model-based approaches, which is the traditional way of doing things.

    30. LF

      Right.

  4. 45:0056:31

    Mm-hmm. …

    1. VK

      and how humans interact with robots-

    2. LF

      Mm-hmm.

    3. VK

      ... you know, we think of these in sort of three different ways. One is the human commanding the robot. The second is the human collaborating with the robot. So, for example, we work on how a robot can actually pick up things with a human and carry things.

    4. LF

      With a human, yeah.

    5. VK

      That's, like, true collaboration. And, uh, third, we think about humans as bystanders. Self-driving cars, what's the human's role and how does- how do self-driving cars acknowledge the presence of humans?

    6. LF

      Right.

    7. VK

      Um, so I think all of these things are different scenarios. It depends on what kind of humans, what kind of task, and I think it's very difficult to say that there's a general theory that we all have for this. But at the same time, it's also silly to say that we- we should think about robots independent of humans.

    8. LF

      Right.

    9. VK

      So, to me, human-robot interaction is almost a mandatory aspect of everything we do.

    10. LF

      Yes, so but to which degree? So what- your thoughts, if we jump to autonomous vehicles, for example, there's a- there's a big debate between what- what's called Level 2 and Level 4, so semi-autonomous and autonomous vehicles. And sort of the Tesla approach currently, at least, has a lot of collaboration between human and machine. So the human is supposed to actively supervise the operation of the robot. Part of the safety, uh, definition of how safe a- a robot is, in that case, is how effective is the human in monitoring it? Do you think that's ultimately not a good, uh, approach in sort of having a human in the picture, not as a bystander or, um, part of the infrastructure, but really as part of what's required to make the system safe?

    11. VK

      This is harder than it sounds.

    12. LF

      Right.

    13. VK

      I think, you know, if you- if you- if- I mean, I'm sure you've driven the- driven before on highways and so on. It's- it's really very hard to have to relinquish control to a machine and then take over when needed.

    14. LF

      Right.

    15. VK

      So I think Tesla's approach is interesting 'cause it allows you to periodically establish some kind of contact, uh, with the car. Toyota, on the other hand, is thinking about shared autonomy as a-

    16. LF

      Mm-hmm.

    17. VK

      ... or collaborative autonomy as a paradigm. If I may argue, these are very, very simple ways of human-robot collaboration 'cause the task is pretty boring. You sit in a vehicle. You go from point A to point B. I think the more interesting thing to me is, uh, for example, search and rescue. I've got a human-

    18. LF

      Right.

    19. VK

      ... first responder, robot first responders. I gotta do something. It's important. I have to do it in two minutes. The building is burning. There's been an explosion. It's collapsed. How do I do it? I think, to me, those are the interesting things where it's very, very unstructured, and what's the role of the human? What's the role of the robot? Uh, clearly, there's lots of interesting challenges, and as a field, I think we're gonna make a lot of progress in this area.

    20. LF

      Yeah, it's an exciting form of collaboration. You're right. In autonomous driving, the main enemy is just boredom of the human. (laughs)

    21. VK

      Yes.

    22. LF

      As- as opposed to in rescue operations, it's literally life and death, and, uh, the collaboration enables the effective completion of the mission, so it's exciting.

    23. VK

      Well, in some sense, you know, we're also doing this. You- you know, you think about the human driving a car, and almost invariably, the human's trying to estimate the state of the car, estimate the state of the environment, and so on. But what if the car were to estimate the state of the human?

    24. LF

      Mm-hmm.

    25. VK

      So, for example, I'm sure you have a smartphone, and the smartphone, uh, tries to figure out what you're doing and send you reminders.... and oftentimes telling you to drive to a certain place, although you have no intention of going there-

    26. LF

      (laughs)

    27. VK

      ... because it thinks that that's where you should be, uh, 'cause of some Gmail calendar entry, um, e- or, or, or something like that. And, and, you know, it's trying to f- constantly figure out who you are, what you're doing. If a car were to do that, maybe that would make the driver safer because the car is trying to figure out, is the driver paying attention, looking at his or her eyes, um-

    28. LF

      Mm-hmm. All that kind of stuff.

    29. VK

      ... looking at saccadic movements. So, I think the potential is there. But from the reverse side, it's not robot modeling, but it's human modeling.

    30. LF

      It's more on the human, right?

Episode duration: 56:46

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Transcript of episode HYsLTNXMl1Q

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome