Lex Fridman PodcastRegina Barzilay: Deep Learning for Cancer Diagnosis and Treatment | Lex Fridman Podcast #40
EVERY SPOKEN WORD
135 min read · 26,718 words- 0:00 – 15:00
The following is a…
- LFLex Fridman
The following is a conversation with Regina Barzilay. She's a professor at MIT and a world-class researcher in natural language processing and applications of deep learning to chemistry and oncology or the use of deep learning for early diagnosis, prevention, and treatment of cancer. She's also been recognized for teaching of several successful AI-related courses at MIT including the popular Introduction to Machine Learning course. This is the Artificial Intelligence podcast. If you enjoy it, subscribe on YouTube, give it five stars on iTunes, support it on Patreon or simply connect with me on Twitter @lexfridman, spelled F-R-I-D-M-A-N. And now here's my conversation with Regina Barzilay. In an interview you've mentioned that if there's one course you would take, it would be a literature course with a friend of yours, uh, that a friend of yours teaches. Just out of curiosity, 'cause I couldn't find anything on it, are there books or ideas that had profound impact on your life journey, books and ideas perhaps outside of computer science and the technical fields?
- RBRegina Barzilay
I think because I'm spending a lot of my time at MIT and previously in other institutions where I was a student, I have limited ability to interact with people so a lot of what I know about the world actually comes from books, uh, and there were quite a number of books that had profound impact on me and how I view the world. Let me just give you, mm, one example of such a book. I've, um, maybe a year ago read a book called The Emperor of All Maladies. It's a book about, um, it's kind of a history of science book on how the treatments and drugs for cancer were developed, and that book, despite the fact that I am in the business of science, really opened my eyes on how imprecise and imperfect the discovery process is, a- and how imperfect our current solutions, uh, and what makes science succeed and be implemented and sometimes it's actually not the strengths of the idea but devotion of the person who wants to see it implemented. So, this is one of the books that, you know, at least for the last year quite changed the way I'm thinking about scientific process just from the historical perspective and what do I need to do to make my ideas really implemented. Let me give you an example of a book which is not a kind of, uh, which is a fiction book, is a book called Americanah and this is a book about a young female student who comes from Africa to study in the United States and it describes her past, uh, you know, within her studies and, uh, her life transformation that, you know, i- in a new country and kind of adaptation to a new culture.
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
And when I read this book, I saw myself in many different points of it, uh, but- but it also (laughs) kind of gave me the- the lens on different events and some events that I never actually paid attention. One of the funny stories in this book is how she, uh, arrives, uh, to- to her c- new college and she starts speaking in English and she had this beautiful British a- accent because that's how she was educated-
- LFLex Fridman
Mm.
- RBRegina Barzilay
... uh, in her country, uh, this is not my case, uh, and then she notices that the person who talks to her, you know, talks to her in a very funny way, in a very slow way, and she's thinking that this woman is disabled, uh, in- in... and she's also trying to kind of to accommodate her and then after a while when she finishes her discussion with this officer from her college, she sees how she interacts with the other students, with American students, and she (laughs) discovers that actually, uh, she talked to her this way because she thought that she doesn't understand English-
- LFLex Fridman
Mm.
- RBRegina Barzilay
... and, uh, I thought, "Wow, this is a funny experience," and (laughs) l- literally within few weeks, I went to- to LA to a conference and I asked somebody in- in the airport, you know, how to find, like, the, a cab or something and then I noticed that this person is talking in a very strange way and my first thought was that this person have some, you know, pronunciation issues or something and I'm trying to talk very slowly to him and I was with another professor, Ernst Fränkel, and he's like laughing (laughs) because it's funny that I don't get that the guy is talking in- in this way because he thinks that I cannot speak. So, it was really kind of mirroring experience and it let me think a lot about my own experiences moving, you know, from different countries. So, I think that books play a big role in my understanding of the world.
- LFLex Fridman
On the- on the science question, you mentioned that it made you discover that personalities of human beings are more important than perhaps ideas. Is that what I heard?
- RBRegina Barzilay
It's not necessarily that they are more important than ideas, but I think that ideas on their own are not sufficient and, um, many times, at least at the local horizon, it's the personalities and their devotion to the ideas is really that locally changes the landscape. Now if- if you're looking at AI, like let's say 30 years ago, you know, dark ages of AI or whatever word is symbolic, times you can use any word, you know there were some people l- now- now we're looking at a lot of that work and we're kind of thinking this was not really, uh, maybe a relevant work, but you can see that some people managed to take it and to make it so shiny and, uh, dominate the, you know, the academic world and make it to be the standard. If you look at the area of natural language processing, uh-... it is well-known fact that the reason the statistics in NLP took such a long time to became- to become mainstream, because there were quite a number of personalities which didn't believe in this idea and then stop research progress in this area. So I do not think that, you know, kind of asymptotically maybe personalities matters, but I think, uh, locally-
- LFLex Fridman
(laughs)
- RBRegina Barzilay
... it does make quite a bit of impact and it's-
- LFLex Fridman
Beautifully put, okay. (laughs)
- RBRegina Barzilay
... generally, uh, you know, speed ups, speeds up the rate of adoption of the new ideas.
- LFLex Fridman
Yeah, and, uh, and the other interesting question is in, uh, the early days of particular discipline, I think you mentioned, uh, i- in- in that book was is ultimately a book of cancer. Oh-
- RBRegina Barzilay
It's called the, uh, The Emperor of All Maladies.
- LFLex Fridman
Yeah, the... yeah. And those maladies included the trying to- the medi- medicine? Was it centered around medicine mostly?
- RBRegina Barzilay
So, so it was actually, uh, centered on, you know, how people thought of curing cancer, like i- like for me, it was really a discovery how people... what was the science of chemistry behind drug development, that it actually grew up out of d- dying, like coloring industry, that people-
- LFLex Fridman
Hm.
- RBRegina Barzilay
... who developed chemistry in 19th century in Germany and Britain to do, you know, the, the really new dyes, uh, they looked at the molecules and identified that they do certain things to cells, and from there the process started and, you know, like historically saying, "Yeah, this is fascinating that they managed to make the connection and-"
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
"... look under the microscope and do all this discovery." But as you continue reading about it and you, uh, read about how ch- chemotherapy drugs, which were developed in Boston, and, uh, some of them were developed and, um, um, Farber, Dr. Farber from Dana-Farber, you know, how the experiments were done, uh, that, you know, there was some miscalculation, let's put it this way, and they tried it on the patients and then just, uh, and those were children with leukemia and they died. And then they tried another modification. You look at the process, how imperfect is this process and, you know, like if we're again loo- looking back like 60 years ago, 70 years ago, you can kind of understand it, but some of the stories in this book which were really shocking to me were really happening, you know, maybe decades ago, and we still don't have a vehicle to do it much more fast and effective and, you know, scientific the way I'm thinking computer science scientific.
- LFLex Fridman
So from the perspective of computer science, you've gotten a chance to work the application to cancer and to medicine in general. From a perspective of an engineer and a computer scientist, how far along are we from understanding the human body, biology, of being able to manipulate it in a way we can cure some of the maladies, some of the diseases?
- RBRegina Barzilay
So this is very interesting question and if you're thinking as a computer scientist about this problem, uh, I think one of the reasons that we succeeded in the areas we as a computer scientist succeeded is because we don't have... we are not trying to understand in some ways, like if you're thinking about like e-commerce, Amazon, Amazon doesn't really understand you and that's why it recommends you certain books or certain products, correct? And, uh, in, um, you know, traditionally when people were thinking about marketing, you know, they divided the population to different kind of subgroups, identify the features of this subgroup and come up with a strategy which is specific to that subgroup. Uh, if you're looking about recommendation system, they're not claiming that they're understanding somebody, they're just managing to, from the patterns of your behavior, to recommend you a product. Now, if you look at the traditional biology, and obviously I wouldn't by... say that I, uh, am at any w- way, you know, educated in this field, but, you know, what I see, there is really a lot of emphasis on mechanistic understanding, and it was very surprising to me coming from computer science how much emphasis is on this understanding. And e- given the complexity of the system, maybe the deterministic full understanding of these processes is, you know, beyond our capacity, and the same with the computer science when we do recognition, when doing recommendation in many other areas, i- it's just probabilistic matching process and in some way maybe, uh, in certain cases we shouldn't even attempt to understand or we can attempt to understand but in parallel we can actually do this kind of matchings that would, uh, help us to find cure or to do early diagnostics and so on.
- LFLex Fridman
Hm.
- RBRegina Barzilay
And I know that in these communities it's really important to understand, but I'm sometimes wondering (laughs) what exactly does it mean to understand here?
- LFLex Fridman
Well, there's stuff that works and, uh, but that can be, like you said, separate from this deep human desire to uncover the mysteries of the universe, of, uh, of science, of the way the body works, the way the mind works. It's the dream of symbolic AI, of being able to reduce human knowledge into, into logic and be able to play with that logic in a way that's very explainable and u- understandable for us humans. I mean, that's a beautiful dream. So I, I understand it, but it seems that what seems to work today, and we'll talk about it more, is as much as possible reduce stuff into data, reduce whatever problem you're interested in to data and try to apply (laughs) statistical methods, apply machine learning to that. On, on a personal note, you were diagnosed with breast cancer in 2014. What did facing your mortality make you think about? How did it change you?
- RBRegina Barzilay
You know, this is a great question and, uh, the thing that I was interviewed many times and nobody actually asked me this question. I think I was-... 43 at a time, and it's the first time I realized in my life that I may die. Uh, and I never thought about it before. And, yeah, and there is a long time since you're diagnosed until you actually know what you have and how severe is your disease. For me, it was like maybe two and a half months. And I didn't know where I am, uh, during this time because I was getting different tests and one would say it's bad and I would say, "No, it is not." So until I knew where I am, I really was thinking about all the different possible outcomes.
- LFLex Fridman
Were you imagining the worst or were you trying to be optimistic, or what-
- RBRegina Barzilay
It would be the really, uh, uh, I, I don't remember, you know, what was my thinking. It was really a mixture with many components at the time.
- 15:00 – 30:00
So though that moment...…
- RBRegina Barzilay
you, you start looking at this, you know, what do you perceive as important at different point of time? And how, you know, it's ca- fades over time. And, uh, since we have limited time, all of us have limited time on Earth, it's really important to prioritize things that really matter to you, maybe matter to you at that particular point. But it's important to take some time and understand what matters to you, which may not necessarily be the same as what matters to the rest of your scientific community and pursue that vision.
- LFLex Fridman
So though that moment... D- Did it make you cognizant, you mentioned suffering, of just the general amount of suffering in the world? Is that what you're referring to? So as opposed to topic models and specific detail, problems in NLP, did, did you start to think about other people who have been diagnosed with cancer?
- RBRegina Barzilay
Well, I-
- LFLex Fridman
Is that the way you saw the w- started to see the world, perhaps?
- RBRegina Barzilay
Oh, absolutely, and it actually (laughs) creates because, uh, like, uh, for instance, you know, there is parts of the treatment where you need to go to the hospital every day and you see, you know, the community of peoples that you see and many of them are much worse than I, I was at the time. And you all of a sudden see it all. And people who are happier someday just because they feel better and for people who are in our normal realm, you take it totally for granted that you feel well, that if you decide to go running, you can go running, and, uh, you can... You know, you're pretty much free to do whatever you want with your body. Like, I, I saw like a community, my community became those people. And, um, I, I remember one of my friends, Dina Katabi took me to Prudential to buy me a gift for my birthday, and it was like the first time in months that I went to kind of to see other people and I was like, "Wow, first of all, these people are, you know, they're happy and they're laughing and they're very different from these other my people." And secondly thinking, "Aren't they totally crazy? They're like laughing and wasting their money on, on some stupid, uh, gifts." And, you know, they may die. They already may have cancer.
- LFLex Fridman
Yeah.
- RBRegina Barzilay
And, and they don't understand it. So you can really see how, (laughs) how the mind changes, that you can see that... You know, before that you can ask, "Didn't you know that you're gonna die?" Of course I knew, but it was (laughs) kind of a theoretical notion, it wasn't something which was concrete. And at that point when you really see it and see how little means sometimes the system has to help them, you really feel that we need to take a lot of our brilliance that we have here at MIT and translate it into something useful.
- LFLex Fridman
Yeah. And useful can have a lot of definitions, but of course alleviating suffering, alleviating trying to cure cancer is a beautiful mission. So I of course know the theo- theoretically the notion of cancer-
- RBRegina Barzilay
Mm-hmm.
- LFLex Fridman
... but just reading more and more about it, uh, 1.7 million new cancer cases in the United States every year, 600,000 cancer-related deaths every year.... so this has a huge impact, United States, globally. W- when, uh, broadly, before we talk about how machine learning, how MIT can help, when do you think we, as a civilization, will cure cancer? How hard of a problem is it from everything you've learned from it recently?
- RBRegina Barzilay
I cannot really assess it. What I do believe will happen with the advancement in machine learning, is that a lot of, uh, types of cancer we will be able to predict way early, and more effectively utilize existing treatments. Uh, I think, I hope at least, that with all the advancements in AI and drug discovery, we would be able to much faster find relevant molecules. What I'm not sure about is how long it will take the medical establishment and regulatory bodies to kind of catch up and to implement it. And I think this is a very big piece of puzzle that is currently not addressed.
- LFLex Fridman
That's the really interesting question. So first, a small detail that I think the answer is yes, but is cancer one of, uh, o- one of the diseases that when detected earlier, that's a significantly improves the outcomes?
- RBRegina Barzilay
(sighs)
- LFLex Fridman
It... so like... 'cause we'll talk about there's the cure, and then there is detection, and I think where machine learning can really help is earlier detection. So does detection help with-
- RBRegina Barzilay
Detection is crucial. For instance, the vast majority of pancreatic cancer patients are detected at the stage that they are incurable. That's why they have such a, you know, terrible survival rate. I- it's like just few percent over five years. Uh, is pretty much today their sentence. But if you can discover this disease early, there are mechanisms to treat it. And in fact, I know c- a number of people who were diagnosed and saved just because they had food poisoning. They had terrible food poisoning, they went, uh, to ER, they got scan-
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
... there were early signs on the scan and that what saved their lives. But this wasn't really an accidental case. So as we become better, uh, we would be able to help to many more people, uh, that have, uh, you know, that are likely to develop diseases. And I just want to say that as I got more into this field, I realized that, you know, cancer is of course a terrible disease, but there are (laughs) really the whole slew of terrible diseases out there, uh, like neurodegenerative diseases and others. Uh, so we of course, a lot of us are fixated on cancer just because it's so prevalent in our society and you see these people, but there are a lot of patients with neurodegenerative diseases-
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
... and the kind of aging diseases that we still don't have a good solution for. And, uh, we, y- you know, and I felt as a computer scientist, we kind of decided that it's other people's job to treat these diseases, uh, because it's like traditionally people in biology or in chemistry or MDs are the ones who are thinking about it. A- and after kind of start paying attention, I think that it's really, uh, wrong assumption and we all need (laughs) to join the battle.
- LFLex Fridman
So how... it seems like in cancer specifically, that there's a lot of ways that machine learning can help. So what's, what's the role of machine learning in the diagnosis of cancer?
- RBRegina Barzilay
So for many cancers today, we really don't know what is your likelihood to get cancer and for the vast majority of patients, especially on the younger patients, it really comes as a surprise. Like for instance, for breast cancer, 80% of the patients are first in their families, it's like me-
- LFLex Fridman
Hmm.
- RBRegina Barzilay
... and I never saw that I had any increased risk because, you know, nobody had it in my family and for some reason in my head it was kind of a inherited disease, but even if I would pay attention the, the models that currently, it's very simplistic statistical models that are currently used and in clinical practice they really don't give you an answer so you don't know. And, uh, the same true for pancreatic cancer, the same true for, uh, non-smoking lung cancer and many others. Uh, so what machine learning can do here is utilize all this data-
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
... to tell us early, uh, who is likely to be susceptible and d- using all the information that is already there, be it imaging, uh, be it your other tests, and you know eventually, uh, liquid biopsies and others where the signal itself is not sufficiently strong for human eye to do good discrimination because the signal may be weak but by combining many sources, a machine which is trained on large volumes of data can really, uh, detect it early and that's what we've seen with breast cancer and people are reporting it in other diseases as well.
- LFLex Fridman
That really boils down to data, right? And, and the different kinds of sources of data and you mentioned regulatory challenges, so w- what are the challenges in gathering large datasets in this space?
- RBRegina Barzilay
Again, another great question. So it took me after I decided that I want to work on it two years to get access to data. Uh-
- LFLex Fridman
Any data? Like, any significant dataset?
- RBRegina Barzilay
Any significant amount, like right now in this country, there is no publicly available dataset of modern mammograms that you can just go on your computer, sign a document and, and get it. It just doesn't exist. I, I mean obviously every hospital has its own collection of mammograms, there are data that come out... that came out of clinical trials but we're talking about you as a computer scientist who just want to run-... his or her model and see how it works. This data, like ImageNet, doesn't exist, and, uh, the im- there is an e- uh, set which is called like Florida Dataset, which is a film mammogram from '90s, which is totally not representative of the current developments. Whatever you're learning on them doesn't scale up. This is the only resource that is available, and, uh, today there are many agencies that govern access to data. Like, the hospital holds your data and the hospital decides whether they would give it to the researcher to work with this data or not.
- LFLex Fridman
An individual hospital? So like a-
- 30:00 – 45:00
(laughs) …
- RBRegina Barzilay
that comes by mail (laughs) to my office at MIT.
- LFLex Fridman
(laughs)
- RBRegina Barzilay
I'm serious.
- LFLex Fridman
That's amazing.
- RBRegina Barzilay
Uh, that, uh, people ask to run because they need to make, you know, life changing decisions, and, uh, of course, you know, I'm not planning to open a clinic here but, uh, w- we do run and give them the results for their doctors. But-... the point that I'm trying to make, that we all, at some point, or our loved ones, will be in this situation where you need information to make the best choice, and if this information is not available, you would feel vulnerable and unprotected. And then the question is, you know, what do I care more? Because in the end everything's a trade-off, correct?
- LFLex Fridman
Yeah, exactly. Just, uh, out of curiosity, what... it seems like one possible solution, I'd like to see what you think of it, based on what you just said, based on wanting to know answers for when, when you're yourself in that situation. Is it possible for patients to own their data, as opposed to the hospitals owning their data?
- RBRegina Barzilay
(sighs)
- LFLex Fridman
Of course, theoretically, I guess patients own their data, but can you walk out there with a USB stick containing everything, or upload it to the cloud, where a company... You know, I- I remember Microsoft had a service, th- like I try, I was really excited about, and Google Health was there. I tried to give... I- I was excited about it. Basically, companies helping you upload your data to the cloud so that you can move from hospital to hospital, from doctor to doctor. Do you see a promise of that kind of possibility?
- RBRegina Barzilay
I absolutely think this is, you know, the right way to, to exchange the data. I don't know now who's the biggest player in this field, but I can clearly see that even, you know, for ... even for totally selfish health reasons, when you are going to a new facility, and many of us are sent to some specialized treatment, they don't easily have access to your data. And, uh, today, you know, (laughs) women who want to send a mammogram need to go the hospital, find some small office which gives them the CD and they ship as a CD. So you can imagine we're looking at the kind of decades-old mechanism of data exchange. So I definitely think this is an area where hopefully all the right regulatory and technical forces will align, and we will see it actually implemented.
- LFLex Fridman
It's sad because, um, unfortunately, and I have a need to research why that happened, but I'm pretty sure Google Health and Microsoft HealthVault, or whatever it's called, both closed down. Which means that there was either regulatory pressure, or there's not a business case, or there was challenges from hospitals, which is very disappointing. So when you say you don't know what the biggest players are, uh, the two biggest that I was aware of (laughs) closed their doors. So I'm hoping, uh... I'd love to see why and I'd love to see who else can come up. And it seems like a- one of those, uh, Elon Musk-style problems, that are obvious needs to be solved, and somebody needs to step up and actually do this large-scale data cl- dim- you know, data collection.
- RBRegina Barzilay
So I know there is an initiative in Massachusetts, I think it's led by the governor, to try to create this kind of health exchange system, where at least to help people who are kind of ... when you show up in the emergency room and there is no information about what are your allergies and other things. Uh, so I don't know how far it will go, but a- another thing that you said, and I find it very interesting, is actually who are the successful players in this space? And the whole implementation, how does it go? To me, it is, from the anthropological perspective, it's more fascinating that AI that today goes in healthcare. You know, we've seen so many, you know attempts, and so very little (laughs) successes. And it's interesting to understand, and I by no means, uh, you know, have knowledge to assess it, uh, why we are in the position where we are.
- LFLex Fridman
Yeah, it's interesting 'cause, um, data is really fuel for a lot of successful applications, and when that data requires regulatory approval, like the FDA or any- any kind of approval, it's, uh, it seems that the computer scientists are not quite there yet in being able to play the regulatory game, understanding the fundamentals of it.
- RBRegina Barzilay
I think that in many cases when even people do have data, we still don't know what exactly do you need to demonstrate to- to change the standard of care?
- LFLex Fridman
Ah.
- RBRegina Barzilay
Uh, like let me give you example related to my breast cancer research. So traditional- in traditional breast cancer risk assessment, uh, there is something called density which determines the likelihood of a woman to get cancer, and this pretty much says how much white do you see on the mammogram. The whiter it is, the more likely the tissue is dense. And, um, the idea behind density, it's not a bad idea. In 1967, a radiologist called Wolf decided to look back at women who were diagnosed and, uh, see what is special in their images. Can we look back and say that they're likely to develop? So he come up with some patterns, and it was the best that his human eye can, uh, you know, can identify, then it was kind of formalized and coded into four categories, and that what we are using today. And, uh, uh, today this density assessment is actually federal law from 2019, the- uh, approved by President Trump and for the previous FDA commissioner, where women are supposed to be advised by their providers if they have high density, putting them into higher-risk category, and, um, in some states you can actually get supplementary screening paid by your insurance because you're in this category. Now, you can say, "How much science do we have behind it?" Whatever, biological science or epidemiological evidence. So it turns out that between 40 and 50% of women have dense breasts. So about 40% (laughs) of patients are coming out of their screening and somebody tells them, "You are in high risk."
- LFLex Fridman
Mm-hmm. High risk.
- RBRegina Barzilay
Now, what exactly does it mean if you, as half of the population, high risk? ??? A- ah, maybe I'm not. You know, or what do I really need to do with it? Because-... the system doesn't provide me a lot of the solutions because there are so many people like me, we cannot really provide very expensive solutions for them. And the reason this whole density became this big deal, it's actually advocated by the patients who felt very unprotected because many women went, did the mammograms which were normal, and then it turns out that they already had cancer-
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
... quite developed cancer. So they didn't have a way to know who is really at risk, and what is the likelihood that when the doctor tells you you're okay, you are not okay. Well, at the time, and it was, you know, 15 years ago, this maybe was the best piece of science that, uh, we had, and, uh, it took (laughs) you know, quite, uh, 15, 16 years to make it federal law. But now that this is, this is a standard, now with the deep learning model, we can so much more accurately predict who is going to develop breast cancer just because you are trained on a logical thing, and instead of describing how much white and what kind of white, machine can systematically identify the patterns, which was the original idea behind the sort of the radiologist machine is can do it much more systematically and predict the risk when you train the machine to look at the image and to say the risk in one to five years. Now, you can ask me how long it will take-
- LFLex Fridman
(laughs)
- RBRegina Barzilay
... to substitute this density which is broadly used across the country and really is not helping t- to bring these new models. And I would say it's not a matter of the algorithm. Algorithm's already orders of magnitude better than what is currently in practice. I think it's really the question, who do you need to convince? How many hospitals do you need to run the experiment? What... You know, all this mechanism of adoption, and how do you explain to patients and to women across the country that this is really a better measure? And again, I don't think it's an AI question. We can work more and make the algorithm even better, but I don't think that this is a current, uh, you know, the barrier. The barrier is really this other piece that for some reason is not really explored. It's like anthropological piece. And, uh, coming back to your question about books, there is a book that I'm reading, uh, it's called American Sickness by Elizabeth Rosenthal, and I got this book from my clinical collaborator, Dr. Connie Lehman, and I thought I know everything that I need to know about American health system, but you know, every page doesn't fail to surprise me. And I think that there is a lot of interesting and really deep lessons for people like us from computer science who are coming into this field, to really understand how complex is the system of incentives-
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
... in, in the system to understand how you really need to play to drive adoption.
- LFLex Fridman
You just said it's complex, but i- if we're trying to simplify it, who do you think most likely would be successful if we push on this group of people? Is it the doctors? Is it the hospitals? Is it the governments or policymakers? Is it the individual patients, consumers? Who needs to be inspired to most likely lead to, uh, adoption? Or is there no simple answer? (laughs)
- RBRegina Barzilay
There's no simple answer, but I think there is a lot of good people in medical system who do want, uh, you know, to make a change, and I think a lot of power will come from us as the consumers because we all are consumers or future consumers of healthcare services, and I think we can do so much more in explaining the potential and noting the hype terms and not saying that we now cured all Alzheimer's, and, you know, I'm really sick of reading these kind of articles which make these claims. But really to show with some examples what this implementation does and how it changes the care, because I can't imagine, doesn't matter what kind of politician it is, you know, we all are susceptible to these diseases. There is no one who is free. Uh, and eventually, you know, we all are humans and we are looking for way to alleviate the suffering, and, and this is one possible way where we currently are under-utilizing which I think can help.
- LFLex Fridman
So, it sounds like the biggest problems are outside of AI in terms of the biggest impact at this point, but, uh, are there any open problems in the application of ML to oncology in general? So improving the detection or any other creative methods, whether it's on the detection, segmentations of the vision perception side, or some other clever, uh, inference? Uh, yeah, what in, what in general in your, in your view are the open problems in this space?
- RBRegina Barzilay
Yeah, I just want to mention that beside detection, another area where I am kind of quite active and I think it's really an increasingly important area in healthcare is drug design.
- LFLex Fridman
Absolutely.
- RBRegina Barzilay
Uh, because, uh, you know, uh, eh, it's fine if you detect something early but you still need to get, uh, you know, to, to get drugs and, uh, new drugs for these conditions, and today all of the drug design, ML is nonexistent there. We don't have any drug that was developed by the ML model, or even not developed but at least they knew that ML model plays some significant role. Uh, I think, uh, this area with all the new ability to generate molecules with desired properties to do in silico screening, uh, is really a big open area. To be totally honest with you, you know, when we are doing diagnostics and imaging, primarily taking the ideas that were developed for other areas and you're applying them with some adaptation, the area of, you know-Drug design is really technically interesting and exciting area. You need to work a lot with graphs and, uh, uh, capture various 3D properties. There are lots and lots of, uh, opportunities to be technically creative, and, um, I think there are a lot of open questions in this area. W- w- you know, we're already getting a lot of successes even, you know, with the kind of the first generation of these models, but there is much more new creative things that you can do. And what's very nice to see is that actually the, you know, the- the more powerful, the more interesting models actually do do better, so there is a place to- to innovate in machine learning in this area. Uh, and some of these techniques are really unique to, let's say, to, you know, graph generation and other things. So, uh...
- LFLex Fridman
Wha- what, uh... Just to interrupt really quick, I'm sorry. Uh, graph generation or graphs, is drug discovery in general, what's, uh, how do you discover a drug? Is this chemistry? Is this trying to predict different chemical reactions or is it, uh, some kind of... W- what do graphs even represent in this space? (laughs)
- 45:00 – 1:00:00
So, so there is...…
- LFLex Fridman
real labs with lab coats and stuff?
- RBRegina Barzilay
So, so there is... So when you do high-throughput screening, you really do screening. It's in the lab, but it's- it's really the lab screening. You screen the molecules, correct? You don't really-
- LFLex Fridman
I don't know what screening is. (laughs) So what-
- RBRegina Barzilay
Uh, the screening is just check them for certain property, usually.
- LFLex Fridman
Like, in the physical space?
- RBRegina Barzilay
The-
- LFLex Fridman
In the physical world?
- RBRegina Barzilay
Yeah.
- LFLex Fridman
Like, actually there's a machine probably that is doing some... That- that-
- RBRegina Barzilay
Yeah.
- LFLex Fridman
... that's actually running the reaction?
- RBRegina Barzilay
That's actually running the reactions, yeah.
- LFLex Fridman
Yeah. Wow.
- RBRegina Barzilay
So, so, so there is a process where you can run, and that's why it's called high-throughput, that, you know, it become cheaper, uh, and faster to do it on very big number of molecules. You run the screening, you identify potential, um, you know, potential good starts, and then where the chemists come in who, you know, have done it many times and then they can try to look at it and say, "How can it change the molecule to get the desired, uh, profile in terms of all other properties?" So maybe how do we make it more bioactive and so on. And there, you know, the creativity of the chemist really, um, is one that determines the success of this design, uh, th- because they, again, they have a lot of domain knowledge of, you know, what works, how do you decrease toxicity and so on. And that's what they do. So all the drugs that are currently, you know, in the, uh, F- FDA approved drugs or even drugs that are in clinical trials, they are designed using these, uh, domain experts which-
- LFLex Fridman
Yeah.
- RBRegina Barzilay
... uh, goes through this combinatorial space of molecules on graphs or whatever, and find the right one or adjust it to be the right ones.
- LFLex Fridman
Sounds like the- the- the breast density heuristic from '67.
- RBRegina Barzilay
It- it's not-
- LFLex Fridman
The same echoes.
- RBRegina Barzilay
It's not necessarily that. It's really, you know, the- the- it's really driven by deep understa... It's not like they just observe it. I mean, they do deeply understand chemistry and they do understand-
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
... how different groups and- and how does it changes the properties. So there is a lot of-
- LFLex Fridman
Got it.
- RBRegina Barzilay
... science that gets into it and a lot of, kind of simulation, how do you want it to behave. Uh, uh, it- it's very, very complex.
- LFLex Fridman
So they're quite effective at this design, obviously.
- RBRegina Barzilay
The... Now, effective, yeah, we have drugs, like, uh, depending on how do you measure effective. If you measure, uh, it's in terms of cost, it's prohibitive. If you measure it in terms of times, you know, we have lots of diseases for which we don't have any drugs and we don't even know how to approach and don't need to mention few drugs on neurode- degenerative disease drugs that fail, you know. So- so there are lots of, you know, trials that fail, uh, you know, in later stages which is really catastrophic from the financial perspective. So, uh, you know, is it- is it the effective, the most effective mechanism? Absolutely no, but this is the only one that currently works. And I would, you know, I was as, uh, uh, closely interacting with people in pharmaceutical industry. I was really fascinating on how sharp and- and what a deep understanding of the domain do they have. It's not observation driven, it's r- there is really a lot of science behind what they do. But if you ask me can machine learning change it, I firmly believe yes, uh, because even the most experienced chemist cannot, you know, hold in their memory and understanding everything that you can learn, you know, from millions of molecules and reactions. And-
- LFLex Fridman
And this space of graphs is a totally new space. I mean, it's a- it's a really interesting space for machine learning to explore, graph generation.
- RBRegina Barzilay
Yeah. So- so there are a lot of things that you can do here. So we do a lot of work, so the first tool that we started with was the tool that can predict...... properties of the molecules, so you can just give the molecule, molecule and the property. It can be bio-activity property, or it can be some other property, and you train the molecules and, um, you can now take a new molecule and predict, uh, this property. Now, when people started working in this area, they did something very simple. They did kind of existing, you know, fingerprints, which is kind of handcrafted features of the molecule, when you break the graph to substructures and then you run it in a feed-forward neural network. And what was interesting to see that clearly, you know, this was not the most (laughs) effective way to proceed, and you need to have much more complex models that can induce a representation which can translate this graph into the embeddings and, and do these predictions. So, this is one, uh, direction. Then another direction, which is kind of related, is, uh, not only to stop by looking at the embedding itself, but to actually modify it to produce better molecules. So, you can think about it as machine translation-
- LFLex Fridman
Mm-hmm. (laughs)
- RBRegina Barzilay
... that you can start with a molecule and then there is an improved version of molecule, and you can again, t- with encoder translate it into the hidden space, and then learn how to modify, to improve the in some ways version of the molecules. So, that's i- it's kind of really exciting. We already have seen that the property prediction works pretty well, and now we are, uh, generating molecules and there is actually labs which are manufacturing these molecules. So, we'll see where it will get us.
- 1:00:00 – 1:15:00
Uh-huh. …
- RBRegina Barzilay
them the hint that machine understands you and you can complete the rest, that he kind of stopped this research and went into kind of trying to understand what this artificial intelligence can do to our brains. M- uh, so my point is, you know, how much... I- i- i- it's not how good is the technology, it's how ready we are to believe (laughs) that it delivers-
- LFLex Fridman
Uh-huh.
- RBRegina Barzilay
... the goods that we are trying to get.
- LFLex Fridman
Uh, that's a really beautiful way to put it. I- I, by the way, am not horrified by that possibility but inspired by it because...I mean, um, human connection, whether it's through language or through love-
- RBRegina Barzilay
(laughs)
- LFLex Fridman
... it, uh, it seems like it's, uh, very amenable to machine learning and the rest is just, um, challenges of psychology. Like you said, the secretaries who enjoy spending hours... I would say, I would describe most of our lives as enjoying spending hours with those we love for very silly reasons. All we're doing is keyword matching as well. So I- I'm not sure how much intelligence we exhibit to each other in- w- with the people we love that are- we're close with. So it's a very interesting point of what it means to pass the Turing test with language. I think you're right, in terms of conversation. I think machine translation is a- has a very clear performance and improvement, right? What it means to have a fulfilling conversation is very, very person dependent and context dependent and- and so on. That's, uh, yeah, that's very well put. So but i- in your view, what's a benchmark in natural language, a test that's just out of reach right now, but we might be able to, that's exciting? Is it in machi- is it perfecting machine translation or is there other ... is it summarization? What's- what's out there just out of reach?
- RBRegina Barzilay
I think it goes across specific application. It's more about the ability to learn from few examples for real, what we call few-shot learning and all these cases because, you know, the way we publish these papers today, we say if we have like naively we get, uh, 55, but now we had a few example and we can move to 65. None of these methods actually realistically doing anything useful. You cannot use them today.
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
And, uh, their ability to be able to generalize and to move, uh, or to be a- autonomous in finding the data that you need to learn, uh, to be able to perfect new task or new language. Uh, this is an area where I think we really need to- to move forward to, and we are not yet there.
- LFLex Fridman
Are you at all excited, curious, by the possibility of creating human level intelligence? Uh, is this- 'cause you've been very, in your discussion, so if we look at oncology, you're trying to, uh, use machine learning to help the world in terms of alleviating suffering. If you look na- nat- natural English processing, you focus on the outcomes of improving practical things like machine translation. But, you know, human level intelligence is a thing that uh, c- our civilization has dreamed about creating, uh, um, superhuman level intelligence. Do you think about this? Do you think it's at all within our reach?
- RBRegina Barzilay
As you said yourself earlier, uh, talking about, you know, how do you perceive, uh, you know, our communications with each other, that you know, we're matching keywords and certain behaviors and so on. So and then whenever, uh, one assesses, let's say, relations with another person, you have k- separate kind of measurements and outcomes inside your head-
- LFLex Fridman
(laughs)
- RBRegina Barzilay
... that determine, you know-
- LFLex Fridman
Yeah.
- RBRegina Barzilay
... what is the status of the relation. And so one way, this is a classical dilemma, what is the intelligence? Is it the fact that now we are gonna do the same way as human is doing when we don't even understand what the human is doing? Or we now have an ability to deliver these outcomes, but not in one area, not in NLP, and not just to translate or just to answer questions, but across many, many areas that we can achieve the functionalities that humans can achieve with the ability to learn and do other things. I think this is ... a- and this we can actually measure how far we are, and, uh, that's what makes me excited that we, you know, in my lifetime at least so far what we've seen is like tremendous progress across these different functionalities. And, uh, I think it will be really exciting to see where we will be. And again, one way to think about it is there are machines which are improving their functionality. Another one is to think about us with our brains, which are imperfect, how they can be accelerated by this technology w- as it becomes stronger and stronger. Uh, coming back to another book that I love, Flowers for Algernon.
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
Ha- have you read this book?
- LFLex Fridman
Yes.
- RBRegina Barzilay
Uh, so there is this point that it- then- the patient gets this miracle cure which changes his brain and all of a sudden, they see life in a different way and can do certain things better, but certain things much worse. So you can imagine this, uh, kind of computer-augmented cognition, where it can bring you is that now the same way as, you know, the cars enable us to get to places where we've never been before, can we think differently? Can we think faster? So ... a- and we already see a lot of it happening, uh, in how it impacts us. But, uh, I think we have a long way to go there.
- LFLex Fridman
So that's sort of, uh, artificial intelligence and technology affecting our- augmenting our intelligence as humans. Yesterday, uh, a company called Neuralink, uh, announced ... they did this whole demonstration, I don't know if you saw it, it's, uh, they demonstrated brain-computer, brain-machine interface, where there's like a- a sewing machine for the brain. Do you, uh ... you know, a lot of that is, uh, quite out there, in terms of things that some people th- would say are impossible, but they're dreamers and want to engineer systems like that. Do you see, based on what you just said, a hope for that more direct interaction with the brain?
- RBRegina Barzilay
Uh, I think there are different ways. One is a direct interaction with the brain, and again, there are lots of companies that work in this space and I think there will be a lot of developments.... when I'm just thinking that many times we are not aware of our feelings of motivation, what drives us.
- LFLex Fridman
Hm.
- RBRegina Barzilay
Like let me give you a trivial example, our attention. The- there are a lot of studies that demonstrate that it takes a while to a person to understand that they are not attentive anymore.
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
And we know that there are people who really have strong capacity to hold attention, they're on other end of the spectrum, people with ADD and other issues and they have problem to regulate their attention. Imagine to yourself that you have like a cognitive aid that just alerts you based on your gaze-
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
... that your attention is now not on what you are doing, and instead of writing a paper, you are now dreaming of what you're going to do in the evening. So, even this kind of simple measurement things, how they can change us. And I see it even in simple ways with myself. I have my Zone app from, that I got at MIT gym. It kind of records, you know, how much did you run and you have some points and you can get, uh, some, uh, status, whatever. (laughs) Like I, (laughs) I said, "What is this ridiculous thing? Who would ever care about some status in some app?" Guess what? So to c- to ma- con- maintain the status, you have to do certain number of points every month. And (laughs) not only is it that I do it every single month for the last, uh, 18 months, it went to the point that I was running, uh, that I was injured, and when I could run again, I, (laughs) in two days, I did like some humongous amount of running-
- LFLex Fridman
You tried to catch up?
- RBRegina Barzilay
... just to complete the points.
- LFLex Fridman
Oh, no.
- 1:15:00 – 1:17:13
Mm-hmm. …
- RBRegina Barzilay
uh, that we don't listen to that part. And you know, you- we all should find some time to understand what is our own individual missions and we may have very different missions and to make sure that while we are running 10,000 things, we are not, um, you know, missing out and we're putting all the resources to- to satisfy our own mission. And if I look over my time, uh, when I was younger, most of these missions, uh, you know, I- I was primarily driven by the external stimulus, you know-
- LFLex Fridman
Mm-hmm.
- RBRegina Barzilay
... to- to achieve this or to be that and now a lot of what I do is driven by really thinking what is important for me to- to achieve independently of the external recognition. Uh, and you know, I- I- I don't mind to be viewed in certain ways. Uh, eh, t- the most important thing for me is to be true to myself, to what I think is right.
- LFLex Fridman
How long did it take? How hard was it to find the you that you have to be true to?
- RBRegina Barzilay
So it takes time and even now sometimes, you know, the vanity and the triviality can take-
- LFLex Fridman
Of course.
- RBRegina Barzilay
... uh, you know-
- LFLex Fridman
At MIT. (laughs) No, that's-
- RBRegina Barzilay
Um, yeah, it- it can everywhere, you know, it's just the vanity at MIT is different, the vanity in different places-
- LFLex Fridman
Yeah.
- RBRegina Barzilay
... but we all have our piece of vanity. But I think actually for me the- m- many times the place to- to get back to it, is, um, you know, when I- when I'm alone and also when I read. And I think by selecting the right books, you can get the right questions and learn from what you read, so. B- bu- but again, it's, uh, not perfect like-
- LFLex Fridman
(laughing) Nothing is.
- RBRegina Barzilay
... uh, vanity sometimes dominates.
- LFLex Fridman
Well, that's a beautiful way to end. Thank you so much for talking today.
- RBRegina Barzilay
Thank you.
- LFLex Fridman
It was fun. That was fun.
- RBRegina Barzilay
Oh, it was fun.
Episode duration: 1:17:28
Install uListen for AI-powered chat & search across the full episode — Get Full Transcript
Transcript of episode x0-zGdlpTeg
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome