An initiative to secure the world's software | Project Glasswing

Project Glasswing is a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks in an effort to secure the world’s most critical software. We formed Project Glasswing because of capabilities we’ve observed in a new frontier model trained by Anthropic that we believe could reshape cybersecurity. Claude Mythos Preview is a general-purpose, unreleased frontier model that reveals a stark fact: AI models have reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities. Read more: https://anthropic.com/glasswing

Apr 7, 2026 · 5m

EVERY SPOKEN WORD

5 min read · 960 words

0:00 – 0:49
Why rare software bugs can become worldwide security crises
1. Speaker
  Most people who use software every day don't think about bugs. They don't think about what can happen if the software that they depend upon suddenly is less secure. That's something that software developers have to deal with every single day.
2. Speaker
  [gentle music] So software has always had flaws and vulnerabilities. That's not new.
3. Speaker
  For an average person, the bugs are by and large not something they notice on a daily basis because if they do, they get fixed.
4. Speaker
  But then every so often there are vulnerabilities that have real severe impacts.
5. Speaker
  Like one single bug that works its way into shared, uh, software that many, many, many different products or websites use, so one issue just gets magnified out around the world.
0:49 – 0:55
The historical bottleneck: slow, expensive vulnerability discovery and patching
1. Speaker
  So historically, finding and patching vulnerabilities has been a slow, time-consuming, and expensive process.
0:55 – 1:23
LLMs raise the stakes for both attackers and defenders
1. Speaker
  If LLMs are now able to write code at the level of some of the greatest software developers in the world, it can also be used to find bugs and exploit that software equally effectively.
2. Speaker
  These models have capabilities which are raising the bar from a cybersecurity point of view with their ability to help defenders as well as potentially help adversaries.
1:23 – 1:54
Introducing Claude Mythos Preview and its unexpected cyber leap
1. Speaker
  We recently developed a new model, Claude Mythos Preview. Early on it was clear to us that this model was gonna be meaningfully better at cybersecurity capabilities.
2. Speaker
  There's a kind of accelerating exponential, but along that exponential there are, there are points of significance. Claude Mythos Preview is a particularly big jump along that point. We haven't trained it specifically to be good at cyber. We trained it to be good at code, but as a side effect of being good at code, it's also good at cyber.
1:54 – 2:37
From finding bugs to chaining exploits: autonomous, long-range attacker-like workflows
1. Speaker
  The model that we're experimenting with is by and large as good as a professional human at identifying bugs. It's good for us because we can find more vulnerabilities sooner, and we can fix them.
2. Speaker
  It has the ability to chain together vulnerabilities. So what this means is you find two vulnerabilities, either of which doesn't really get you very much independently, but this model is able to create exploits out of three, four, sometimes five vulnerabilities that in sequence give you some kind of very sophisticated end outcome. And we think that this model can do this really well because we notice that this model is very autonomous. It's just generally better at pursuing really long-range tasks that are kind of like the tasks that a human security researcher would do throughout the course of an entire day.
2:37 – 2:56
Why the model won’t be widely released, and why planning must start now
1. Speaker
  Obviously, capabilities in a model like this could do harm if in the wrong hands, and so we won't be releasing this model widely.
2. Speaker
  More powerful models are gonna come from us and from others, um, and so we do need a plan to, to, to respond to this.
2:56 – 3:30
Project Glasswing: controlled partnerships to harden critical code
1. Speaker
  That's why we're launching what we're calling Project Glasswing, where we partner with a number of the organizations that power some of the world's most critical code to put the model into their hands to allow them to look at how they can use models like this to bring down risk and protect everyone.
2. Speaker
  And by giving these software developers advanced tools before anyone else, it gives all of us a collective head start.
3. Speaker
  It allows us to find things that we couldn't find before, and it helps us fix these things, uh, much more quickly.
3:30 – 3:36
Early results: vulnerability discoveries across major platforms
1. Speaker
  Working with our partners, we've been finding vulnerabilities across essentially every major platform.
3:36 – 4:27
Concrete examples: decades-old OpenBSD issue and Linux privilege escalations
1. Speaker
  I found more bugs in the last couple of weeks than I found in the rest of my life combined. We've used the model to scan a bunch of open source code, and the thing that we went for first was operating systems, uh, because this is the code that underlies the entire internet infrastructure. For OpenBSD, we found a bug that's been present for twenty-seven years where I can send a couple of pieces of data to any OpenBSD server and crash it. On Linux, we found a number of vulnerabilities where as a user with no permissions, I can elevate myself to the administrator, um, by just running some binary on my machine. For each of these bugs, we, we told the maintainers who actually run the software, um, about them, and they went and fixed them and have deployed the patches so that anyone who runs this software is, is no longer vulnerable to these attacks.
4:27 – 4:40
Empowering maintainers: AI as an invaluable defensive tool
1. Speaker
  For a developer who tirelessly maintains software, a model that can help them discover vulnerabilities in their own code and fix them before they can be exploited, that is an invaluable tool.
4:40 – 5:11
Coordinating with government and reframing cyber as societal security
1. Speaker
  We've spoken to officials across the US government, and we've offered to work with them and, and collaborate to assess the risks of these models and to help defend against the risks of these models. Everything that we do in our lives now depends on software.
2. Speaker
  Software kinda ate the world. Every analog aspect of our life is somehow represented in digital domain.
3. Speaker
  And so all of our daily lives run on the idea that we can rely on the systems that power them.
4. Speaker
  Cybersecurity is the security of our society.
5:11 – 5:48
A long-term, cross-industry effort to make the world’s software safer
1. Speaker
  It is essential that we come together and work together i- across industry to help build better defensive capabilities.
2. Speaker
  No single organization sees the whole picture and can tackle this on their own.
3. Speaker
  This is not gonna be done as part of a few-week program. This is gonna be the work of certainly months, perhaps years. But what I do hope is at the, at the end of this we can be in a position where the world's software, its customer data, its financial transactions, its critical infrastructure are safer than they were before. [gentle music]