Anthropic Hands Defenders a Lifeline with Project Glasswing
A new frontier model finds zero-day vulnerabilities better than almost any human. Anthropic handed the keys to defenders first.
Anthropic just built an AI model that can find software vulnerabilities better than almost any human on Earth. Instead of shipping it like a product, they handed defenders the keys first.
That sentence should stop you for a second. Not because of what the model can do, but because of what Anthropic chose to do with it.
Project Glasswing, announced this week, isn’t really about a model. It’s about a release decision. And in 2026, a release decision like this one is the exception, not the rule.
What Glasswing actually is
The model is called Claude Mythos Preview. Anthropic says it can autonomously hunt down zero-day vulnerabilities in operating systems, web browsers, and other critical software.
What Mythos has already found is the part that got my attention:
- A vulnerability in OpenBSD that had been sitting undetected for 27 years
- A 16-year-old flaw in FFmpeg that every automated security tool in the industry had walked right past
- Multiple Linux kernel bugs of the kind that let an attacker break out of a container and take over the host
On CyberGym, the industry’s vulnerability-reproduction benchmark, Mythos scored 83.1% against the previous leader’s 66.6%. That’s not incremental. That’s a different category of tool.
Instead of selling access to the highest bidder or releasing broadly and hoping for the best, Anthropic stood up a coalition of launch partners: AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks. More than forty additional organizations are getting access on top of that.
The financial commitment includes $100 million in Mythos credits for defenders, $2.5 million routed through the Linux Foundation to the Alpha-Omega and OpenSSF security projects, and $1.5 million to the Apache Software Foundation, the volunteer-maintained backbone of nearly every system you use, including the one you’re reading this on.
The defenders get to move first. Attackers have to catch up.
Why the release method matters more than the model
Most coverage of new AI models only covers half the story. The capability is one half. The how of the release (who gets access, in what order, with what guardrails) is the other. In dual-use territory, it’s arguably the half that matters more.
Think about what the default playbook looks like in the AI industry right now. Ship fast. Let the market sort it out. Patch what breaks. Apologize when it’s bad enough to make the news. Repeat.
That works fine when the worst case is a chatbot saying something embarrassing. It doesn’t work when the model can surface a 27-year-old vulnerability that survived decades of human review.
This is exactly where the lack of meaningful AI regulation hurts innocent actors. When responsible release is left entirely to the goodwill of individual companies, everyone downstream takes the risk. The small business owner whose point-of-sale system runs on software a volunteer keeps patched on weekends. The open-source maintainer who suddenly has to triage a flood of AI-generated vulnerability reports. The end user who has no idea any of this is happening until their data shows up on a breach site. None of them chose to be part of this experiment. They just get the fallout.
In an ideal world, there would be a model-safety body with real teeth, one that could require red-team results, enforce disclosure windows, and gate the release of dual-use capabilities behind something more structured than a company’s internal values. Realistically, capitalism will push that solution out of reach. The incentives aren’t there. The political will isn’t there. Every lab knows it.
Which is what makes Glasswing interesting. Anthropic chose to behave as if that structure already existed.
It’s also the kind of problem you only get to have when you’re ahead. You don’t have to choreograph a defender-first release until you’ve built something capable enough that the choreography matters. Being the lab faced with this decision right now is, itself, a signal of where Anthropic sits in the field.
Will anyone else follow?
I don’t think they will.
Not in the next twelve months, and probably not until something forces the issue. When I look at the other frontier labs (OpenAI, Google DeepMind, Meta, xAI), each has its own reason, but the bottom line is the same: none of them are currently incentivized to trade capability visibility for defender-first release, and none of them have shown the kind of organizational character that would lead them there voluntarily. Anthropic has spent years building an identity around safety as a first-class priority, not a compliance checkbox. That culture is hard to fake and harder to copy.
Open-weights releases (the Llama-style “post the model, let people do what they want” approach) worry me less here, because open-weights models tend to lag the frontier. By the time you can self-host something approaching Mythos’s capability, defenders have had time to build. It’s the closed labs I’m watching.
The thing worth watching, though, isn’t whether OpenAI or Google copies Glasswing’s release pattern. They probably won’t. It’s whether one of them stumbles into the same step change in capability over the next six to twelve months. Mythos’s leap isn’t proprietary magic; it’s what happens when frontier coding and reasoning skills cross a threshold, and other labs are climbing the same curve. When one of them gets to the same place, we’ll find out what they choose. That’s when we’ll know whether Glasswing was a one-off or a template. Not because anyone followed Anthropic’s lead, but because they had to make the same call themselves.
I’ll also grant what the skeptical read of Glasswing gets right: this is a slam-dunk piece of PR. Eleven of the biggest companies in the world signed on, the dollar commitment is in the nine figures, and the quotes about “democratic states” write themselves. None of that is an accident. But I think Anthropic’s framing here is mostly staying true to its values, not performing them. The real test is whether Mythos’s capabilities back the marketing in the wild. That’s a wait-and-see, not a take-it-on-faith.
Personally, Glasswing moves the trust needle for Anthropic. It doesn’t move it for the industry. If any other frontier lab shipped something structurally similar in the next year, I’d be surprised and impressed in equal measure.
On the deeper question of whether responsible release is an ethics issue or a strategy issue, I’d answer ethics first, strategy second. You can build a case for Glasswing on pure strategic grounds (first-mover defenders, reputational moat, regulatory goodwill), and that case is real. But the reason it matters is the ethical one: when the tool you’ve built can cause a lot of harm if it lands in the wrong hands first, the order you hand it out is a moral choice, not a go-to-market choice.
What would convince me Glasswing was an inflection point and not an outlier? One thing: actual legislation creating an advisory structure for frontier model releases. Until that exists, every Glasswing-shaped move is voluntary. And voluntary doesn’t scale.
What’s next
For years, “responsible AI release” has been a theoretical conversation. Conference panels, op-eds, white papers. Everyone debating what a hypothetical future lab should do when it built a hypothetical future model that could really hurt people.
Glasswing is what that conversation looks like when it stops being theoretical.
Mythos is real. The 27-year-old vulnerability is real. The choice Anthropic faced was a real choice with real consequences. They reached the precipice the rest of the field has been arguing about for years, and they had to pick a direction.
The pontification phase is over. Every frontier lab climbing toward this capability is going to be standing at the same edge soon. And when they get there, they don’t get to retreat into another white paper. They have to make a moral choice. The kind you make in public, with consequences that don’t unwind.
That’s the part of this story I’m watching now. Not who copies Glasswing. Who arrives at the edge next, and what they do when they get there.