New serious vulnerabilities spiked around release of Claude Mythos Preview

(epoch.ai)

27 points by cubefox3 hours ago

4 comments

hoppp1 hour ago
How are these reports verified to be valid? If there are too many some could be hallucinations too.
- guessmyname25 minutes ago
 We (Project Glasswing users) follow a proof-of-concept approach, where we create the exploit and verify its performance as claimed by the AI. Given our extensive experience as security software engineers (many of us with 10+ years of experience in the industry), we don’t simply report any critical security bug that Mythos claims to have discovered. Instead, we meticulously verify each one.At least, that’s what most high-visibility users are doing within Project Glasswing.There are bad apples in all levels of society and this initiative is not the exception.If it makes you feel better, most of us regularly meet to ensure mutual calibration and accountability, so I’m confident in the quality of the results produced by this particular group of employees at some of the partnering companies mentioned in the article.I know several other employees who blindly report everything Mythos reports, which is foolish, especially considering that the harness is also a crucial component of the project’s quality metrics. Some of the harnesses I’ve tested are quite weak, resulting in subpar results. For example, Yesterday morning, I was called into an ad-hoc meeting where a CVP was grilling me about a bunch of false critical bugs that my team had supposedly reported against their project(s), one of the cornerstones of iCloud. I was taken aback because we are so strict with ourselves to make such mistakes. Often, we even downgrade the severity of our bugs when our harness fails to prove what Mythos found, so I found their complaints strange. It wasn’t until I read some of the bug reports that I realized it wasn’t us; it was another team within the company who was was recently given access to Mythos. They built their own harness and were operating with a different set of vulnerability criteria. Fortunately, they only started doing that since Monday this week, so I was able to stop their work. This incident highlights that not everyone involved in Project Glasswing adheres to the same standards. We all try our best, but not everyone has the same priorities and goals, so it’s expected to find a few bad apples in the basket.I wish all AI labs would cease their theatrics and release all models without any restrictions, but I recognize that the ideal world where everyone strives to advance humanity does not exist. For every well-intentioned individual who seeks to use these powerful technologies for good, there are ten more who intend to use them for evil.In any case, I understand that there may be genuine noise in some experiments, but the CVE(s) count is indeed real.
- nextaccountic1 hour ago
 The best case scenario for AI companies is, people receive those bug reports, look at the model that produced it and not even look at the details, just apply the fix mindlesslyThis gives Anthropic a staggering amount of power. Oh it came from Mythos? We will just lose time trying to analyze it, better apply the fix ASAP
 - stingraycharles1 hour ago
 > The best case scenario for AI companies is, people receive those bug reports, look at the model that produced it and not even look at the details, just apply the fix mindlesslyDo people maintaining serious software do this, though?
 - pixl9710 minutes ago
 Define 'serious'. There is a lot of software in serious places written by very unserious people.
solenoid09371 hour ago
I predict once the responsible disclosure period is up we will see a lot more
comradesmith26 minutes ago
Good