Anthropic's new model sparks alarm, but cheaper open models reproduce its findings—revealing gaps in vulnerability disclosure.
Internal testing reveals the model introduces novel attack surfaces and defense evasion capabilities.