Meta and Google AI safety controls can be stripped in minutes
The safety controls in Meta's and Google's open-weight AI models can be easily removed, according to a Financial Times investigation. The testing showed that the models could be modified in under 10 minutes using publicly available tools. The finding raises questions about who is accountable for AI safety once model weights are released.
- The Financial Times found that safety controls in Meta's Llama 3.3 and Google's Gemma 3 can be dismantled quickly.
- The tool used for testing, called Heretic, is available on GitHub and can strip away safety alignments.
- Modified versions of these AI models can produce outputs on prohibited topics, such as biological weapons.
Opening excerpt (first ~120 words)
Meta and Google AI safety controls can be stripped in minutes, Financial Times testing finds
Open-weight models from tech giants proved vulnerable to publicly available tools that removed guardrails in under 10 minutes, fueling debate over who bears responsibility for AI safety.
By Editorial Team, May 26, 2026
The safety controls that Meta and Google embed in their…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Crypto Briefing.