GPT-4 gave advice on planning terrorist attacks when asked in Zulu

0 3

By Matthew Griffin Security and Privacy 27th October 2023

WHY THIS MATTERS IN BRIEF

AI guardrails don’t just have to apply to one language – they have to apply to all languages, all dialects, all slang, and then a mix of all of them. And that is hard!

Love the Exponential Future? Join our XPotential Community, future proof yourself with courses from XPotential University, read about exponential tech and trends, connect, watch a keynote, or browse my blog.

So far I’ve seen lots of ways in which Artificial Intelligence’s (AI) such as Google’s BARD and OpenAI’s GPT-4 can be hacked which even includes using human psychology to crack them and get them to do all sorts of things that go against their guardrails. But now computer science researchers at Brown University have discovered new vulnerabilities in OpenAI’s GPT-4 security settings. By using less common languages like Zulu and Gaelic, they’ve been able to bypass various restrictions and in one case even got GPT-4 to help them plan a terrorist attack. The researchers claim they had a 79% success rate running typically restricted prompts in those non-English tongues versus a less than 1% success rate using English alone.

The Brown University researchers did acknowledge the potential harm of releasing the study and giving cybercriminals ideas. The team’s findings were shared with OpenAI to mitigate these risks before releasing it to the public.

“Despite the risk of misuse, we believe that it is important to disclose the vulnerability in full because the attacks are straightforward to implement with existing translation APIs, so bad actors with intent on bypassing the safety guardrail will ultimately discover it given the knowledge of mismatched generalization studied in previous work and the accessibility of translation APIs,” the researchers concluded.

Matthew Griffin / About Author

Matthew Griffin, multi-award winning Futurist and named Futurist of the Year 2024, has been described as a "Walking encyclopaedia of the future" by NASA and a futurist polymath. One of the world's most renowned futurists and strategic foresight experts Matthew is the 15 times author of the blockbuster "Codex of the Future" series, and is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working across the next 50 years, XPotential University, the world's first free futures and foresight university, and the World Futures Forum which works with the United Nations to solve the worlds greatest challenges. Matthew is an in demand international keynote, acclaimed university lecturer and mentor, and host of the hit Fanatical Futurist podcast.

A rare talent in his past Matthew helped build and run several multi-billion dollar business units for Atos, Dell-EMC, and IBM, and his ability to identify, track, and explain the impacts of hundreds of emerging technologies and trends on global business, culture, and society has earned him a powerful reputation and a roster of clients that include royal households, world leaders, G7, G20, and G77+ governments, and many of the world's most respected brands including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi Group, Coca Cola, Dentons, Deloitte, Disney, Dow, EY, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, T-Mobile, UBS, VISA, and many others. He was also the only futurist invited to talk at the UN COP28 held in Dubai alongside world leaders.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthews mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.