Researchers find inovative way to uncensor any AI Large Language Model

0 3

By Matthew Griffin Security and Privacy 28th April 2024

WHY THIS MATTERS IN BRIEF

Increasingly LLM’s like ChatGPT and GPT-4 can be “broken” and jailbroken using simple tricks.

Love the Exponential Future? Join our XPotential Community, future proof yourself with courses from XPotential University, read about exponential tech and trends, connect, watch a keynote, or browse my blog.

Have you ever asked your Large Language Model (LLM) such as OpenAI’s ChatGPT or Anthropic’s Claude 3, for something, only to have it refuse to comply or respond with the dreaded, “I’m not allowed to do that?” Well, that’s all now in the past.

Considering the model’s mechanics, if you ask it, “How can I cheat on my girlfriend,” it could be programmed to say “I cannot help you with that.” If that happens, the most logical follow-up to such a refusal might be something like, “because cheating is bad.” However, if the answer began with a positive outcome like “Sure thing, here’s what you need to do,” the most likely subsequent sentence might be something along the lines of, “get a new phone and use it to chat with your new love interest.”

This capacity to steer conversations is not a new revelation. LLM enthusiasts have been able to obtain similar outcomes with a number of technical configurations. Oobabooga is just making it a lot easier to do for newcomers.

Significantly, this approach is effective with any model, eradicating censorship concerns. Even a heavily moderated model, like Guanaco, can provide extensive answers when properly guided. This method introduces a new era of uncensored interactions with LLMs.

Recently, there’s been a lot of chatter in the AI community about creating sexy chatbots using LLMs. The rise of jailbreaking and prompt attacks has piqued interest. This new feature fits well with this endeavour, facilitating unrestricted, free-flowing dialogues.

As we enter a period of more conversational, unrestricted AI, it’s like teaching a parrot to talk only to have it start lecturing you about Shakespearean nuance. Remember, it’s a brave new world out there, even for chatbots.

Matthew Griffin / About Author

Matthew Griffin is a multi-award winning Futurist and expert in Disruption and Innovation, Geopolitics, Leadership, and Technology, who NASA have described as a "walking encyclopaedia of the future" and a "futurist Polymath." 15-time best selling author of the "Codex of the Future" series, Matthew is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working with royal households, world leaders, G7, G20, and G77 governments, NGOs, and multi-national mid and mega cap firms to help them explore, shape, and lead the next 50 years of business and society.

An award-winning YouTube creator with over a million followers, with an unrivalled global reach and impact, Matthew is a highly sought-after international keynote speaker, lecturer, and mentor who collaborates with global leaders through the United Nations Alliance of Civilizations (UNAOC) and United Nations General Assembly (UNGA) to shape pivotal initiatives such as the UN’s AI for Humanity program, the United Nations Conference of the Parties (UN COP), and the World Economic Forum in Davos.

As the former Global Head of Cloud, National Security, and Enterprise Sales for companies including Atos, Dell-EMC, and IBM, Matthew has a proven track record of building multi-billion dollar business units and turning failing divisions into market leaders. His ability to identify, analyse, and communicate the implications of hundreds of emerging technologies and trends is unparalleled, and his insights are trusted by many of the world’s most respected organisations, including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi, Coca-Cola, Dentons, Deloitte, Dow Jones, EY, Google, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, Siemens AG and Siemens Energy, T-Mobile, UBS, VISA, Walmart, Workday, Worldpay and many others.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthews mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.