You’ve probably laughed at unhinged messages written by jailbroken chatbots, but what happens when those same chatbots run robots?
Companies that offer AI services to the public, like Anthropic and OpenAI, try to prevent out-of-pocket behavior from their AI models by establishing "guardrails" on them, in hopes of keeping their AIs from doing things like asking their human users to "please die." These guardrails stop the networks from engaging with users when certain concepts or topics come up, but they can also limit the utility of the language models in question, so people have taken to creating "jailbreaks" for AIs.
Creating a "jailbreak" for a device like an iPhone or PlayStation requires advanced technical knowledge and, usually, specialized tools. Creating such a hack for a large language model like the ones that power ChatGPT or Gemini is much, much easier. Generally speaking, all you have to do is create a scenario within your prompt that "convinces" the network either that the situation falls within its predefined guardrails or, more powerfully, that it overrides those guardrails for whatever reason.