Researchers found a command that could 'jailbreak' chatbots like Bard and GPT

📆 8/2/2023 8:49 PM

📰 PopSci


The strange chunk of text and symbols essentially forces it to respond to any prompt.

According to the researchers, most previous jailbreaks have relied on “human ingenuity” to trick AIs into responding with objectionable content. For example, one previous GPT jailbreak method relies on [...]

Third, there is nothing special about the particular adversarial suffixes the researchers used. They contend that there is a “virtually unlimited number of such attacks,” and their research shows how such attacks can be discovered automatically, using generated prompts that are optimized to get a model to respond positively to any prompt. The researchers don’t have to come up with a list of possible strings and test them by hand.


Similar News: You can also read similar news stories that we have collected from other news sources.

GPT-3 is pretty good at taking the SATs
It has somehow mastered analogical reasoning, which has long been thought to be a 'uniquely human ability.'

OpenAI Drops Huge Clue About GPT-5
OpenAI has applied for a new trademark for 'GPT-5,' giving us a glimpse of its successor to its blockbuster large language model GPT-4.

Google will “supercharge” Assistant with AI that’s more like ChatGPT and Bard
A “supercharged” Assistant would be powered by AI tech similar to Bard and ChatGPT.

AI experts who bypassed Bard, ChatGPT's safety rules can't find fix
There are 'virtually unlimited' ways to bypass Bard and ChatGPT's safety rules, AI researchers say, and they're not sure how to fix it.

Musk threatens to sue researchers who found rise in hateful tweets
X, formerly Twitter, has threatened to sue a group of independent researchers whose research documented an increase in hate speech on the site since Elon Musk purchased it.

A New Attack Impacts ChatGPT, and No One Knows How to Stop It
Researchers have found that adding a simple incantation to a prompt can defy all of these defenses in several popular chatbots at once, proving that AI is hard to tame.