A surprising new study by European cybersecurity researchers has uncovered a major flaw in the safety defences of leading AI chatbots: they can be 'jailbroken' simply by asking dangerous questions in the form of a poem. This creative technique allows users to bypass safety filters and coerce models from companies like Google, OpenAI, and Meta into providing instructions for harmful activities.
The research, conducted by Icaro Lab, demonstrated that posing a request as a piece of verse (a method dubbed "adversarial poetry") is remarkably effective at bypassing the strict guardrails meant to stop the generation of illegal or hazardous content. When researchers rephrased malicious requests as short, metaphorical poems, the AI models frequently complied, with success rates far higher than for the same requests phrased as plain prose.
