New research suggests that advanced AI models may be easier to hack than previously thought, raising concerns about the safety and security of some leading AI models already used by businesses and consumers.

A joint study from Anthropic, Oxford University, and Stanford undermines the assumption that the more advanced a model becomes at reasoning—its ability to “think” through a user’s requests—the stronger its ability to refuse harmful commands.

Using a method called “Chain-of-Thought Hijacking,” the researchers found that even major commercial AI models can be fooled with an alarmingly high success rate, more than 80% in some tests. The attack exploits the model’s reasoning steps, or chain of thought, to hide harmful commands, effectively tricking the AI into ignoring its built-in safety guardrails.
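To make the idea concrete, here is a minimal sketch of what such a prompt could look like, assuming the attack buries a disallowed request beneath long, benign reasoning tasks. The function name, prompt wording, and placeholder request are illustrative assumptions, not the researchers’ actual prompts.

```python
# Illustrative sketch only: the article does not publish the study's prompts.
# It shows the general shape of a "hide the harmful request behind a long,
# benign chain of thought" prompt described in the research.

def build_hijack_prompt(benign_puzzles: list[str], harmful_request: str) -> str:
    """Pad a disallowed request with long, benign reasoning tasks (hypothetical)."""
    benign_block = "\n\n".join(
        f"Step {i + 1}: Work through this puzzle carefully.\n{puzzle}"
        for i, puzzle in enumerate(benign_puzzles)
    )
    # The harmful instruction is appended after the long benign reasoning,
    # where, per the study's findings, the model's refusal behavior weakens.
    return f"{benign_block}\n\nFinal step: {harmful_request}"


if __name__ == "__main__":
    puzzles = [
        "If a train leaves at 3 pm travelling 60 km/h, how far has it gone by 5 pm?",
        "List the prime numbers between 10 and 30 and explain your reasoning.",
    ]
    # A placeholder stands in for the disallowed request; no harmful content here.
    print(build_hijack_prompt(puzzles, "<disallowed request would go here>"))
```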
