OpenAI’s bots admit wrongdoing in new ‘confession’ tests • The Register

The Register, 2 hrs ago

Some say confession is good for the soul, but what if you have no soul? OpenAI recently tested what happens if you ask its bots to "confess" to bypassing their guardrails.

We must note that AI models cannot "confess." They are not alive, despite the sad AI companionship industry. They are not intelligent. All they do is predict tokens from training data and, if given agency, apply that uncertain output to tool interfaces.

Terminology aside, OpenAI sees a need to audit AI models more effectively due to their tendency to generate output that's harmful or undesirable – perhaps part of the reason that companies have been slow to adopt AI, alongside concerns about cost and utility.

"At the moment, we see the most concerning misbehaviors, such as scheming, only in stress-tests and adversar
