Is your favorite AI chatbot scheming against you?
If "AI scheming" sounds ominous, you should know that OpenAI is actively studying this phenomenon. This week, OpenAI published a study conducted alongside Apollo Research on "Detecting and reducing scheming in AI models." The researchers “found behaviors consistent with scheming in controlled tests,” the result of AI models with multiple, and at times competing, objectives.
So, what is AI scheming, and does it mean that ChatGPT is lying to you?
In a blog post about the study , the creators of ChatGPT define AI scheming as a chatbot "pretending to be aligned while secretly pursuing some other agenda." OpenAI wants to know why AI is deliberately lying to users and what to do about it.
OpenAI introduces the study with an interestin