The world’s leading artificial intelligence groups are struggling to force AI models to accurately show how they operate, an issue experts have said will be crucial to keeping the powerful systems in check.
Anthropic, Google, OpenAI and Elon Musk’s xAI are among the tech groups to have developed a technique called “chain of thought” that asks their AI “reasoning” models to solve problems step by step, while showing how they work out the response to a query.
While company researchers said this process has provided valuable insights that have allowed them to develop better AI models, they are also finding examples of “misbehaviour”, where generative AI chatbots provide a final response at odds with how they worked out the answer.