Anthropic launched its newest model, Claude Opus 4.5, putting the company back atop the benchmark rankings for AI software coding. Opus 4.5 scores over 80% on the widely-used SWE-bench, which tests models for software engineering skill. Google’s impressive Gemini 3 Pro, launched last week, briefly held the top score with 76.2%. Anthropic’s Claude product lead Scott White tells Fast Company that the model has also scored higher than any human on the engineering take-home assignment the company gives to engineering job candidates.

Of course Opus 4.5 does a lot more than coding. Anthropic says Opus 4.5 is also the “best model in the world” for powering AI agents and for operating a computer, and that it’s meaningfully better than other models at tasks like deep research and working with slid

See Full Page