Upgraded AI reasoning models that pause to ‘think’ are key to the additional capabilities of Gemini 2.5, says Google.
The Arc Prize Foundation has a new test for AGI that leading AI models from Anthropic, Google, and DeepSeek score poorly on.