MMLU-Pro holds steady at 85.0, AIME 2025 slightly improves to 89.3, while GPQA-Diamond dips from 80.7 to 79.9. Coding and agent benchmarks tell a similar story, with Codeforces ratings rising from ...
Researchers at DeepSeek released a new experimental model designed to have dramatically lower inference costs when used in ...
Google's Gemini 2.5 Flash Lite is now the fastest proprietary model (and there's more big Gemini updates). Google continues to improve its Gemini family of large language models (LLMs) and its audio ...
Discover Perplexity's new Search API, giving developers real-time access to a vast web index for advanced AI apps.
National AI usage among businesses is 9.2%, with a projected increase to 11.6% in six months. Utah leads state AI business adoption at 15.7%; Delaware's projected rate is highest at 19.1%.