AI Models

Foundation models, multimodal systems, reasoning, and product shifts from new model releases.

23 articles, latest 2026-05-21

Related tags

#AI-Model #LLM #Gemini #Reasoning #Multimodal #GPT #GPT-5.4 #Image-Generation #Nano-Banana

Top sources

OpenAI (5), Anthropic (2), arXiv (2), Google (2), Alibaba / Tongyi Qianwen (1), ARC Prize (1)

Articles

Don't Expect AI Progress to Sigmoid Anytime Soon

Scott Alexander pushes back against the 'all exponentials become sigmoids' argument used to dismiss AI progress concerns, showing how history is littered with premature plateau predictions, and arguing Lindy's Law suggests continued progress for ~7 more years.

Google Launches Googlebook AI-Native Laptop Line

Google unveils Googlebook, a laptop series designed for Gemini Intelligence, with a Magic Pointer AI cursor, AI widget generation, and deep Android phone integration. It ships in Fall 2026.

OpenAI models, Codex, and Managed Agents land on AWS

OpenAI and AWS expand their partnership to bring GPT-5.5, Codex, and new Bedrock Managed Agents to AWS customers, giving enterprises a direct path to deploy frontier AI within their existing cloud infrastructure.

LLMs make surface quality unreliable in knowledge work

One Happy Fellow argues that LLMs break the proxy measures organizations use to judge knowledge work. When spelling, formatting, review rituals, and professional tone can be generated cheaply, teams need better ways to verify whether work is actually true, useful, and decision-grade.

DeepSeek V4 preview brings 1M context into open model competition

DeepSeek has released and open-sourced the V4 preview, with Pro and Flash variants and 1M context as the default across official services. The release matters less as a benchmark update than as a push to make long-context agent workflows cheaper and more deployable.

Introducing Forge | Mistral AI

OpenAI releases GPT-5.4, combining recent advances in reasoning, coding, and agentic workflows into a single frontier model with native computer-use capabilities. The model achieves a new state-of-the-art 83.0% on the GDPval benchmark.