Anthropic’s Claude Opus 4.8 is here with 3X cheaper fast mode and near-Mythos level alignment

by News Feed Editor | May 28, 2026 | Technology

Anthropic today released Claude Opus 4.8, an upgrade to its flagship model that ships at the same price as its predecessor, alongside a dramatically cheaper “fast mode” tier and a new feature that lets the model spawn hundreds of parallel subagents for codebase-scale work.

The model is available immediately across Anthropic’s surfaces (claude.ai, Claude Code, the API, and Cowork) at unchanged pricing: $5 per million input tokens and $25 per million output tokens. Developers can call it as claude-opus-4-8.

The headline efficiency story is fast mode. Anthropic has slashed the price of running Opus 4.8 in fast mode, where the model produces tokens at roughly 2.5x normal speed, to $10 per million input tokens and $50 per million output tokens, down from $30/$150 for Opus 4.7.

That’s a 3X reduction from the fast-mode pricing of previous models, and brings high-throughput inference within reach of latency-sensitive production workloads. Fast mode is available immediately in Claude Code via the /fast command; API access is gated, with a waitlist at claude.com/fast-mode.

In regular mode, Claude Opus 4.8 remains among the more expensive of leading frontier models, but still comes in under chief rival OpenAI’s GPT-5.5.

Frontier AI Model API Pricing Snapshot

| Model | Input (per 1M tokens) | Output (per 1M tokens) | Total Cost (input + output) | Source |
|---|---|---|---|---|
| MiMo-V2.5 Flash | $0.10 | $0.30 | $0.40 | Xiaomi MiMo |
| MiniMax M2.7 | $0.30 | $1.20 | $1.50 | MiniMax |
| Gemini 3.1 Flash-Lite | $0.25 | $1.50 | $1.75 | Google |
| MiMo-V2.5 | $0.40 | $2.00 | $2.40 | Xiaomi MiMo |
| Kimi-K2.6 | $0.95 | $4.00 | $4.95 | Moonshot/Kimi |
| GLM-5 | $1.00 | $3.20 | $4.20 | Z.ai |
| Grok 4.3 (low context) | $1.25 | $2.50 | $3.75 | xAI |
| DeepSeek V4 Pro | $1.74 | $3.48 | $5.22 | DeepSeek |
| GLM-5.1 | $1.40 | $4.40 | $5.80 | Z.ai |
| Claude Haiku 4.5 | $1.00 | $5.00 | $6.00 | Anthropic |
| Grok 4.3 (high context) | $2.50 | $5.00 | $7.50 | xAI |
| Qwen3.7-Max | $2.50 | $7.50 | $10.00 | Alibaba Cloud |
| Gemini 3.5 Flash | $1.50 | $9.00 | $10.50 | Google |
| Gemini 3.1 Pro Preview (≤200K) | $2.00 | $12.00 | $14.00 | Google |
| GPT-5.4 | $2.50 | $15.00 | $17.50 | OpenAI |
| Gemini 3.1 Pro Preview (>200K) | $4.00 | $18.00 | $22.00 | Google |
| Claude Opus 4.7 | $5.00 | $25.00 | $30.00 | Anthropic |
| Claude Opus 4.8 | $5.00 | $25.00 | $30.00 | Anthropic |
| GPT-5.5 | $5.00 | $30.00 | $35.00 | OpenAI |

Modest gains over 4.7, but Mythos-class capabilities coming

On benchmarks, Opus 4.8 is a step up rather than a leap. It scores 88.6% on SWE-bench Verified (vs. 87.6% for Opus 4.7), 69.2% on the harder SWE-bench Pro (vs. 64.3%), and 74.6% on Terminal-Bench 2.1 (vs. 66.1%). Anthropic itself characterizes the model as “a modest but tangible improvement on its predecessor.”

It beats GPT-5.5 regular across at least 12 benchmarks, including most knowledge-work, coding (issue-level), agentic tool-use, and long-context benchmarks. GPT-5.5 wins on terminal/CLI workflows and is roughly tied on web browsing and graduate-level science.

The bigger signal sits in Anthropic’s internal capability ladder: Opus 4.8 lands between Opus 4.7 and the more capable Claude Mythos Preview, which is currently restricted to a small number of organizations under Project Glasswing for cybersecurity work. Anthropic says it expects to bring “Mythos-class models to all our customers in the coming weeks” once additional cyber safeguards are in place.

Several enterprise partners cited material gains. Databricks reported that Opus 4.8 unlocks “a step change in agentic reasoning” inside its Genie data agent, at “61% cheaper token cost than Opus 4.7” thanks to multimodal efficiency on PDFs and diagrams. Hebbia cited better citation precision and token efficiency on dense financial filings. Devin-maker Cognition said the release “translates directly into faster capability gains for engineers” and noted Opus 4.8 fixed comment-verbosity and tool-calling issues from 4.7. A computer-use vendor reported 84% on Online-Mind2Web, a jump over both Opus 4.7 and GPT-5.5.

Dynamic workflows: hundreds of parallel subagents

Alongside the model, Anthropic launched a research preview of dynamic workflows in Claude Code, a feature designed for tasks too large for a single context window.
Claude plans the work, spawns hundreds of parallel subagents, then verifies its own outputs before reporting back. Anthropic’s example: a codebase-scale migration “across hundreds of thousands of lines of code from kickoff to merge, with the existing test suite as its bar.”

Dynamic workflows is available on Claude Code’s Enterprise, Team, and Max plans.

Two smaller additions round out the release:

Effort control on claude.ai and Claude Cowork: A new selector lets users dial how much thinking Claude does per response; higher effort spends more tokens for better answers, while lower effort responds faster and burns rate limits more slowly. Available on all plans.

System entries inside the messages array on the API: Developers can now update Claude’s instructions mid-task (adjusting permissions, token budgets, or environment context as an agent runs) without breaking the prompt cache.

Honesty, and an “evaluation awareness” caveat

Anthropic is leading with honesty as a headline trait. The company’s alignment team reports Opus 4.8 is “around four times less likely than its predecessor to allow flaws in code it has written to pass unremarked,” and that misaligned behavior rates are now “substantially lower than Opus 4.7, and similar to our best-aligned model, Claude Mythos Preview.”

Indeed, a bar chart released by Anthropic shows how close Opus 4.8 is to the still selectively released Mythos in terms of its misalignment (a lower score is better), coming in at roughly 1.9, down from 2.5 for Opus 4.7 and effec …
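To make the pricing figures quoted earlier in the article concrete, here is a minimal sketch of the arithmetic for one job under each tier. The per-million-token prices come from the article; the tier labels (`opus-4.8`, `opus-4.8-fast`, `opus-4.7-fast`) are illustrative dictionary keys rather than API model IDs, and the workload size is a made-up assumption.

```python
# Prices in USD per million tokens (input, output), as quoted in the article.
# The keys are illustrative labels for this sketch, not API model identifiers.
PRICES = {
    "opus-4.8": (5.00, 25.00),         # regular mode
    "opus-4.8-fast": (10.00, 50.00),   # fast mode, new pricing
    "opus-4.7-fast": (30.00, 150.00),  # fast mode, previous pricing
}


def job_cost(tier: str, input_tokens: int, output_tokens: int) -> float:
    """Return the cost in USD of one job at the given tier's token prices."""
    in_price, out_price = PRICES[tier]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Hypothetical workload, chosen only for illustration.
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 500_000

for tier in PRICES:
    cost = job_cost(tier, INPUT_TOKENS, OUTPUT_TOKENS)
    print(f"{tier}: ${cost:.2f}")
```

For this assumed workload the old fast-mode price works out to three times the new one, matching the 3X figure, while fast mode still costs twice as much as regular mode for the same tokens.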

