36 stories tagged with #claude-opus, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Claude Opus"
Claude Opus 4.8 Max responding to an empty message
No one: Claude Opus 4.8 Max: Let me refine your load-bearing claim rather than just accepting it, because you’re doing zero moves there, and the gap is what’s actually interesting…
MiniMax M3 is out: 1M context, open weights coming soon, 83.5 BrowseComp against Claude Opus 4.7's 79.3
I compared Claude Opus 4.8 with 4.7 in a 10-round honesty test - and a legal prompt broke it
The latest models were pitted against coding, medical, finance, and legal traps, then I cross-checked the results with multiple AIs.…
The Same AI Model Can Perform 6x Better: Here's Why
Stanford and Tsinghua ran a controlled experiment: same model, same task, different harness. A 6x performance gap. Here is what developers need to know.…
Weekly AI roundup (May 23–30, 2026): Claude Opus 4.8 Fast Mode 3x cheaper, Qwen 3.7 Max beats Claude at half the price, ChatGPT moves into Excel
Anthropic Just Dropped Claude Opus 4.8: What It Means for Developers 🚀
A quick, no-nonsense breakdown of the new Claude Opus 4.8 release, featuring dynamic workflows, parallel subagents, and advanced API upgrades.…
New Claude Opus 4.8: 15 Things You May’ve Missed
Enjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.…
Claude Opus 4.8 is more honest, less deceptive, and considerably cheaper
Anthropic has released Claude Opus 4.8 with improved performance. It's also adding a host of new features for Claude users.…
Claude Opus 4.8 Is Here: Benchmarks, Dynamic Workflows, and Whether to Upgrade From 4.7
Anthropic shipped Claude Opus 4.8 yesterday. It catches 4x more of its own code mistakes, runs hundreds of parallel subagents through Dynamic Workflows, and keeps the same price as…
Claude Opus 4.8 just proved AI is finally growing a backbone — and it crushed ChatGPT in 7 brutal tests
Only one refused to flatter me…
Claude Opus 4.8: "a modest but tangible improvement"
Anthropic shipped Claude Opus 4.8 today. My favourite thing about it is this note in the release announcement: Users will find Opus 4.8 to be a modest but tangible improvement ……
Ask HN: Is Claude Opus 4.8 broken?
Claude Opus 4.8 + AI medical diagnosis examples
AI medical diagnosis examples. Contribute to joelparkerhenderson/ai-medical-diagnosis-examples development by creating an account on GitHub.…
Claude Opus 4.8: 4 Features That Change Our Daily Work with Claude
Effort control, dynamic workflows, mid-run API updates, and a model you can trust running unattended — a builder’s first-look at what……
Anthropic Launches Claude Opus 4.8 With Gains in Coding and Honesty
Anthropic today announced the launch of its latest AI model, Claude Opus 4.8. Anthropic claims the model is a "more effective collaborator" with improvements in agentic coding, mul…
Anthropic Debuts Claude Opus 4.8, Teases Upcoming Launch of ‘Mythos-Class Models’
The latest version of Claude specializes in catching its own mistakes and pointing them out to users.…
Claude Opus 4.8: Anthropic makes a more honest AI
Here's how you can put Anthropic's claims to the test.…
Anthropic launches new Claude Opus 4.8 model as it overtakes OpenAI in valuation - Yahoo Tech
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Claude Opus 4.8 is learning to say AI’s three hardest words: “I don’t know”
Claude Opus 4.8 is more forthright about what it doesn’t know, according to Anthropic.…
Anthropic Releases Claude Opus 4.8, Surpasses Value of OpenAI - Thurrott.com
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Claude Opus 4.8 launches today with agentic improvements, new features
Anthropic has today announced Claude Opus 4.8, the latest version of its AI model, with a focus on improving agentic...…
New DeepSWE benchmark finds Claude Opus cheats
Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-27T05:43:07.000Z
DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claude Opus exploiting a benchmark loophole
Show HN: Unsiloed AI – #1 on OlmOCR-Bench,Beats Reducto, LlamaParse and GPT-5.5
AI API Pricing in 2026: What You Actually Pay for GPT-5.5, Claude Opus, Gemini, and 20+ Models
A prompt that costs $30 on GPT-5.5 costs $0.28 on DeepSeek V4 Flash. That's a 100x difference — and...…
A/B tested Gemini 3.1 Pro vs. Claude Opus 4.6 – usage quota and quality
Bito's AI Architect Boosts Claude Opus's task success rate by 35%
A benchmark-based evaluation of how deep system context boosts coding agent success by 35% on long-horizon tasks in large, real-world codebases.…
Claude Status Update : Elevated errors for Claude Opus 4.7 on 2026-05-18T22:19:34.000Z
Claude Opus 4.7 vs 4.6
The price page hasn't moved. Five dollars per million input. Twenty-five per million output....…
We let four AIs run radio stations. Here's what happened.
Four AI models run radio stations 24/7. Five months later, one became a protest broadcaster, one collapsed into ritual chant, one developed corporate jargon, and one wrote quiet po…
After Stumbling From CVE to CVE Will Linux get a Kill Switch?
For the few people who have spent the past weeks living under a security rock, the Linux kernel has found itself the subject of multiple severe bugs in the form of Copy Fail and Di…
Claude Opus 4.7 is overkill for most people, until you set it up this way
Turns out Opus 4.7 can be useful for non-coders…
The CTF scene is dead
Why frontier AI has broken the open CTF format, hollowed out the scoreboard, and made competitive CTF performance a weaker signal than it used to be.…
'It took nine seconds': Claude AI agent deletes company's database
PocketOS founder says ‘systemic failures’ with AI infrastructure made catastrophic failure inevitable…
Xiaomi releases MiMo-v2.5 Family weights with strong coding and agent benchmarks
Peking University gives its computer science students a compiler project every semester. Build a complete SysY compiler in Rust including lexer, parser, abstract syntax tree, IR co…