资讯

根据 OpenAI 提供的数据,o3-pro 在人类测试者中的胜率为 64%,在 4 项稳定性测试中也略优于 o3。但正如 Sam Altman 所说,当你「以不同方式」使用它时,才能真正看到它的能力扩展。
The reasoning capabilities in vision-based and multimodal systems still lag in abstract problem-solving tasks.
克雷西 发自 凹非寺量子位 | 公众号 QbitAI 1.93bit量化之后的 DeepSeek-R1(0528),编程能力依然能超过Claude 4 Sonnet? 最新优化版R1在编程榜单aider上取得了60%的成绩,不仅超过了Claude 4 ...
Anthropic has launched Claude 4, featuring two new AI models: Opus 4 and Sonnet 4. Think of them as super-smart digital assistants. Opus 4 is the powerhouse, excelling at complex tasks like coding ...
Social media platform Reddit sued the artificial intelligence company Anthropic on Wednesday, alleging that it is illegally "scraping" the comments of Reddit users to train its chatbot Claude.
Windsurf, the popular vibe-coding startup that’s reportedly being acquired by OpenAI, says Anthropic significantly reduced its first-party access to its Claude 3.7 Sonnet and Claude 3.5 Sonnet ...
In a recent study from the research arm of Michael Bloomberg's media empire that was presented at a computational linguistics conference in April, 11 of the latest LLMs, including OpenAI's GPT-4o, ...
Anthropic has unveiled Claude 3.5 Sonnet, the latest addition to its AI model lineup, marking a significant leap in performance, usability, and enterprise readiness. This model surpasses its ...
AI I put ChatGPT-4o vs Claude 3.7 Sonnet through a 7-round face-off — one left the other in the dust AI I tested Gemini 2.5 Pro vs Claude 4 Sonnet with the same 7 prompts — here’s who came ...
You may like Claude's voice mode is now free for everyone — here's how to try it Claude 3.7 Sonnet now supports real-time web searching — but there's a catch ChatGPT's Advanced Voice Mode just ...
Anthropic is rolling out a new voice mode for its AI chatbot Claude ... while Sonnet is a lightweight model designed for regular use and a big improvement on the previous Sonnet 3.7 model.
Enter Claude 4 Sonnet and Opus ... or time-sensitive projects. Priced at $3 per 1 million input tokens and $15 per 1 million output tokens, Sonnet is a more accessible option for developers.