Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

The battle between AI chatbots is more than a two-horse race. Anthropic, the company formed by several ex-OpenAI employees, claims its new Claude 3 language model outperforms ChatGPT and Google’s Gemini in several key industry benchmarks. It even hit “near-human” levels on some tasks, the company wrote in a blog.

There are three new chatbots under the Claude 3 umbrella, including Haiku, Sonnet, and Opus. Sonnet powers the Claude.ai chatbot and is offered for free with an email sign-in. Meanwhile, Opus is the largest and most powerful LLM and will be available with a $20 per month subscription via the “Claude Pro” service. It’s also multi-modal, so it can work with both text and image inputs, unlike past versions.

All Claude 3 models “can power live customer chats, auto-completions and data extraction tasks where responses must be immediate and in real-time,” the company said. On top of promising “near-instant results,” they can supposedly handle longer, multi-step instructions with increased accuracy.

Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

Opus showed better graduate-level reasoning than GPT-4, scoring 14.7 percent higher in that test than GPT-4. It also beat OpenAI’s chatbot in tasks involving math, coding, reasoning and knowledge.

They also top past Claude models. “For the vast majority of workloads, Sonnet is 2x faster than Claude 2 and Claude 2.1 with higher levels of intelligence. It excels at tasks demanding rapid responses, like knowledge retrieval or sales automation. Opus delivers similar speeds to Claude 2 and 2.1, but with much higher levels of intelligence,” according to Anthropic.

Meanwhile Haiku, the smallest version of Claude 3, is “the fastest and most cost-effective model on the market.” To that end, it’s capable of reading a dense research paper complete with charts and graphs in under three seconds.

The company also noted that Claude 3 “can process a wide range of visual formats, including photos, charts, graphs and technical diagrams,” aiding companies that use PDFs, flowcharts, or presentation slides. It’ll also be less likely to refuse harmless content thanks to a more nuanced understanding of requests, while still recognizing “real harm.”

Anthropic has said that Claude AI is guided by 10 secret foundational pillars of fairness. Claude 3 was trained on both nonpublic internal and public-facing data, using hardware from Amazon Web Services (AWS) and Google Cloud (Amazon recently invested $4 billion in Anthropic).

Claude 3 Opus and Claude 3 Sonnet are available now through Anthropic’s API, with Haiku set to follow soon. Sonnet is also accessible through Amazon Bedrock and in private preview on Google Cloud’s Vertex AI Model Garden.

This article originally appeared on Engadget at https://www.engadget.com/anthropic-says-its-new-claude-3-ai-chatbot-scores-better-on-key-benchmarks-than-gpt-4-071343736.html?src=rss

Go Here to Read this Fast! Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

Originally appeared here:
Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

More posts

Red Hat bets big on AI with its Neural Magic acquisition

How many software updates does the OnePlus 13 get?

The best air purifier for 2025

UK Government launches ransomware protection proposals