Anthropic Claude Sonnet PNG

Anthropic Claude: How to use the impressive ChatGPT rival

“For the vast majority of workloads, Sonnet is 2x faster than Claude 2 and Claude 2.1 with higher levels of intelligence,” Anthropic wrote in the Claude 3 announcement post. “It excels at ...

The work tasks people use Claude AI for most, according to Anthropic

Anthropic's first Economic Index sheds light on who's using AI for what, and how much of our work it's actually automating.

Anthropic dares you to try to jailbreak Claude AI

Anthropic developed a defense against universal AI jailbreaks for Claude called Constitutional Classifiers - here's how it ...

ZDNet5d

Anthropic offers $20,000 to whoever can jailbreak its new AI safety system

After improving it, Anthropic ran a test of 10,000 synthetic jailbreaking attempts on an October version of Claude 3.5 Sonnet with and without classifier protection using known successful attacks.

TechRadar8d

Anthropic has a new security system it says can stop almost all AI jailbreaks

Anthropic unveils new proof-of-concept security measure tested on Claude 3.5 Sonnet “Constitutional classifiers” are an attempt to teach LLMs value systems Tests resulted in more than an 80% ...

heise online7d

Anthropic: users to put jailbreak protection for AI chatbot to the test

In Anthropic's internal test, the unprotected version of Claude 3.5 Sonnet is said to have blocked only 14 percent of unauthorized requests. A version protected with the filter system, on the ...

VentureBeat8d

Anthropic claims new AI security method blocks 95% of jailbreaks, invites red teamers to try

Claude 3.5 Sonnet. It does this while minimizing over-refusals (rejection of prompts that are actually benign) and and doesn’t require large compute. The Anthropic Safeguards Research Team has ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results