Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

Whereas Claude Opus 4 might be restricted to paying Anthropic clients, a second mannequin, Claude Sonnet 4, might be obtainable for each paid and free tiers of customers. Opus 4 is being marketed as a robust, giant mannequin for complicated challenges, whereas Sonnet 4 is described as a wise, environment friendly mannequin for on a regular basis use.

Each of the brand new fashions are hybrid, that means they’ll supply a swift reply or a deeper, more reasoned response relying on the character of a request. Whereas they calculate a response, each fashions can search the net or use different instruments to enhance their output.

AI corporations are at present locked in a race to create really helpful AI agents which might be capable of plan, cause, and execute complicated duties each reliably and free from human supervision, says Stefano Albrecht, director of AI on the startup DeepFlow and coauthor of Multi-Agent Reinforcement Studying: Foundations and Fashionable Approaches. Typically this includes autonomously utilizing the web or different instruments. There are nonetheless security and safety obstacles to beat. AI brokers powered by giant language fashions can act erratically and perform unintended actions—which turns into much more of an issue once they’re trusted to behave with out human supervision.

“The extra brokers are capable of go forward and do one thing over prolonged intervals of time, the extra useful they are going to be, if I’ve to intervene much less and fewer,” he says. “The brand new fashions’ means to make use of instruments in parallel is attention-grabbing—that might save a while alongside the way in which, in order that’s going to be helpful.”

For example of the kinds of questions of safety AI corporations are nonetheless tackling, brokers can find yourself taking surprising shortcuts or exploiting loopholes to succeed in the objectives they’ve been given. For instance, they could ebook each seat on a aircraft to make sure that their consumer will get a seat, or resort to creative cheating to win a chess game. Anthropic says it managed to scale back this habits, often called reward hacking, in each new fashions by 65% relative to Claude Sonnet 3.7. It achieved this by extra intently monitoring problematic behaviors throughout coaching, and bettering each the AI’s coaching surroundings and the analysis strategies.

Source link

Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

Manus has kick-started an AI agent boom in China

What’s next for AI and math

Inside the tedious effort to tally AI’s energy appetite

Best Buy Offers HP 14-Inch Chromebook for Almost Free for Memorial Day, Nowhere to be Found on Amazon

The Best Sleeping Pads For Campgrounds—Our Comfiest Picks (2025)

Time has a new look: HUAWEI WATCH 5 debuts with exclusive watch face campaign

Most Popular

Best Buy Offers HP 14-Inch Chromebook for Almost Free for Memorial Day, Nowhere to be Found on Amazon

The Best Sleeping Pads For Campgrounds—Our Comfiest Picks (2025)

Time has a new look: HUAWEI WATCH 5 debuts with exclusive watch face campaign

Our Picks

MPL Philippines breaks peak viewership record with 1.8m viewers

I replaced my iPad with a $100 Android tablet, and here’s my buying advice after a week

Get the Toornament Report 2024

Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

Related Posts