Model Blog Post - Search News

Why OpenAI's 'goblin' problem matters — and how you can release the goblins on your own

If OpenAI can accidentally train its flagship model to obsess over goblins, what other more subtle and potentially harmful ...

‘The Goblins Came Back to Haunt Us’: OpenAI Explains How ChatGPT’s ‘Nerdy’ Personality Got Out of Control

For at least a year, some ChatGPT users have noticed the LLM’s quirky habit of bringing up goblins, gremlins, trolls, and other creatures in its answers. The weird tic apparently became more common as ...

Hosted on MSN

Meet Claude Mythos: Leaked Anthropic post reveals the powerful upcoming model

An accidental leak has now been officially confirmed by AI company Anthropic regarding its most powerful AI model yet. The model, now known as "Claude Mythos," was originally uncovered in a report ...

OpenAI tells ChatGPT models to stop talking about goblins

The ChatGPT developer discovered the "strange affinity for goblins" while testing tools powered by newer systems, such as its ...

ZDNet

Anthropic retired a popular AI model and now it's blogging on Substack

Anthropic's retired Opus 3 AI model will write a blog. The "philosophical and whimsical" model was asked what it wanted to do next. Claude says it wants to explore the relationship between humans and ...

Ars Technica

Google’s AI Overview is flawed by design, and a new company blog post hints at why

On Thursday, Google capped off a rough week of providing inaccurate and sometimes dangerous answers through its experimental AI Overview feature by authoring a follow-up blog post titled, “AI ...

1monon MSN

Anthropic’s next model could be a ‘watershed moment’ for cybersecurity. Experts say that could also be a concern

The next wave of AI-powered cybersecurity attacks will be like nothing we’ve seen before.

CSOonline

Leak reveals Anthropic’s ‘Mythos,’ a powerful AI model aimed at cybersecurity use cases

The draft blog post describes a compute‑intensive LLM with advanced reasoning that Anthropic plans to roll out cautiously, starting with enterprise security teams. Anthropic didn’t intend to introduce ...

27d

Anthropic’s latest AI model could let hackers carry out attacks faster than ever. It wants companies to put up defenses first

Anthropic will make its new AI model available to some of the world’s biggest cybersecurity and software firms in an effort ...

26d

Meta debuts new AI model, attempting to catch Google, OpenAI after spending billions

Meta debuted its first major large language model, Muse Spark, spearheaded by chief AI officer Alexandr Wang, who leads Meta Superintelligence Labs.

1mon

Meet Claude Mythos: Leaked Anthropic post reveals the powerful upcoming model

In a statement provided to Fortune after the leak, an Anthropic spokesperson described Claude Mythos as an AI performance ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results