Google’s Gemini 1.5 Pro dethrones GPT-4o

Google’s experimental Gemini 1.5 Pro model has surpassed OpenAI’s GPT-4o in generative AI benchmarks.

For the past year, OpenAI’s GPT-4o and Anthropic’s Claude-3 have dominated the landscape. However, the latest version of Gemini 1.5 Pro appears to have taken the …

Before launching, GPT-4o broke records on chatbot leaderboard under a secret name

Enlarge (credit: Getty Images)

On Monday, OpenAI employee William Fedus confirmed on X that a mysterious chat-topping AI chatbot known as "gpt-chatbot" that had been undergoing testing on LMSYS's Chatbot Arena and frustrating experts was, in fact, OpenAI's newly announced

Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts

Enlarge (credit: Getty Images)

On Sunday, word began to spread on social media about a new mystery chatbot named "gpt2-chatbot" that appeared in the LMSYS Chatbot Arena. Some people speculate that it may be a secret test version of OpenAI's

Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts

Enlarge (credit: Getty Images)

On Sunday, word began to spread on social media about a new mystery chatbot named "gpt2-chatbot" that appeared in the LMSYS Chatbot Arena. Some people speculate that it may be a secret test version of OpenAI's

Words are flowing out like endless rain: Recapping a busy week of LLM news

Enlarge / An image of a boy amazed by flying letters. (credit: Getty Images)

Some weeks in AI news are eerily quiet, but during others, getting a grip on the week's events feels like trying to hold back the tide.

“The king is dead”—Claude 3 surpasses GPT-4 on Chatbot Arena for the first time

Enlarge (credit: Getty Images / Benj Edwards)

On Tuesday, Anthropic's Claude 3 Opus large language model (LLM) surpassed OpenAI's GPT-4 (which powers ChatGPT) for the first time on Chatbot Arena, a popular crowdsourced leaderboard used by AI researchers to gauge

文 » A