Lmarena Coding - Search News

Hosted on MSN

xAI hired gig workers to boost Grok on a key AI leaderboard and 'beat' Anthropic's Claude in coding

Tech companies are fiercely competing to build the best AI coding tools — and for xAI, the top rival to beat seems to be Anthropic. Elon Musk's AI company used contractors to train Grok on coding ...

NextBigFuture

XAI Releases Grok 4.1 and It Tops the LMArena Leaderboard

In LMArena, Grok4.1 (Thinking) and Grok4.1 ranks first. In the earlier benchmark tests, Grok4.1 (Thinking) ranked first with a score of 1510. Currently, it is still first but with a score of 1483.

Mashable

LMArena has some competition: Scale AI launches Seal Showdown, a new benchmarking tool

In the years since OpenAI launched ChatGPT to the world, kicking off the generative AI boom, developers have relied on LMArena (previously Chatbot Arena) as the default AI leaderboard. Now, Scale AI ...

The Next Web

Who decides the best AI?

The AI industry has become adept at measuring itself. Benchmarks improve, model scores rise, and every new release arrives with a list of metrics meant to signal progress. And yet, somewhere between ...

Yahoo Finance

LMArena Raises $150 Million to Build the World's Most Trusted AI Evaluation Platform

SAN FRANCISCO, Jan. 6, 2026 /PRNewswire/ -- LMArena, the community platform redefining how the world measures the progress of AI, today announced it has raised $150 million in new funding, achieving a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results