Scale AI Closes $1 Billion Round, Unveils Expert-rated LLM Leaderboards

Key takeaways:

Scale AI, founded by an MIT dropout, has raised $1 billion in a Series F round, with Amazon and Meta among the new investors.
The data labeling startup serves nearly all the leading AI models and counts OpenAI, Meta, Microsoft and others as clients.
Scale also launched expert-rated LLM leaderboards using private data that it said “can’t be gamed.”

Scale AI, a data labeling startup that serves nearly all the leading AI models, has closed a $1 billion Series F financing round at a reported valuation of $13.8 billion, with nearly all existing investors participating – plus new ones including Amazon and Meta.

The latest funding for Scale AI, founded by MIT dropout Alexandr Wang, was led by existing investor Accel, with participation from Y Combinator, Nat Friedman, Index Ventures, Founders Fund, Coatue, Thrive Capital, Spark Capital, NVIDIA, Tiger Global Management, Greenoaks, and Wellington Management.

New investors are Amazon, Meta, AMD Ventures, Qualcomm Ventures, Cisco Investments, Intel Capital, ServiceNow Ventures, DFJ Growth, WCM, and Elad Gil.

“In 2016, I was studying AI at MIT. Even then, it was clear that AI is built from three fundamental pillars: data, compute, and algorithms. I founded Scale to supply the data pillar that advances AI by fueling its entire development lifecycle,” Wang wrote in a blog post.

Since then, Scale AI has grown in scope to supply data to the AI models of OpenAI, Meta, Microsoft and others. Last August, OpenAI named Scale AI as its preferred partner to help clients fine-tune OpenAI models for their own purposes.

Expert-rated LLM leaderboards

Scale AI also launched its own leaderboards of AI models that use private datasets, which it says “can’t be gamed.” Leaderboards are important because they provide a standardized framework by which to evaluate LLMs, so users can choose the best model for their purposes.

However, existing leaderboards face criticism over their reliability: human voting can introduce bias, and a model may have been trained on the same dataset as the benchmarks, among other maladies. Popular leaderboards and benchmarks include Hugging Face, MMLU, Chatbot Arena, MT-Eval and others.

Scale AI hopes to offer more reliable metrics. Its leaderboards use Elo-scale rankings, based on the mathematical system that ranks players by skill level in games like chess. The rankings are calculated from evaluations by experts using domain-specific methodologies: human evaluators compare the responses of two models to the same prompt and rate which one is better across several domains and capabilities.
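To illustrate how pairwise judgments can turn into a ranking, here is a minimal sketch of the standard Elo update used in chess. It is not Scale AI's actual methodology, which the company has not detailed here; the K-factor of 32, the starting rating of 1000 and the model names are all assumptions chosen for illustration.

```python
def expected_score(rating_a, rating_b):
    """Probability that A is preferred over B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a, rating_b, score_a, k=32.0):
    """Return updated (rating_a, rating_b) after one comparison.

    score_a is 1.0 if the evaluator preferred A, 0.0 if B, 0.5 for a tie.
    k (assumed to be 32 here) controls how far one result moves a rating.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - exp_a)
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - exp_a))
    return new_a, new_b


def rank_models(comparisons, initial=1000.0, k=32.0):
    """Process (model_a, model_b, score_a) judgments in order and rank models."""
    ratings = {}
    for model_a, model_b, score_a in comparisons:
        ra = ratings.setdefault(model_a, initial)
        rb = ratings.setdefault(model_b, initial)
        ratings[model_a], ratings[model_b] = update_elo(ra, rb, score_a, k)
    # Highest rating first.
    return sorted(ratings.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical judgments: each tuple is one evaluator comparing two responses.
    judgments = [
        ("model_x", "model_y", 1.0),  # evaluator preferred model_x
        ("model_y", "model_z", 0.5),  # tie
        ("model_x", "model_z", 1.0),  # evaluator preferred model_x
    ]
    for name, rating in rank_models(judgments):
        print(f"{name}: {rating:.1f}")
```

A win against a higher-rated model moves a rating more than a win against a lower-rated one, which is why the order and pairing of comparisons matter in this simple sequential version.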

Author

Deborah Yao
