The Scale Team

65 articles

February 11, 2025

Advancing Safe and Reliable AI: Scale's Research in Post-Training, Reasoning, and Evaluation

Scale AI leads groundbreaking research to build safer, more capable AI systems through innovative approaches in post-training optimization, agent development, and evaluation frameworks. Its work spans improving model performance and reliability and developing robust safety measures, all while maintaining a commitment to open collaboration and industry-wide advancement. Through the Safety, Evaluations, and Alignment Lab (SEAL) and various research initiatives, Scale AI is shaping the future of responsible AI development.

February 10, 2025

Company

Scale AI Partnering with the U.S. AI Safety Institute to Evaluate AI Models

Scale’s AISI-approved AI model evaluations are setting a new standard for pre-deployment testing. By offering voluntary, efficient, and third-party validated assessments, we are empowering AI developers to create more reliable models—without the complexities that typically slow down the process.

January 23, 2025

Research

Scale AI and CAIS Unveil Results of Humanity’s Last Exam, a Groundbreaking New Benchmark

Scale AI and the Center for AI Safety (CAIS) are proud to publish the results of Humanity’s Last Exam, a groundbreaking new AI benchmark that was designed to test the limits of AI knowledge at the frontiers of human expertise.

January 3, 2025

Government

Scale Public Sector: Building on Our Progress in 2025

As we return to work after the holiday break, the Scale AI Public Sector team wanted to reflect on our work heading into 2025. As strategic rivalries continue to intensify and adversaries form new alliances globally to challenge U.S. leadership in AI, the mission of Scale’s Public Sector team has never been more vital. We are dedicated to ensuring that the U.S. and its allies have the best technology to lead in this increasingly complex global landscape. The snapshot below captures a few key highlights from last year:

November 19, 2024

Product

Microsoft Azure and Scale AI Collaborate to Help Enterprises Deliver Powerful GenAI Solutions

Microsoft Azure and Scale AI are collaborating to help enterprises deliver powerful agentic GenAI solutions with customized and fine-tuned Azure AI models.

November 5, 2024

Product

Defense Llama: The LLM Purpose-Built for American National Security

Scale AI is proud to announce Defense Llama, the Large Language Model (LLM) built on Meta’s Llama 3 that is specifically customized and fine-tuned to support American national security missions.

September 16, 2024

General

Submit Your Toughest Questions for Humanity's Last Exam

Scale AI and CAIS are excited to announce the launch of Humanity's Last Exam, a project aimed at measuring how close we are to achieving expert-level AI systems.

July 23, 2024

Product

Meta and Scale Partner to Drive Enterprise Adoption of Llama 3.1 405B Using Scale GenAI Platform

Scale is proud to be a Llama 3.1 Launch Partner! Llama 3.1 405B is the largest openly available foundation model, rivaling the best closed-source models. Meta and Scale partnered to help businesses customize, evaluate, and deploy Llama 3.1 405B for enterprise use cases using Scale GenAI Platform.

July 10, 2024

General

AWS + Scale Partner to Bring Generative AI to Enterprises and Public Sector Customers

Amazon Web Services (AWS) names Scale AI as the first model customization and evaluation partner on Amazon Bedrock.

June 6, 2024

Government

Responsible AI with Scale Evaluation for the Public Sector

With the rapid advancement of AI model capabilities, it is necessary, now more than ever, to test and evaluate AI systems to ensure that they are safe to deploy for their intended use cases. Scale AI is committed to promoting AI safety through our test and evaluation (T&E) offering, Scale Evaluation.