The industry’s first language model performance report ensures reliability, accuracy, and effectiveness in developing trustworthy industrial AI solutions.
AUSTIN, Texas & OSLO, Norway–BUSINESS WIRE–
Cognite, the global leader in AI for industry, today announced the launch of the Cognite Atlas AI™ LLM & SLM Benchmark Report for Industrial Agents. The first-of-its-kind report addresses the shortcomings of general benchmark datasets by tailoring large language model (LLM) and small language model (SLM) evaluations to focus on specialized industrial tasks, ensuring the reliability, accuracy, and effectiveness of industrial AI solutions.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20241013808704/en/
Cognite Atlas AI™ is an industrial agent workbench that extends Cognite Data Fusion®, the leading industrial Data and AI platform. With unmatched data management and comprehensive AI capabilities, Cognite earned Frost & Sullivan’s Global Company of the Year Award in the digital industrial platforms market and Frost Radar™: Digital Industrial Platforms market powerhouse status, solidifying the company as an authority on Data and AI for industry.
The Cognite Atlas AI™ Benchmark Report for Industrial Agents will initially focus on natural language search as a key data retrieval tool for industrial AI agents. The test set includes a wide range of data models designed for sectors like Oil & Gas and Manufacturing, with real-life question-answer pairs to evaluate performance across different scenarios. Answers are assessed using a wide range of evaluation metrics. These benchmark datasets enable systematic evaluation of the system’s performance in answering complex questions, like tracking open safety-critical work orders in a facility.
Future versions of the report will evaluate additional AI tools, such as those for summarizing, analyzing, and reasoning with industrial data, to assess the full performance of industrial AI agents.
The first Cognite Atlas AI™ LLM & SLM Benchmark Report for Industrial Agents will be available to