Inference Test - Search News

14d

Train-to-Test scaling explained: How to optimize your end-to-end AI compute budget for inference

AI reasoning does not necessarily require spending huge amounts on frontier models. Instead, smaller models can yield ...

1mon

MLCommons Releases New MLPerf Inference v6.0 Benchmark Results

Today, MLCommons ® announced new results for its industry-standard MLPerf ® Inference v6.0 benchmark suite. This release includes several important advances that ensure the benchmark suite tests ...

SiliconANGLE

MLCommons releases results of its latest MLPerf AI inference benchmark test

MLCommons today released the latest results of its MLPerf Inference benchmark test, which compares the speed of artificial intelligence systems from different hardware makers. MLCommons is an industry ...

SDxCentral

NVIDIA Grace Hopper superchip sweeps MLPerf inference benchmarks

In its debut on the MLPerf industry benchmarks, the NVIDIA GH200 Grace Hopper Superchip ran all data center inference tests. The GH200 links a Hopper GPU with a Grace CPU in one superchip. The ...

Hosted on MSN

NVIDIA GB300 GPUs deliver huge AI efficiency gains in Deepseek R1 inference test

NVIDIA’s latest Blackwell-based GB300 GPUs are starting to show what they can do, and early results point to a massive jump in efficiency compared to the company’s previous generation. A recent ...

VentureBeat

Hugging Face shows how test-time scaling helps small language models punch above their weight

In a new case study, Hugging Face researchers have demonstrated how small language models (SLMs) can be configured to outperform much larger models. Their findings show that a Llama 3 model with 3B ...

EurekAlert!

Common way to test for leaks in large language models may be flawed

This slide shows how a membership inference attack might start. Assessing the product of an app asked to generate an image of a professor teaching students in “the style of” artist Monet could lead to ...

The Next Platform

Google Shows Off Its Inference Scale And Prowess

If the hyperscalers are masters of anything, it is driving scale up and driving costs down so that a new type of information technology can be cheap enough so it can be widely deployed. The ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results