NVDA 208.65 ▼0.97%GOOGL 349.68 ▼4.99%MSFT 367.34 ▼3.18%AMD 551.63 ▲2.65%INTC 140.94 ▲5.19%TSMC 467.67 ▲1.20%AMZN 232.79 ▼4.75%META 563.85 ▼2.32%AAPL 297.01 ▼0.34%PLTR 119.50 ▼6.98%
Markets at last close

AMD · Chips

AMD Instinct MI355X passes 1M tokens/sec in MLPerf 6.0

·1 min read

AMD positioned its MLPerf Inference 6.0 submission as more than a routine performance update. The company said it expanded into first-time workloads, crossed the 1-million-tokens-per-second threshold at multinode scale, and showed that partners can reproduce the results across a broader ecosystem. The submission was presented as a response to customer demand for inference platforms that balance single-node speed, scale-out efficiency, faster model bring-up, reproducibility across partner systems, and confidence that the software stack can keep pace.

AMD said MLPerf Inference 6.0 gave it a way to show those capabilities in one submission. The company framed the results around a broader evaluation standard for inference infrastructure, where customers are no longer focused on just one metric. Competitive single-node performance and efficient scaling were highlighted alongside software readiness and the ability to support new models more quickly.

AMD also emphasized that the figures were not isolated to its own submission. A broad partner ecosystem submitted across four AMD Instinct GPU types that closely reproduced numbers submitted by AMD. The company said this wider participation strengthens its case that the platform can deliver consistent inference results beyond a single in-house configuration.

AMD further pointed to the first three-GPU heterogeneous MLPerf submission as evidence that AMD hardware and AMD ROCm software can orchestrate meaningful inference throughput even across systems in different geographies. That result was used to underscore the role of both hardware and software in scaling deployments and in supporting more varied system configurations.

Originally reported by techpowerup.comRead the source →
Related coverage
All AMD news →