NVDA 208.65 ▼0.97%GOOGL 349.68 ▼4.99%MSFT 367.34 ▼3.18%AMD 551.63 ▲2.65%INTC 140.94 ▲5.19%TSMC 467.67 ▲1.20%AMZN 232.79 ▼4.75%META 563.85 ▼2.32%AAPL 297.01 ▼0.34%PLTR 119.50 ▼6.98%
Markets at last close

Nvidia · Robotics

Robot world models draw $6 billion but lack a shared foundation

·1 min read

World model companies drew approximately $6 billion in Q1 2026 alone, fueled by the idea that physical AI can follow the path transformers created for language models. Fusion Fund investors Charlotte Xia and Matt Wong argue the analogy is incomplete because robots lack the equivalent of a universal text token: joint angles, point clouds, force readings, video streams and action spaces vary across bodies and sensors.

NVIDIA’s Cosmos 3 launch and DreamZero results gave the thesis a concrete boost. Cosmos 3 is described as an open omnimodel trained on 20 trillion tokens of multimodal data, while a 14-billion-parameter DreamZero variant achieved a 2x improvement over VLA baselines on zero-shot generalization, reaching 62% average task progress versus 27% for pretrained VLA baselines.

The field remains split among pixel-based, explicit 3D geometric and latent representations, with no clear winner. World Labs has raised $1 billion around explicit 3D work, AMI Labs raised $1.03 billion around latent models and Physical Intelligence raised $600 million around VLA approaches, reinforcing architectural divergence.

The analysis points to shared evaluation, data curation and full-stack vertical deployment as durable opportunities. Until robotics finds a common measurement layer or a physical equivalent of tokens, world models may advance quickly without converging around a single architecture.

Originally reported by techtimes.comRead the source →
Related coverage
All Nvidia news →