Peking University and DeepSeek release DSpark for faster LLM inference

Peking University and DeepSeek have jointly open-sourced DSpark, a speculative decoding framework designed to improve the efficiency of large language model inference. The release focuses on accelerating model responses while maintaining performance under strict latency requirements.

DSpark reportedly speeds up LLM inference by 60-85% and can deliver throughput gains of up to 661% under strict latency constraints. The framework positions speculative decoding as a practical route to faster deployment of language models where response time and serving capacity are critical.
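
For readers unfamiliar with the technique, the sketch below shows the general idea of speculative decoding in its simplest greedy form: a cheap draft model proposes several tokens, and the larger target model verifies them, keeping the matching prefix. It is a toy illustration only and is not DSpark's implementation, which the report does not describe. The names `speculative_decode`, `greedy_decode`, `toy_target`, and `toy_draft` are hypothetical, and the "models" are plain Python functions standing in for real networks.

```python
from typing import Callable, List

Token = int
# Assumption: a "model" is any function mapping a token prefix to its greedy next token.
NextToken = Callable[[List[Token]], Token]


def greedy_decode(target: NextToken, prompt: List[Token], max_new: int) -> List[Token]:
    """Baseline: one target-model call per generated token."""
    out = list(prompt)
    for _ in range(max_new):
        out.append(target(out))
    return out


def speculative_decode(target: NextToken, draft: NextToken,
                       prompt: List[Token], max_new: int, k: int = 4) -> List[Token]:
    """Greedy speculative decoding: output matches greedy_decode(target, ...)."""
    out = list(prompt)
    goal = len(prompt) + max_new
    while len(out) < goal:
        # 1. The draft model cheaply proposes up to k tokens, one after another.
        proposal: List[Token] = []
        for _ in range(min(k, goal - len(out))):
            proposal.append(draft(out + proposal))
        # 2. The target model checks each proposed position. A real system would
        #    score all positions in one batched forward pass; we loop for clarity.
        accepted: List[Token] = []
        for tok in proposal:
            expected = target(out + accepted)
            accepted.append(expected)  # always keep the target's own token
            if tok != expected:
                break  # first mismatch: discard the remaining draft tokens
        out.extend(accepted)
    return out


def toy_target(prefix: List[Token]) -> Token:
    # Hypothetical "large" model: counts upward modulo 10.
    return (prefix[-1] + 1) % 10


def toy_draft(prefix: List[Token]) -> Token:
    # Hypothetical "small" model: agrees with the target except after tokens 2, 5, 8.
    return 0 if prefix[-1] % 3 == 2 else (prefix[-1] + 1) % 10
```

Because every kept token is the one the target model would have chosen, the output is identical to plain greedy decoding with the target. Any speed-up comes from verifying several draft tokens per target pass, so the gain depends on how often the draft model agrees with the target.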

Originally reported by pandaily.com.