Content Deep Dive
How we run GPT OSS 120B at 500+ tokens per second on NVIDIA GPUs
Company
Baseten
Date Published
Aug. 7, 2025
Author
Amir Haghighat 4 others
Word count
938
Language
English
Hacker News points
None
URL
www.baseten.co/blog/sota-performance-for-gpt-oss-120b-on-nvidia-gpus
Summary
No summary generated yet.