Company
Date Published
Author
-
Word count
466
Language
English
Hacker News points
None

Summary

Modal has introduced a new inference-focused accelerator, the NVIDIA L40S GPU, priced at $1.95/hr, which offers substantial performance benefits over the platform's current most popular accelerator, the NVIDIA A10 GPU. The L40S provides twice the on-device GDDR6 memory of the A10, allowing users to run larger models on larger inputs without a throughput-killing offload to CPU RAM. Its higher memory bandwidth gives roughly a 40% speedup for memory-bound jobs, and its higher arithmetic throughput gives a speedup of more than 100% for compute-bound jobs that use 16-bit Tensor Cores. The L40S also improves on the A10 in streaming multiprocessor architecture and compute capability. Modal users can now access the new accelerator through the platform, and new sign-ups receive $30/month in free compute.
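The memory and bandwidth claims can be checked with back-of-the-envelope arithmetic. The sketch below is illustrative only: the capacity and bandwidth figures are assumptions taken from NVIDIA's public datasheets (they do not appear in the summary), the helper names are hypothetical, and the fit check counts weights only, ignoring activations and KV cache.

```python
# Assumed spec figures from NVIDIA's public datasheets, not from the summary.
GPUS = {
    "A10": {"mem_gb": 24, "bandwidth_gb_s": 600},
    "L40S": {"mem_gb": 48, "bandwidth_gb_s": 864},
}


def weights_gb(params_billions: float, bytes_per_param: int = 2) -> float:
    """Approximate weight footprint in GB (2 bytes per parameter = 16-bit)."""
    # 1e9 parameters * bytes per parameter / 1e9 bytes per GB
    return params_billions * bytes_per_param


def fits_on_device(params_billions: float, gpu: str, bytes_per_param: int = 2) -> bool:
    """True if the weights alone fit in GPU memory (no activations or KV cache)."""
    return weights_gb(params_billions, bytes_per_param) < GPUS[gpu]["mem_gb"]


def memory_bound_speedup(new: str, old: str) -> float:
    """Bandwidth ratio, a rough upper bound on speedup for memory-bound jobs."""
    return GPUS[new]["bandwidth_gb_s"] / GPUS[old]["bandwidth_gb_s"]
```

Under these assumptions, a hypothetical 13-billion-parameter model in 16-bit precision needs about 26 GB for weights, which exceeds the A10's memory but fits on the L40S, and `memory_bound_speedup("L40S", "A10")` comes out near 1.44, in line with the roughly 40% figure quoted above.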