Company
Date Published
Author
Anket Sah
Word count
411
Language
English
Hacker News points
None

Summary

The DeepSeek V3-0324 endpoint is a new AI development platform that offers lightning-fast responses, up to 128K massive context window, and no rate limiting, all for the low price of $0.88 per 164K output. It features a 685B parameter model with a Mixture-of-Experts (MoE) design and has been trained on 14.8 trillion tokens using an auxiliary-loss-free load balancing strategy and multi-token prediction (MTP). The endpoint outperforms other models in structured reasoning and creative tasks, achieving high scores in benchmarks such as MATH-500, Massive Multitask Language Understanding(MMLU Pro), GPQA Diamond, AIME, and LiveCodeBench. With its ease of integration on the Lambda Inference API, developers can quickly get started with DeepSeek v3-0324 and unlock open-source inference without artificial limitations.