Home / Companies / Northflank / Blog / Post Details
Content Deep Dive

How to deploy and self-host DeepSeek-V3.1 on Northflank

Blog post from Northflank

Post Details
Company
Date Published
Author
Will Stewart
Word Count
996
Company Posts That Month
35
Language
English
Hacker News Points
-
Summary

DeepSeek-V3.1 is a significant advancement in the DeepSeek family of large language models, offering a 671B parameter Mixture-of-Experts architecture with a 128K context window for enhanced reasoning capabilities. This model supports both chat and think inference modes, allowing users to toggle between standard interactions and more reasoning-intensive tasks via an Open WebUI interface. It operates efficiently on 8× NVIDIA H200 GPUs using vLLM, providing high-throughput inference and improved reasoning speed compared to its predecessors. Users can deploy or self-host DeepSeek-V3.1 on the Northflank platform using either a one-click template or a manual setup, ensuring flexibility in deployment while avoiding rate limits with an OpenAI-compatible API. The model's open-weight design allows for secure, scalable deployment with cost-efficient pricing, making it one of the most capable open-weight language models available.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
LLM 1 3,922 600 189 -6%
Reinforcement learning 1 98 39 26 -36%