Home / Companies / HuggingFace / Hacker News

HuggingFace on HN

29 posts with 25+ points in 2025

Filters
Year:
Posts by Month (29 total)
Hacker News Posts
Title Points Comments Date
DeepSeek-v3.2: Pushing the frontier of open large language models [pdf] 978 -- 2025-12-01
Deepseek R1-0528 451 -- 2025-05-28
Open-R1: an open reproduction of DeepSeek-R1 394 -- 2025-01-28
Smollm3: Smol, multilingual, long-context reasoner LLM 388 -- 2025-07-08
Nanonets-OCR-s – OCR model that transforms documents into structured markdown 361 -- 2025-06-16
Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS 319 -- 2025-09-02
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning 263 -- 2025-12-01
The Smol Training Playbook: The Secrets to Building World-Class LLMs 262 -- 2025-10-30
Kokoro WebGPU: Real-time text-to-speech 100% locally in the browser 227 -- 2025-02-07
Qwen3-4B-Thinking-2507 166 -- 2025-08-06
Qwen3-235B-A22B-Thinking-2507 152 -- 2025-07-25
Show HN: Penny-1.7B Irish Penny Journal style transfer 149 -- 2025-06-02
Qwen-Image-Layered: transparency and layer aware open diffusion model 130 -- 2025-12-19
Qwen3 30B-A3B 87 -- 2025-07-30
Voxtral-Mini-3B-2507 – Open source speech understanding model 64 -- 2025-07-15
Open-sourcing 5,000hrs of self-driving dataset 63 -- 2025-03-11
Qwen Image 54 -- 2025-08-04
Train faster static embedding models with sentence transformers 52 -- 2025-01-15
Show HN: ChatToSTL – AI text-to-CAD for 3D printing 52 -- 2025-06-12
Janus-Pro: Autoregressive framework unifying multimodal understanding&generation 49 -- 2025-01-27
DeepSeek-R1-Distill-Qwen-1.5B Surpasses GPT-4o in certain benchmarks 39 -- 2025-01-20
Fully autonomous AI agents should not be developed 38 -- 2025-02-07
Qwen3-235B-A22B-Instruct-2507 36 -- 2025-07-21
The Ultra-Scale Playbook: Training LLMs on GPU Clusters 33 -- 2025-02-19
Qwen3-Coder-30B-A3B-Instruct 32 -- 2025-07-31
Reachy Mini – The Open-Source Robot for Today's and Tomorrow's AI Builders 30 -- 2025-07-09
grok-2 on Hugging Face 27 -- 2025-08-23
DeepSeek-v3.1 26 -- 2025-08-21
DeepSeek-v3.1-Base 25 -- 2025-08-19