Light
Home
/
Companies
/
HuggingFace
/
Hacker News
HuggingFace on HN
53 posts with 10+ points in 2025
Filters
Min points:
1
10
25
50
100
250
500
Year:
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
Posts by Month (53 total)
Hacker News Posts
Search:
Title
Points
Comments
Date
DeepSeek-v3.2: Pushing the frontier of open large language models [pdf]
978
--
2025-12-01
Deepseek R1-0528
451
--
2025-05-28
Open-R1: an open reproduction of DeepSeek-R1
394
--
2025-01-28
Smollm3: Smol, multilingual, long-context reasoner LLM
388
--
2025-07-08
Nanonets-OCR-s – OCR model that transforms documents into structured markdown
361
--
2025-06-16
Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS
319
--
2025-09-02
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
263
--
2025-12-01
The Smol Training Playbook: The Secrets to Building World-Class LLMs
262
--
2025-10-30
Kokoro WebGPU: Real-time text-to-speech 100% locally in the browser
227
--
2025-02-07
Qwen3-4B-Thinking-2507
166
--
2025-08-06
Qwen3-235B-A22B-Thinking-2507
152
--
2025-07-25
Show HN: Penny-1.7B Irish Penny Journal style transfer
149
--
2025-06-02
Qwen-Image-Layered: transparency and layer aware open diffusion model
130
--
2025-12-19
Qwen3 30B-A3B
87
--
2025-07-30
Voxtral-Mini-3B-2507 – Open source speech understanding model
64
--
2025-07-15
Open-sourcing 5,000hrs of self-driving dataset
63
--
2025-03-11
Qwen Image
54
--
2025-08-04
Train faster static embedding models with sentence transformers
52
--
2025-01-15
Show HN: ChatToSTL – AI text-to-CAD for 3D printing
52
--
2025-06-12
Janus-Pro: Autoregressive framework unifying multimodal understanding&generation
49
--
2025-01-27
DeepSeek-R1-Distill-Qwen-1.5B Surpasses GPT-4o in certain benchmarks
39
--
2025-01-20
Fully autonomous AI agents should not be developed
38
--
2025-02-07
Qwen3-235B-A22B-Instruct-2507
36
--
2025-07-21
The Ultra-Scale Playbook: Training LLMs on GPU Clusters
33
--
2025-02-19
Qwen3-Coder-30B-A3B-Instruct
32
--
2025-07-31
Reachy Mini – The Open-Source Robot for Today's and Tomorrow's AI Builders
30
--
2025-07-09
grok-2 on Hugging Face
27
--
2025-08-23
DeepSeek-v3.1
26
--
2025-08-21
DeepSeek-v3.1-Base
25
--
2025-08-19
Mistral Small 3.2 (24B-Instruct-2506)
23
--
2025-06-20
DeepSeek-v3.1
23
--
2025-08-19
Kyutai 1.6B Streaming TTS
21
--
2025-07-03
Qwen3 235B beats Claude on some code benchmarks
21
--
2025-07-21
Selene Mini: Open-sourced SOTA small language-model-as-a-judge
20
--
2025-01-29
The smallest VLM ever: 250M parameters
19
--
2025-01-23
Deepseek V3-0324
18
--
2025-03-24
DeepSeek R1
17
--
2025-01-20
Vector Search with DuckDB
17
--
2025-02-26
DiffuCoder-7B-CpGRPO: A code generation LLM developed by Apple
17
--
2025-07-04
Qwen3 0.6B now on HuggingFace (quantized)
16
--
2025-04-28
TeapotLLM- an open-source <1B model for hallucination-resistant Q&A on a CPU
14
--
2025-04-16
DeepSeek-Prover-V2-671B
14
--
2025-04-30
DeepSeek-R1-0528 performance improvements
14
--
2025-05-29
Co-Doodle with Gemini
13
--
2025-03-19
Open-source DeepResearch – Freeing our search agents
12
--
2025-02-04
FUTO open-sources 1M row keyboard swipe dataset
12
--
2025-04-04
smolagents: A simple library to build AI agents
11
--
2025-01-02
DeepSeek-TNG-R1T2-Chimera
11
--
2025-07-02
Phi-4 weights have been released under MIT license
10
--
2025-01-08
Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition
10
--
2025-04-23
Open Source 1.7tb Dataset of What AI Crawlers Are Doing
10
--
2025-07-03
Parquet Content-Defined Chunking
10
--
2025-09-09
Wan2.2-S2V-14B – audio-driven cinematic video generation model
10
--
2025-08-26