Home / Companies / HuggingFace / Hacker News

HuggingFace on HN

48 posts with 100+ points since 2022

Filters
Since:
Posts by Month (48 total)
Hacker News Posts
Title Points Comments Date
DeepSeek-v3.2: Pushing the frontier of open large language models [pdf] 978 -- 2025-12-01
Uncensor any LLM with abliteration 586 -- 2024-06-13
Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete 530 -- 2026-01-21
Deepseek R1-0528 451 -- 2025-05-28
Llama-3.3-70B-Instruct 425 -- 2024-12-06
Try Stable Diffusion's Img2Img Mode 415 -- 2022-08-29
Open-R1: an open reproduction of DeepSeek-R1 394 -- 2025-01-28
Smollm3: Smol, multilingual, long-context reasoner LLM 388 -- 2025-07-08
GLM-4.7-Flash 371 -- 2026-01-19
Nanonets-OCR-s – OCR model that transforms documents into structured markdown 361 -- 2025-06-16
A Replacement for BERT 348 -- 2024-12-19
MonadGPT – What would have happened if ChatGPT was invented in the … 323 -- 2023-11-24
Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS 319 -- 2025-09-02
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning 263 -- 2025-12-01
The Smol Training Playbook: The Secrets to Building World-Class LLMs 262 -- 2025-10-30
LLM in a Flash: Efficient LLM Inference with Limited Memory 252 -- 2023-12-20
Microsoft Phi-2 model changes licence to MIT 240 -- 2024-01-06
Falcon 180B 238 -- 2023-09-06
OpenLLaMA 13B Released 229 -- 2023-06-18
Kokoro WebGPU: Real-time text-to-speech 100% locally in the browser 227 -- 2025-02-07
Hugging Face Releases Agents 214 -- 2023-05-10
Space secrets leak disclosure 197 -- 2024-06-01
BigCode Project Releases StarCoder: A 15B Code LLM 185 -- 2023-05-04
Best 7B LLM on leaderboards made by an amateur following a medium … 181 -- 2024-01-05
Stability.ai sent a take down request to Runway ML's SD v1.5 citing … 179 -- 2022-10-20
We raised $100M for open and collaborative machine learning 175 -- 2022-05-09
Llama 3 8B is almost as good as Wizard 2 8x22B 168 -- 2024-04-19
SantaCoder: A new 1.1B code model for generation and infilling 168 -- 2022-12-22
Nvidia releases NVLM 1.0 72B open weight model 167 -- 2024-10-02
Qwen3-4B-Thinking-2507 166 -- 2025-08-06
StackLlama: A hands-on guide to train LlaMa with RLHF 165 -- 2023-04-06
Explaining the SDXL Latent Space 163 -- 2024-02-05
BLOOM: The largest open multilingual language model 160 -- 2022-07-12
Show HN: Text-to-video model from scratch (2 brothers, 2 years, 2B params) 156 -- 2026-01-22
Hugging Face and Google partner for AI collaboration 152 -- 2024-01-25
Qwen3-235B-A22B-Thinking-2507 152 -- 2025-07-25
Show HN: Penny-1.7B Irish Penny Journal style transfer 149 -- 2025-06-02
Wordalle – Guess the prompt used to generate a set of images … 137 -- 2022-07-01
Mistral-8x7B-Chat 131 -- 2023-12-10
A CC-By Open-Source TTS Model with Voice Cloning 131 -- 2024-11-04
Qwen-Image-Layered: transparency and layer aware open diffusion model 130 -- 2025-12-19
FineWeb: Decanting the web for the finest text data at scale 127 -- 2024-06-02
Yi-34B-Chat 115 -- 2023-11-24
GPT-3.5 and Wolfram Alpha via LangChain 107 -- 2023-01-18
The Falcon has landed in the Hugging Face ecosystem 105 -- 2023-06-05
HuggingChat: Chat with Open Source Models 103 -- 2024-02-21
Hugging Face and AWS partner to make AI more accessible 102 -- 2023-02-21
HuggingFace Training Cluster as a Service 101 -- 2023-09-05