| 586 |
Uncensor any LLM with abliteration |
2024-06-13 |
| 240 |
Microsoft Phi-2 model changes licence to MIT |
2024-01-06 |
| 197 |
Space secrets leak disclosure |
2024-06-01 |
| 181 |
Best 7B LLM on leaderboards made by an amateur following a medium tutorial |
2024-01-05 |
| 168 |
Llama 3 8B is almost as good as Wizard 2 8x22B |
2024-04-19 |
| 167 |
Nvidia releases NVLM 1.0 72B open weight model |
2024-10-02 |
| 163 |
Explaining the SDXL Latent Space |
2024-02-05 |
| 152 |
Hugging Face and Google partner for AI collaboration |
2024-01-25 |
| 131 |
A CC-By Open-Source TTS Model with Voice Cloning |
2024-11-04 |
| 127 |
FineWeb: Decanting the web for the finest text data at scale |
2024-06-02 |
| 103 |
HuggingChat: Chat with Open Source Models |
2024-02-21 |
| 95 |
More than 80 AI models from Qualcomm |
2024-02-28 |
| 94 |
LLaMA-Pro-8B |
2024-01-06 |
| 82 |
Apple/OpenELM: Efficient Open-Source Family Language Models |
2024-04-24 |
| 75 |
YouTube-Commons: Audio transcripts of 2,063,066 YouTube videos, CC-By license |
2024-04-18 |
| 66 |
Show HN: Simply Reading Analog Gauges – GPT4, CogVLM Can't |
2024-01-22 |
| 58 |
MSFT's WizardLM2 models have been taken down |
2024-04-16 |
| 54 |
LiteLlama-460M-1T has 460M parameters trained with 1T tokens |
2024-01-07 |
| 52 |
Fine-Tuning LLMs to 1.58bit |
2024-09-18 |
| 51 |
LLaMA 3 70B Llamafiles |
2024-04-19 |
| 425 |
Llama-3.3-70B-Instruct |
2024-12-06 |
| 348 |
A Replacement for BERT |
2024-12-19 |
| 52 |
Train faster static embedding models with sentence transformers |
2025-01-15 |
| 394 |
Open-R1: an open reproduction of DeepSeek-R1 |
2025-01-28 |
| 227 |
Kokoro WebGPU: Real-time text-to-speech 100% locally in the browser |
2025-02-07 |
| 63 |
Open-sourcing 5,000hrs of self-driving dataset |
2025-03-11 |
| 451 |
Deepseek R1-0528 |
2025-05-28 |
| 149 |
Show HN: Penny-1.7B Irish Penny Journal style transfer |
2025-06-02 |
| 52 |
Show HN: ChatToSTL – AI text-to-CAD for 3D printing |
2025-06-12 |
| 361 |
Nanonets-OCR-s – OCR model that transforms documents into structured markdown |
2025-06-16 |
| 388 |
Smollm3: Smol, multilingual, long-context reasoner LLM |
2025-07-08 |
| 64 |
Voxtral-Mini-3B-2507 – Open source speech understanding model |
2025-07-15 |
| 152 |
Qwen3-235B-A22B-Thinking-2507 |
2025-07-25 |
| 166 |
Qwen3-4B-Thinking-2507 |
2025-08-06 |
| 319 |
Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS |
2025-09-02 |
| 87 |
Qwen3 30B-A3B |
2025-07-30 |
| 54 |
Qwen Image |
2025-08-04 |