| 586 |
Uncensor any LLM with abliteration |
2024-06-13 |
| 415 |
Try Stable Diffusion's Img2Img Mode |
2022-08-29 |
| 323 |
MonadGPT – What would have happened if ChatGPT was invented in the 17th century? |
2023-11-24 |
| 252 |
LLM in a Flash: Efficient LLM Inference with Limited Memory |
2023-12-20 |
| 425 |
Llama-3.3-70B-Instruct |
2024-12-06 |
| 348 |
A Replacement for BERT |
2024-12-19 |
| 394 |
Open-R1: an open reproduction of DeepSeek-R1 |
2025-01-28 |
| 451 |
Deepseek R1-0528 |
2025-05-28 |
| 361 |
Nanonets-OCR-s – OCR model that transforms documents into structured markdown |
2025-06-16 |
| 388 |
Smollm3: Smol, multilingual, long-context reasoner LLM |
2025-07-08 |
| 319 |
Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS |
2025-09-02 |
| 262 |
The Smol Training Playbook: The Secrets to Building World-Class LLMs |
2025-10-30 |
| 978 |
DeepSeek-v3.2: Pushing the frontier of open large language models [pdf] |
2025-12-01 |
| 263 |
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning |
2025-12-01 |