| 586 |
Uncensor any LLM with abliteration |
2024-06-13 |
| 323 |
MonadGPT – What would have happened if ChatGPT was invented in the 17th century? |
2023-11-24 |
| 252 |
LLM in a Flash: Efficient LLM Inference with Limited Memory |
2023-12-20 |
| 425 |
Llama-3.3-70B-Instruct |
2024-12-06 |
| 348 |
A Replacement for BERT |
2024-12-19 |
| 394 |
Open-R1: an open reproduction of DeepSeek-R1 |
2025-01-28 |
| 451 |
Deepseek R1-0528 |
2025-05-28 |
| 361 |
Nanonets-OCR-s – OCR model that transforms documents into structured markdown |
2025-06-16 |
| 388 |
Smollm3: Smol, multilingual, long-context reasoner LLM |
2025-07-08 |
| 319 |
Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS |
2025-09-02 |