HuggingFace Blog
186 posts indexed since 2022
Post Details
| Title | Author | Published | Words | HN Pts |
|---|---|---|---|---|
| Building the Open Agent Ecosystem Together: Introducing OpenEnv | Joseph Spisak, Davide Testuggine, Zach Wentz, Pierre Andrews, Sanyam Bhutani, Hamid Shojanazeri, Pankit Thapar, Emre Guven, Lewis Tunstall, and Vaibhav Srivastav | 2025-10-23 | 1,117 | -- |
| VibeGame: Exploring Vibe Coding Games | Dylan Ebert | 2025-09-29 | 1,777 | -- |
| Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard | Yauhen Babakhin, Radek Osmulski, Ronay Ak, Gabriel de Souza Pereira Moreira, and Mengyao Xu | 2025-10-21 | 706 | -- |
| Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes | Bryan Catanzaro and Jonathan Cohen | 2025-10-22 | 1,684 | -- |
| Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm | You Liang Tan, Fengyuan Hu, Oyindamola Omotuyi, Oluwaseun Doherty, Chitoku Yato, and Shane Reetz | 2025-06-11 | 1,902 | -- |
| Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than … | Solomatin Roman, Kenneth C. Enevoldsen, and Isaac Chung | 2025-10-20 | 2,320 | -- |
| huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning | Lucain Pouget, Célina Hanouti, Lysandre, and Julien Chaumond | 2025-10-27 | 2,139 | -- |
| Supercharge your OCR Pipelines with Open Models | merve, Aritra Roy Gosthipaty, Daniel van Strien, Hynek Kydlicek, Andres Marafioti, Vaibhav Srivastav, and Pedro Cuenca | 2025-10-21 | 3,544 | -- |
| Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for … | Prachi Mishra | 2025-10-28 | 921 | -- |
| Hugging Face and VirusTotal collaborate to strengthen AI security | Adrien Carreira and Bernardo Quintero | 2025-10-22 | 507 | -- |
| Voice Cloning with Consent | Margaret Mitchell and Lucie-Aimée Kaffee | 2025-10-28 | 1,394 | -- |
| Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with … | Jiqing.Feng, Matrix Yao, Ke Ding, and Ilyas Moutawwakil | 2025-10-16 | 1,374 | -- |
| Advancing Predictive ADMET Modeling Through Community-Driven Science: The ExpansionRx-OpenADMET Blind Challenge | Georgia Channing and Hugo MacDermott | 2025-10-27 | 943 | -- |
| Vision Tokens vs Text Tokens: Understanding the 10× Compression | Yi Cui | 2025-10-22 | 535 | -- |
| Projected Abliteration | Jim Lai | 2025-10-25 | 2,218 | -- |
| Streaming datasets: 100x More Efficient | Andres Marafioti, Quentin Lhoest, ben burtenshaw, Pedro Cuenca, and merve | 2025-10-27 | 1,306 | -- |
| Sentence Transformers is joining Hugging Face! | Tom Aarsen | 2025-10-22 | 1,011 | -- |
| Unlock the power of images with AI Sheets | Ame Vi, Daniel Vila, Francisco Aranda, Damián Pumar, Leandro von Werra, and Thomas Wolf | 2025-10-21 | 1,495 | -- |
| Get your VLM running in 3 simple steps on Intel CPUs | Ezequiel Lanza, Helena, Nikita, Ella Charlaix, and Ilyas Moutawwakil | 2025-10-15 | 1,479 | -- |
| Introducing RTEB: A New Standard for Retrieval Evaluation | Frank Liu, Kenneth C. Enevoldsen, Solomatin Roman, Isaac Chung, Tom Aarsen, and Fődi, Zoltán | 2025-10-01 | 2,833 | -- |
| Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac | Steven Palma and Andres Diaz-Pinto | 2025-10-29 | 1,115 | -- |
| Uncensor any LLM with abliteration | Maxime Labonne | 2024-06-13 | 3,144 | -- |
| GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms | Lina Bariah, Antonio De Domenico, Louis Powell, Mohamed Sana, Merouane Debbah, Mark Austin, Farbod Tavakkoli, George George, Nicola Piovesan, Simone Mangiante, cherrared, Sumeyye Bas, GHADA SOLIMAN, Dilara Zeynep Gurer, Laszlo Suto, and Pierre Wang | 2025-10-20 | 3,090 | -- |
| NVIDIA Isaac GR00T in LeRobot | lior ben horin, Kartik S, Aravindh Shan, Asawaree, and You Liang Tan | 2025-10-28 | 1,182 | -- |
| LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR | Said Taghadouini, Baptiste Aubertin, and Adrien Cavaillès | 2025-10-23 | 4,470 | -- |
| Granite 4.0 Nano: Just how small can you go? | Kate Soule and Rameswar Panda | 2025-10-28 | 544 | -- |
| Code a simple RAG from scratch | Xuan-Son Nguyen | 2024-10-29 | 2,933 | -- |
| How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA … | Asawaree | 2025-10-28 | 1,078 | -- |
| Can Your LLM Think Like a Professional? Introducing ProfBench | Zhilin Wang, Jaehun Jung, Ximing Lu, Shizhe Diao, jiaqiz, VivienneZhang, Nik Spirin, and Dong | 2025-10-28 | 1,337 | -- |
| 🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI | Maarten Van Segbroeck | 2025-10-28 | 988 | -- |
| SOTA OCR on-device with Core ML and dots.ocr | Christopher Fleetwood and Pedro Cuenca | 2025-10-02 | 1,910 | -- |
| Australian-made LLM beats OpenAI and Google at legal retrieval | Umar Butler, Abdur-Rahman Butler, and Adrian Lucas Malec | 2025-10-23 | 930 | -- |
| NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image … | Yao Xu, Timo Roman, Lukas Voegtle, Philipp Fischer, Amala Sanjay Deshmukh, Kateryna Chumachenko, and Jarno Seppänen | 2025-10-28 | 1,014 | -- |
| Promoter-GPT: Writing DNA Instructions with Language Models | Adele de Hoffer | 2025-10-22 | 3,509 | -- |
| LeRobot v0.4.0: Super Charging OSS Robotics Learning | Steven Palma, Michel Aractingi, Pepijn Kooijmans, Caroline Pascal, Jade Choghari, Francesco Capuano, Adil Zouitine, Martino Russi, and Thomas Wolf | 2025-10-24 | 1,980 | -- |
| KV Caching Explained: Optimizing Transformer Inference Efficiency | Hafedh Hichri | 2025-01-30 | 1,230 | -- |
| Why Did MiniMax M2 End Up as a Full Attention Model? | MiniMax | 2025-10-30 | 1,640 | -- |
| The World’s First and Best Speed Painting Software | 2025-10-29 | 1,368 | -- | |
| 3+ Years of ML & Society at Hugging Face 🤗🤝🧑🤝🧑 | Yacine Jernite, Giada Pistilli, Lucie-Aimée Kaffee, and Sasha Luccioni | 2025-10-29 | 807 | -- |
| Nemotron-Personas-USA: Synthesized Data for Sovereign AI | Will Jennings, Dane Corneil, and Yev Meyer | 2025-10-28 | 630 | -- |
| svara-TTS — Open Multilingual TTS for India’s Voices | Aditya Chhabra | 2025-10-27 | 1,626 | -- |
| What makes good reasoning data | MiniMax | 2025-10-30 | 629 | -- |
| On the Shifting Global Compute Landscape | Tiezhen WANG and Irene Solaiman | 2025-10-29 | 3,172 | -- |
| Aligning to What? Rethinking Agent Generalization in MiniMax M2 | MiniMax | 2025-10-30 | 1,103 | -- |
| Evaluate Your Own RAG: Why Best Practices Failed Us | Charles AZAM, Antoine Hoorelbeke, Antoine Guyot, Maxence Leclercq, and Jérémy PICOSSON | 2025-11-05 | 3,569 | -- |
| Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation | Exploding Gradients | 2025-09-16 | 3,586 | -- |
| DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge | Yihua Zhang | 2025-02-07 | 2,499 | -- |
| ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases | Quentin Macé, Antonio Loison, Antoine EDY, Victor Xing, and Gautier Viaud | 2025-11-05 | 2,524 | -- |
| Classement compar:IA : des votes des utilisateurs au classement participatif des modèles | compar:IA | 2025-11-03 | 1,821 | -- |
| Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness | Steven Zheng | 2025-11-05 | 1,120 | -- |
| Running Large Transformer Models on Mobile and Edge Devices | MtugrulKaya | 2025-11-03 | 6,026 | -- |
| TorchSim: A new PyTorch-based molecular dynamics engine | Davide Sarpa | 2025-10-31 | 3,592 | -- |
| The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix | Asankhaya Sharma | 2025-11-03 | 1,833 | -- |
| ⚡ Power, Heat, and Intelligence ☁️ - AI Data Centers Explained 🏭 | Boris Gamazaychikov and Sasha Luccioni | 2025-11-05 | 2,952 | -- |
| Small Language Models (SLM): A Comprehensive Overview | John Johnson | 2025-02-22 | 1,456 | -- |
| Toward Community-Governed Safety | Giada Pistilli and Lucie-Aimée Kaffee | 2025-11-03 | 681 | -- |
| From GRPO to DAPO and GSPO: What, Why, and How | Yihua Zhang | 2025-08-09 | 5,841 | -- |
| Budget Alignment: Making Models Reason in the User’s Language | Shan Chen, Jirui Qi, and Zidi Xiong | 2025-11-04 | 3,207 | -- |
| Introduction to State Space Models (SSM) | Loïck BOURDOIS | 2024-07-19 | 6,663 | -- |
| Let's talk about LLM evaluation | Clémentine Fourrier | 2024-05-23 | 3,264 | -- |
| Who Routes LLM Routers? RouterArena: Building the Evaluation Foundation for LLM Routing | Yifan Lu, Riksin, Jiayi Yuan, Bruce Cui, SJ Chang, Hongyi Liu, and Jiarong Xing | 2025-11-11 | 1,552 | -- |
| SYNTH: the new data frontier | Pierre-Carl Langlais | 2025-11-10 | 1,995 | -- |
| Effective Prompting for Generative Vision Models | Sara Han Díaz and Bertrand Charpentier | 2025-11-10 | 1,013 | -- |
| 🌳 QAT: The Art of Growing a Bonsai Model | Yi Cui | 2025-11-09 | 1,267 | -- |
| Norm-Preserving Biprojected Abliteration | Jim Lai | 2025-11-06 | 2,135 | -- |
| Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face | Daniel Voigt Godoy | 2025-02-11 | 3,900 | -- |
| Mastering Tensor Dimensions in Transformers | Hafedh Hichri | 2025-01-12 | 2,555 | -- |
| Text-to-image Architectural Experiments | David Bertoin, Jon Almazán, and Roman | 2025-11-13 | 3,525 | -- |
| Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level … | Tensor-Slayer | 2025-11-07 | 1,843 | -- |
| We’re open-sourcing our text-to-image model and the process behind it | Jon Almazán, David Bertoin, and Roman | 2025-11-12 | 1,110 | -- |
| Building for an Open Future - our new partnership with Google Cloud | Jeff Boudier and Simon Pagezy | 2025-11-13 | 869 | -- |
| Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach | Pere Martra | 2024-11-24 | 3,670 | -- |
| ⛳ Optimizer: What Does It Do and Why We Need It | Yi Cui | 2025-11-12 | 1,313 | -- |
| To Think or Not to Think: A Router for Hybrid LLMs | Amir Mohseni | 2025-11-16 | 2,137 | -- |
| The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs | Xiaoran Liu (SII) | 2025-11-15 | 1,834 | -- |
| The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling | Elaine McVey Houskeeper and Georgia Channing | 2025-11-18 | 1,662 | -- |
| Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models | Torsten Scholak, Oleksiy Ostapenko, Raymond Li, Luke Kumar, and Joel Lamy-Poirier | 2025-11-19 | 1,709 | -- |
| Easily Build and Share ROCm Kernels with Hugging Face | Abdennacer Badaoui, Daniel Huang, colorswind, and Zesen Liu | 2025-11-17 | 3,120 | -- |
| Join the AMD Open Robotics Hackathon | Eric Ma and Guruprasad MP | 2025-11-13 | 506 | -- |
| PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs | Samuel Lima Braz | 2025-01-24 | 8,770 | -- |
| AI Model Optimization More Flexible Than Ever | Johanna Sommer, Sara Han Díaz, and Bertrand Charpentier | 2025-11-17 | 725 | -- |
| Visualizing How VLMs Work | Hafedh Hichri and Ed Daniels | 2025-10-07 | 1,851 | -- |
| 🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset | Cornelius Wolff, Daniel Gomm, and Madelon Hulsebos | 2025-11-19 | 944 | -- |
| Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms | Mattt | 2025-11-20 | 1,326 | -- |
| Introducing Cogito v2.1 | Deep Cogito Team | 2025-11-19 | 1,067 | -- |
| Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks | Eric Bezzam, Steven Zheng, Eustache Le Bihan, and Vaibhav Srivastav | 2025-11-21 | 936 | -- |
| 20x Faster TRL Fine-tuning with RapidFire AI | Kamran Bigdely, Arun Kumar, and Quentin Gallouédec | 2025-11-21 | 1,198 | -- |
| How to make NeuTTS-air generate over 200 seconds of audio in a … | Yatharth Sharma | 2025-11-21 | 792 | -- |
| Building Deep Research: How we Achieved State of the Art | Michael Griff, Dean Sacoransky, and Noah Nefsky | 2025-11-24 | 1,628 | -- |
| OVHcloud on Hugging Face Inference Providers 🔥 | Gilles Closset, Fabien Ric, and Elias Tourneux | 2025-11-24 | 788 | -- |
| Prefill and Decode for Concurrent Requests - Optimizing LLM Performance | Benjamin Merkel | 2025-04-16 | 2,165 | -- |
| Announcing the LLM Open Finance models | Raheel Qader, Gaëtan Caillaut, Jingshu, Mariam Nakhle, Arezki SADOUNE, MASSINISSA AHMIM, and Jean-Gabriel BARTHELEMY | 2025-11-24 | 601 | -- |
| DeLERP: Decomposed Linear Interpolation for Model Merging | Jim Lai | 2025-11-20 | 1,364 | -- |
| How MCP Blockly Makes MCP Server Creation Accessible for Everyone | Owen Kaplinsky | 2025-11-28 | 952 | -- |
| Curating datasets directly on the Hub | Daniel Vila | 2025-11-27 | 504 | -- |
| 10 Best Open-Source LLM Models (2025 Updated): Llama 4, Qwen 3 and … | Daya Shankar | 2025-11-13 | 2,419 | -- |
| Gemini-3 Benchmarkathon | Robert Scholz, Slimane Alaoui Soulimani Valenti, Ernest Beta, Odysseas S. Chlapanis, Adhithya kiran, Matteo Bürgler, Sophie Franco, Chu Fei Luo, Prof. Samuel Dahan, and Joel Niklaus | 2025-11-28 | 4,648 | -- |
| Building Jobly: Semantic Job Matching with RAG and Vector Embeddings | Valentina Nieddu and Giacomo Bandini | 2025-11-28 | 1,878 | -- |
| Continuous batching | Rémi Ouazan Reboul, Arthur Zucker, and Luc Georges | 2025-11-25 | 3,970 | -- |
| Welcome FLUX.2 - BFL’s new open image generation model 🤗 | YiYi Xu, Daniel Gu, Sayak Paul, Alvaro Somoza, Dhruv Nair, Aritra Roy Gosthipaty, Linoy Tsaban, and Apolinário from multimodal AI art | 2025-11-25 | 3,460 | -- |
| A Guide to Hugging Face’s Papers Page | Adina Yakefu | 2025-11-25 | 973 | -- |
| makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch | Avinash Sooriyarachchi | 2024-05-07 | 3,812 | -- |
| Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications | Traian Rebedea, Shyamala Prayaga, Makesh Sreedhar, Chris Parisien, and Isabel Hulseman | 2025-12-02 | 1,648 | -- |
| Transformers v5: Simple model definitions powering the AI ecosystem | Lysandre, Arthur Zucker, Cyril Vallez, and Vaibhav Srivastav | 2025-12-01 | 2,250 | -- |
| Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO … | Yihua Zhang | 2025-02-11 | 18,441 | -- |
| Building and evaluating Multimodal Rerankers | Ulrick BLE | 2025-11-30 | 4,201 | -- |
| An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs | Subash SN, Akshay Nambiar, Patrik Lambert, Milan Gritta, and Amril Nurman | 2025-12-01 | 4,604 | -- |
| 📌 Rethinking Multimodality from an Industry Perspective: Captioning Is Far More Important … | Bohan Zhai and Shijia Yang | 2025-11-29 | 3,816 | -- |
| SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution | Solène Debuysère, Nicolas Trouvé, and Georgia Channing | 2025-12-01 | 1,551 | -- |
| Bringing Math to Life: Building StepWise Math for the MCP Hackathon | Vikas Gupta | 2025-11-27 | 948 | -- |
| Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement | Asankhaya Sharma | 2025-12-03 | 2,075 | -- |
| We Got Claude to Fine-Tune an Open Source LLM | ben burtenshaw and shaun smith | 2025-12-04 | 2,016 | -- |
| BERTs that chat: turn any BERT into a chatbot with dLLM | Zhanhui Zhou and Lingjie Chen | 2025-11-28 | 943 | -- |
| Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand | Quentin Gallouédec | 2025-12-04 | 1,219 | -- |
| AI Energy Score v2: Refreshed Leaderboard, now with Reasoning 🧠 | Sasha Luccioni and Boris Gamazaychikov | 2025-12-04 | 1,496 | -- |
| Introducing swift-huggingface: The Complete Swift Client for Hugging Face | Mattt | 2025-12-05 | 1,524 | -- |
| DeepFabric: Generate, Train and Evaluate with Datasets curated for Model Behavior Training. | Luke Hinds | 2025-12-04 | 3,284 | -- |
| TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval | Özay Ezerceli, Mahmud ElHuseyni 🇵🇸, SELVA TAŞ, Reyhan Bayraktar, Betül Terzioğlu, Yusuf Çelebi, Yağız Asker, and nmmursit | 2025-12-04 | 3,173 | -- |
| Engineering Notes: Training a LoRA for Z-Image Turbo with the Ostris AI … | Shawn | 2025-12-02 | 1,280 | -- |
| DeepMath: A lightweight math reasoning Agent with SmolAgents | Daniel Fleischer, Moshe Berchansky, and Moshe Wasserblat | 2025-12-04 | 1,123 | -- |
| Making Model Tuning Accessible: This is what we built observing 100s of … | Mehant, Yashasvi Chaurasia, Ashok Pon Kumar, and Praveen Jayachandran | 2025-12-05 | 1,821 | -- |
| A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: … | Yihua Zhang | 2025-02-04 | 7,388 | -- |
| Muon vs MuonClip vs Muon+AdamW for Fine-Tuning | Nishith Jain | 2025-12-09 | 705 | -- |
| How We Use Claude Code Skills to Run 1,000+ ML Experiments a … | Sigrid Jin | 2025-12-08 | 4,707 | -- |
| New in llama.cpp: Model Management | Xuan-Son Nguyen and Victor Mustar | 2025-12-11 | 740 | -- |
| Build Hallucination-Free RAG with Verbatim | Adam Kovacs | 2025-11-18 | 2,281 | -- |
| I Built a RAG System That Listens to Live BBC News and … | Rakshit Aralimatti | 2025-12-09 | 907 | -- |
| Make and publish your Reachy Mini App | Antoine Pirrone and Rouanet | 2025-12-03 | 1,081 | -- |
| Why You Should Care About Partial Differential Equations (PDEs) | Aishwarya Balaji, BryanBradfo, Jose Manuel Nápoles, Prateik Sinha, and Roey Ben Chaim | 2025-12-12 | 1,761 | -- |
| MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier | Surya Kant Sahu and Jaipal Singh | 2025-12-12 | 2,144 | -- |
| Diffusion Language Models: The New Paradigm | Pro Creations | 2025-06-10 | 1,644 | -- |
| Codex is Open Sourcing AI models | ben burtenshaw and shaun smith | 2025-12-11 | 2,426 | -- |
| Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance | Sathwik Tejaswi Madhusudhan, Sagar Davasam, and Torsten Scholak | 2025-12-09 | 1,908 | -- |
| CUGA on Hugging Face: Democratizing Configurable AI Agents | Jim Laredo, Avi Yaeli, Sami Marreed, AYHAN SEBIN, and Merve Unuvar | 2025-12-15 | 1,058 | -- |
| Topic 23: What is LLM Inference, it's challenges and solutions for it | Ksenia Se | 2025-01-17 | 1,511 | -- |
| Phare LLM benchmark V2: Reasoning models don't guarantee better security | Pierre Le Jeune, David Berenstein, Matteo, and Weixuan Xiao | 2025-12-16 | 2,631 | -- |
| Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation | kelseye.xh and Zhongjie Duan | 2025-12-16 | 1,416 | -- |
| The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator | Seph Mard, Isabel Hulseman, Besmira Nushi, Piotr Januszewski, Grzegorz Chlebus, VivienneZhang, Wojciech Prazuch, Pablo Ribalta, Nik Spirin, and Ferenc Galko | 2025-12-17 | 2,102 | -- |
| Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent … | Chris Alexiuk, Shashank Verma, Chintan, Chris Wing, and Gordana Neskovic | 2025-12-15 | 2,382 | -- |
| Everything You Need to Know about Knowledge Distillation | Ksenia Se and Alyona Vert | 2025-03-06 | 3,517 | -- |
| EuroLLM-22B | EuroLLM Team, Miguel Moura Ramos, Duarte Alves, and Hippolyte Gisserot-Boukhlef | 2025-12-14 | 1,162 | -- |
| Gotchas in Tokenizer Behavior Every Developer Should Know | Quentin Gallouédec | 2025-04-18 | 2,659 | -- |
| What is the Hugging Face Community Building? | Avijit Ghosh, Yacine Jernite, and Irene Solaiman | 2025-07-15 | 1,377 | -- |
| Open Collaboration in Action: Inside the Open Safeguard Hackathon | Andrew Chang, juliet shen, and Yacine Jernite | 2025-12-18 | 1,248 | -- |
| cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use … | Francesco Bonacci and Dillon DuPont | 2025-12-16 | 1,086 | -- |
| Spinning Up a CPU-Only Micro-LLM with LoRA for Literary Style | Kashif Salahuddin | 2025-12-16 | 1,000 | -- |
| Announcing LiteCoder-Terminal: Lightweight Terminal Agents with <1k Synthesized Trajectories | LiteCoder | 2025-12-18 | 677 | -- |
| Tokenization in Transformers v5: Simpler, Clearer, and More Modular | Ita Zaporozhets, Aritra Roy Gosthipaty, Arthur Zucker, Sergio Paniego, merve, and Pedro Cuenca | 2025-12-18 | 3,024 | -- |
| Shadow AI - Where are the CIOs? | Jeff Boudier | 2025-12-19 | 616 | -- |
| LLM based TTS models | Yatharth Sharma | 2025-12-18 | 871 | -- |
| AI Labs Must Resist Age Verification | Adam Molnar and Noah Weinberger | 2025-12-17 | 2,593 | -- |
| 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About … | Ksenia Se | 2025-03-17 | 4,266 | -- |
| Backbone-Optimizer Coupling Bias: The Hidden Co-Design Principle | Juanxi Tian | 2025-12-20 | 5,279 | -- |
| Encoding the World's Medical Knowledge into 970K | David Mezzetti | 2025-12-22 | 934 | -- |
| Skill is All You Need: Lessons from Building Marketing Agents at Noumena | liuzeming, Arcobalneo, HUANLIN LUO, wubin, Huan Zhao, Lee, and Noumena-AI | 2025-12-25 | 2,334 | -- |
| AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems | Jaykumar Kasundra | 2025-12-23 | 2,080 | -- |
| Understanding InstaFlow/Rectified Flow | Isamu Isozaki | 2023-10-06 | 1,802 | -- |
| Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries | KuKu | 2025-12-22 | 1,274 | -- |
| Decoding Strategies in Large Language Models | Maxime Labonne | 2024-10-29 | 4,166 | -- |
| The Optimal Architecture for Small Language Models | Asankhaya Sharma | 2025-12-26 | 2,348 | -- |
| Deriving the PPO Loss from First Principles | aayush garg | 2025-12-25 | 12,448 | -- |
| Continuity as a First-Class System Property in Artificial Intelligence | Jeremy Felps | 2025-12-30 | 1,462 | -- |
| System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience | Asankhaya Sharma | 2025-06-02 | 1,027 | -- |
| Deriving the DPO Loss from First Principles | aayush garg | 2025-12-30 | 7,331 | -- |
| Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B … | weitaofeng | 2026-01-01 | 1,778 | -- |
| OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve | Asankhaya Sharma | 2025-05-20 | 1,959 | -- |
| Building Conversational AI: A Deep Dive into Voice Agent Architectures and Best … | abdeljalil_elma | 2025-09-02 | 1,854 | -- |
| We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for … | Clem 🤗, Steve Nguyen, and Jeremy Laville | 2025-07-08 | 593 | -- |
| Create Mixtures of Experts with MergeKit | Maxime Labonne | 2024-03-28 | 2,007 | -- |
| The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on … | Yağız Çalık | 2026-01-02 | 5,072 | -- |
| What are Embeddings and Vector Databases? | Damien B | 2024-08-20 | 1,392 | -- |
| Introduction to Quantization cooked in 🤗 with 💗🧑🍳 | merve | 2023-08-25 | 1,372 | -- |
| Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture | Basma Boussaha, Mohammed Alyafeai, Ahmed Alzubaidi, Leen AlQadi, Shaikha Alsuwaidi, Omar saif alkaabi, Hamza Alobeidli, and Hakim Hacid | 2026-01-05 | 1,838 | -- |
| TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell | Konstantin | 2026-01-05 | 3,309 | -- |
| Introducing Falcon H1R 7B | Iheb Chaabane, Puneesh Khanna, Suhail M Shah, Slim Frikha, Shi Hu, Abdalgader Abubaker, Reda alami, Mike Lubinets, Mohamed El Amine Seddik, and Hakim Hacid | 2026-01-05 | 1,332 | -- |
| Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem | Marco Pavone | 2026-01-05 | 893 | -- |
| Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models | Ashish Chadha | 2026-01-03 | 2,023 | -- |
| NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI | Tsung-Yi Lin and Debraj Sinha | 2026-01-05 | 1,037 | -- |
| Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR | Kunal Dhawan, Adi- margolin, Gordana Neskovic, Maryam Motamedi, and Yasmina Benkhoui | 2026-01-05 | 1,860 | -- |
| NVIDIA brings agents to life with DGX Spark and Reachy Mini | Jeff Boudier, Nader Khalil, and Alec Fong | 2026-01-05 | 2,128 | -- |
| M2.1: Multilingual and Multi-Task Coding with Strong Generalization | MiniMax | 2026-01-05 | 2,306 | -- |
| Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot | Raffaello Bonghi, lior ben horin, Kartik S, and Kalyan Vadrevu | 2026-01-05 | 1,038 | -- |
| Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval … | Ronay Ak, Gabriel de Souza Pereira Moreira, and Bo Liu | 2026-01-06 | 1,492 | -- |
| OpenMed: Six Months of Open-Source Medical AI and the Road Ahead | Maziyar Panahi | 2026-01-06 | 2,424 | -- |
| Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads | MiniMax | 2026-01-06 | 736 | -- |
| Diversity Vs Density: A data strategy comparison for fine-tuning VLMs | Akhil Theerthala | 2026-01-06 | 2,301 | -- |