Hugging Face Blog

Blog URL

huggingface.co/blog

Posts YTD

432 ↑ vs 37 last year

Avg Posts/Month

0.0 since 2022

Monthly Post Volume

Start year: 2023 2024 2025 2026

Post Details

Search:

Title	Author	Published	Words	HN Pts
Building the Open Agent Ecosystem Together: Introducing OpenEnv	Joseph Spisak, Davide Testuggine, Zach Wentz, Pierre Andrews, Sanyam Bhutani, Hamid Shojanazeri, Pankit Thapar, Emre Guven, Lewis Tunstall, and Vaibhav Srivastav	2025-10-23	1,117	--
VibeGame: Exploring Vibe Coding Games	Dylan Ebert	2025-09-29	1,777	--
Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard	Yauhen Babakhin, Radek Osmulski, Ronay Ak, Gabriel de Souza Pereira Moreira, and Mengyao Xu	2025-10-21	706	--
Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes	Bryan Catanzaro and Jonathan Cohen	2025-10-22	1,684	--
Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm	You Liang Tan, Fengyuan Hu, Oyindamola Omotuyi, Oluwaseun Doherty, Chitoku Yato, and Shane Reetz	2025-06-11	1,902	--
Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than …	Solomatin Roman, Kenneth C. Enevoldsen, and Isaac Chung	2025-10-20	2,320	--
huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning	Lucain Pouget, Célina Hanouti, Lysandre, and Julien Chaumond	2025-10-27	2,139	--
Supercharge your OCR Pipelines with Open Models	merve, Aritra Roy Gosthipaty, Daniel van Strien, Hynek Kydlicek, Andres Marafioti, Vaibhav Srivastav, and Pedro Cuenca	2025-10-21	3,544	--
Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for …	Prachi Mishra	2025-10-28	921	--
Hugging Face and VirusTotal collaborate to strengthen AI security	Adrien Carreira and Bernardo Quintero	2025-10-22	507	--
Voice Cloning with Consent	Margaret Mitchell and Lucie-Aimée Kaffee	2025-10-28	1,394	--
Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with …	Jiqing.Feng, Matrix Yao, Ke Ding, and Ilyas Moutawwakil	2025-10-16	1,374	--
Advancing Predictive ADMET Modeling Through Community-Driven Science: The ExpansionRx-OpenADMET Blind Challenge	Georgia Channing and Hugo MacDermott	2025-10-27	943	--
Vision Tokens vs Text Tokens: Understanding the 10× Compression	Yi Cui	2025-10-22	535	--
Projected Abliteration	Jim Lai	2025-10-25	2,218	--
Streaming datasets: 100x More Efficient	Andres Marafioti, Quentin Lhoest, ben burtenshaw, Pedro Cuenca, and merve	2025-10-27	1,306	--
Sentence Transformers is joining Hugging Face!	Tom Aarsen	2025-10-22	1,011	--
Unlock the power of images with AI Sheets	Ame Vi, Daniel Vila, Francisco Aranda, Damián Pumar, Leandro von Werra, and Thomas Wolf	2025-10-21	1,495	--
Get your VLM running in 3 simple steps on Intel CPUs	Ezequiel Lanza, Helena, Nikita, Ella Charlaix, and Ilyas Moutawwakil	2025-10-15	1,479	--
Introducing RTEB: A New Standard for Retrieval Evaluation	Frank Liu, Kenneth C. Enevoldsen, Solomatin Roman, Isaac Chung, Tom Aarsen, and Fődi, Zoltán	2025-10-01	2,833	--
Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac	Steven Palma and Andres Diaz-Pinto	2025-10-29	1,115	--
Uncensor any LLM with abliteration	Maxime Labonne	2024-06-13	3,144	--
GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms	Lina Bariah, Antonio De Domenico, Louis Powell, Mohamed Sana, Merouane Debbah, Mark Austin, Farbod Tavakkoli, George George, Nicola Piovesan, Simone Mangiante, cherrared, Sumeyye Bas, GHADA SOLIMAN, Dilara Zeynep Gurer, Laszlo Suto, and Pierre Wang	2025-10-20	3,090	--
NVIDIA Isaac GR00T in LeRobot	lior ben horin, Kartik S, Aravindh Shan, Asawaree, and You Liang Tan	2025-10-28	1,182	--
LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR	Said Taghadouini, Baptiste Aubertin, and Adrien Cavaillès	2025-10-23	4,470	--
Granite 4.0 Nano: Just how small can you go?	Kate Soule and Rameswar Panda	2025-10-28	544	--
Code a simple RAG from scratch	Xuan-Son Nguyen	2024-10-29	2,933	--
How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA …	Asawaree	2025-10-28	1,078	--
Can Your LLM Think Like a Professional? Introducing ProfBench	Zhilin Wang, Jaehun Jung, Ximing Lu, Shizhe Diao, jiaqiz, VivienneZhang, Nik Spirin, and Dong	2025-10-28	1,337	--
🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI	Maarten Van Segbroeck	2025-10-28	988	--
SOTA OCR on-device with Core ML and dots.ocr	Christopher Fleetwood and Pedro Cuenca	2025-10-02	1,910	--
Australian-made LLM beats OpenAI and Google at legal retrieval	Umar Butler, Abdur-Rahman Butler, and Adrian Lucas Malec	2025-10-23	930	--
NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image …	Yao Xu, Timo Roman, Lukas Voegtle, Philipp Fischer, Amala Sanjay Deshmukh, Kateryna Chumachenko, and Jarno Seppänen	2025-10-28	1,014	--
Promoter-GPT: Writing DNA Instructions with Language Models	Adele de Hoffer	2025-10-22	3,509	--
LeRobot v0.4.0: Super Charging OSS Robotics Learning	Steven Palma, Michel Aractingi, Pepijn Kooijmans, Caroline Pascal, Jade Choghari, Francesco Capuano, Adil Zouitine, Martino Russi, and Thomas Wolf	2025-10-24	1,980	--
KV Caching Explained: Optimizing Transformer Inference Efficiency	Hafedh Hichri	2025-01-30	1,230	--
Why Did MiniMax M2 End Up as a Full Attention Model?	MiniMax	2025-10-30	1,640	--
The World’s First and Best Speed Painting Software	xing	2025-10-29	1,368	--
3+ Years of ML & Society at Hugging Face 🤗🤝🧑‍🤝‍🧑	Yacine Jernite, Giada Pistilli, Lucie-Aimée Kaffee, and Sasha Luccioni	2025-10-29	807	--
Nemotron-Personas-USA: Synthesized Data for Sovereign AI	Will Jennings, Dane Corneil, and Yev Meyer	2025-10-28	630	--
svara-TTS — Open Multilingual TTS for India’s Voices	Aditya Chhabra	2025-10-27	1,626	--
What makes good reasoning data	MiniMax	2025-10-30	629	--
On the Shifting Global Compute Landscape	Tiezhen WANG and Irene Solaiman	2025-10-29	3,172	--
Aligning to What? Rethinking Agent Generalization in MiniMax M2	MiniMax	2025-10-30	1,103	--
Evaluate Your Own RAG: Why Best Practices Failed Us	Charles AZAM, Antoine Hoorelbeke, Antoine Guyot, Maxence Leclercq, and Jérémy PICOSSON	2025-11-05	3,569	--
Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation	Exploding Gradients	2025-09-16	3,586	--
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge	Yihua Zhang	2025-02-07	2,499	--
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases	Quentin Macé, Antonio Loison, Antoine EDY, Victor Xing, and Gautier Viaud	2025-11-05	2,524	--
Classement compar:IA : des votes des utilisateurs au classement participatif des modèles	compar:IA	2025-11-03	1,821	--
Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness	Steven Zheng	2025-11-05	1,120	--
Running Large Transformer Models on Mobile and Edge Devices	MtugrulKaya	2025-11-03	6,026	--
TorchSim: A new PyTorch-based molecular dynamics engine	Davide Sarpa	2025-10-31	3,592	--
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix	Asankhaya Sharma	2025-11-03	1,833	--
⚡ Power, Heat, and Intelligence ☁️ - AI Data Centers Explained 🏭	Boris Gamazaychikov and Sasha Luccioni	2025-11-05	2,952	--
Small Language Models (SLM): A Comprehensive Overview	John Johnson	2025-02-22	1,456	--
Toward Community-Governed Safety	Giada Pistilli and Lucie-Aimée Kaffee	2025-11-03	681	--
From GRPO to DAPO and GSPO: What, Why, and How	Yihua Zhang	2025-08-09	5,841	--
Budget Alignment: Making Models Reason in the User’s Language	Shan Chen, Jirui Qi, and Zidi Xiong	2025-11-04	3,207	--
Introduction to State Space Models (SSM)	Loïck BOURDOIS	2024-07-19	6,663	--
Let's talk about LLM evaluation	Clémentine Fourrier	2024-05-23	3,264	--
Who Routes LLM Routers? RouterArena: Building the Evaluation Foundation for LLM Routing	Yifan Lu, Riksin, Jiayi Yuan, Bruce Cui, SJ Chang, Hongyi Liu, and Jiarong Xing	2025-11-11	1,552	--
SYNTH: the new data frontier	Pierre-Carl Langlais	2025-11-10	1,995	--
Effective Prompting for Generative Vision Models	Sara Han Díaz and Bertrand Charpentier	2025-11-10	1,013	--
🌳 QAT: The Art of Growing a Bonsai Model	Yi Cui	2025-11-09	1,267	--
Norm-Preserving Biprojected Abliteration	Jim Lai	2025-11-06	2,135	--
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face	Daniel Voigt Godoy	2025-02-11	3,900	--
Mastering Tensor Dimensions in Transformers	Hafedh Hichri	2025-01-12	2,555	--
Text-to-image Architectural Experiments	David Bertoin, Jon Almazán, and Roman	2025-11-13	3,525	--
Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level …	Tensor-Slayer	2025-11-07	1,843	--
We’re open-sourcing our text-to-image model and the process behind it	Jon Almazán, David Bertoin, and Roman	2025-11-12	1,110	--
Building for an Open Future - our new partnership with Google Cloud	Jeff Boudier and Simon Pagezy	2025-11-13	869	--
Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach	Pere Martra	2024-11-24	3,670	--
⛳ Optimizer: What Does It Do and Why We Need It	Yi Cui	2025-11-12	1,313	--
To Think or Not to Think: A Router for Hybrid LLMs	Amir Mohseni	2025-11-16	2,137	--
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs	Xiaoran Liu (SII)	2025-11-15	1,834	--
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling	Elaine McVey Houskeeper and Georgia Channing	2025-11-18	1,662	--
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models	Torsten Scholak, Oleksiy Ostapenko, Raymond Li, Luke Kumar, and Joel Lamy-Poirier	2025-11-19	1,709	--
Easily Build and Share ROCm Kernels with Hugging Face	Abdennacer Badaoui, Daniel Huang, colorswind, and Zesen Liu	2025-11-17	3,120	--
Join the AMD Open Robotics Hackathon	Eric Ma and Guruprasad MP	2025-11-13	506	--
PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs	Samuel Lima Braz	2025-01-24	8,770	--
AI Model Optimization More Flexible Than Ever	Johanna Sommer, Sara Han Díaz, and Bertrand Charpentier	2025-11-17	725	--
Visualizing How VLMs Work	Hafedh Hichri and Ed Daniels	2025-10-07	1,851	--
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset	Cornelius Wolff, Daniel Gomm, and Madelon Hulsebos	2025-11-19	944	--
Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms	Mattt	2025-11-20	1,326	--
Introducing Cogito v2.1	Deep Cogito Team	2025-11-19	1,067	--
Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks	Eric Bezzam, Steven Zheng, Eustache Le Bihan, and Vaibhav Srivastav	2025-11-21	936	--
20x Faster TRL Fine-tuning with RapidFire AI	Kamran Bigdely, Arun Kumar, and Quentin Gallouédec	2025-11-21	1,198	--
How to make NeuTTS-air generate over 200 seconds of audio in a …	Yatharth Sharma	2025-11-21	792	--
Building Deep Research: How we Achieved State of the Art	Michael Griff, Dean Sacoransky, and Noah Nefsky	2025-11-24	1,628	--
OVHcloud on Hugging Face Inference Providers 🔥	Gilles Closset, Fabien Ric, and Elias Tourneux	2025-11-24	788	--
Prefill and Decode for Concurrent Requests - Optimizing LLM Performance	Benjamin Merkel	2025-04-16	2,165	--
Announcing the LLM Open Finance models	Raheel Qader, Gaëtan Caillaut, Jingshu, Mariam Nakhle, Arezki SADOUNE, MASSINISSA AHMIM, and Jean-Gabriel BARTHELEMY	2025-11-24	601	--
DeLERP: Decomposed Linear Interpolation for Model Merging	Jim Lai	2025-11-20	1,364	--
How MCP Blockly Makes MCP Server Creation Accessible for Everyone	Owen Kaplinsky	2025-11-28	952	--
Curating datasets directly on the Hub	Daniel Vila	2025-11-27	504	--
10 Best Open-Source LLM Models (2025 Updated): Llama 4, Qwen 3 and …	Daya Shankar	2025-11-13	2,419	--
Gemini-3 Benchmarkathon	Robert Scholz, Slimane Alaoui Soulimani Valenti, Ernest Beta, Odysseas S. Chlapanis, Adhithya kiran, Matteo Bürgler, Sophie Franco, Chu Fei Luo, Prof. Samuel Dahan, and Joel Niklaus	2025-11-28	4,648	--
Building Jobly: Semantic Job Matching with RAG and Vector Embeddings	Valentina Nieddu and Giacomo Bandini	2025-11-28	1,878	--
Continuous batching	Rémi Ouazan Reboul, Arthur Zucker, and Luc Georges	2025-11-25	3,970	--
Welcome FLUX.2 - BFL’s new open image generation model 🤗	YiYi Xu, Daniel Gu, Sayak Paul, Alvaro Somoza, Dhruv Nair, Aritra Roy Gosthipaty, Linoy Tsaban, and Apolinário from multimodal AI art	2025-11-25	3,460	--
A Guide to Hugging Face’s Papers Page	Adina Yakefu	2025-11-25	973	--
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch	Avinash Sooriyarachchi	2024-05-07	3,812	--
Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications	Traian Rebedea, Shyamala Prayaga, Makesh Sreedhar, Chris Parisien, and Isabel Hulseman	2025-12-02	1,648	--
Transformers v5: Simple model definitions powering the AI ecosystem	Lysandre, Arthur Zucker, Cyril Vallez, and Vaibhav Srivastav	2025-12-01	2,250	--
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO …	Yihua Zhang	2025-02-11	18,441	--
Building and evaluating Multimodal Rerankers	Ulrick BLE	2025-11-30	4,201	--
An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs	Subash SN, Akshay Nambiar, Patrik Lambert, Milan Gritta, and Amril Nurman	2025-12-01	4,604	--
📌 Rethinking Multimodality from an Industry Perspective: Captioning Is Far More Important …	Bohan Zhai and Shijia Yang	2025-11-29	3,816	--
SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution	Solène Debuysère, Nicolas Trouvé, and Georgia Channing	2025-12-01	1,551	--
Bringing Math to Life: Building StepWise Math for the MCP Hackathon	Vikas Gupta	2025-11-27	948	--
Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement	Asankhaya Sharma	2025-12-03	2,075	--
We Got Claude to Fine-Tune an Open Source LLM	ben burtenshaw and shaun smith	2025-12-04	2,016	--
BERTs that chat: turn any BERT into a chatbot with dLLM	Zhanhui Zhou and Lingjie Chen	2025-11-28	943	--
Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand	Quentin Gallouédec	2025-12-04	1,219	--
AI Energy Score v2: Refreshed Leaderboard, now with Reasoning 🧠	Sasha Luccioni and Boris Gamazaychikov	2025-12-04	1,496	--
Introducing swift-huggingface: The Complete Swift Client for Hugging Face	Mattt	2025-12-05	1,524	--
DeepFabric: Generate, Train and Evaluate with Datasets curated for Model Behavior Training.	Luke Hinds	2025-12-04	3,284	--
TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval	Özay Ezerceli, Mahmud ElHuseyni 🇵🇸, SELVA TAŞ, Reyhan Bayraktar, Betül Terzioğlu, Yusuf Çelebi, Yağız Asker, and nmmursit	2025-12-04	3,173	--
Engineering Notes: Training a LoRA for Z-Image Turbo with the Ostris AI …	Shawn	2025-12-02	1,280	--
DeepMath: A lightweight math reasoning Agent with SmolAgents	Daniel Fleischer, Moshe Berchansky, and Moshe Wasserblat	2025-12-04	1,123	--
Making Model Tuning Accessible: This is what we built observing 100s of …	Mehant, Yashasvi Chaurasia, Ashok Pon Kumar, and Praveen Jayachandran	2025-12-05	1,821	--
A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: …	Yihua Zhang	2025-02-04	7,388	--
Muon vs MuonClip vs Muon+AdamW for Fine-Tuning	Nishith Jain	2025-12-09	705	--
How We Use Claude Code Skills to Run 1,000+ ML Experiments a …	Sigrid Jin	2025-12-08	4,707	--
New in llama.cpp: Model Management	Xuan-Son Nguyen and Victor Mustar	2025-12-11	740	--
Build Hallucination-Free RAG with Verbatim	Adam Kovacs	2025-11-18	2,281	--
I Built a RAG System That Listens to Live BBC News and …	Rakshit Aralimatti	2025-12-09	907	--
Make and publish your Reachy Mini App	Antoine Pirrone and Rouanet	2025-12-03	1,081	--
Why You Should Care About Partial Differential Equations (PDEs)	Aishwarya Balaji, BryanBradfo, Jose Manuel Nápoles, Prateik Sinha, and Roey Ben Chaim	2025-12-12	1,761	--
MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier	Surya Kant Sahu and Jaipal Singh	2025-12-12	2,144	--
Diffusion Language Models: The New Paradigm	Pro Creations	2025-06-10	1,644	--
Codex is Open Sourcing AI models	ben burtenshaw and shaun smith	2025-12-11	2,426	--
Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance	Sathwik Tejaswi Madhusudhan, Sagar Davasam, and Torsten Scholak	2025-12-09	1,908	--
CUGA on Hugging Face: Democratizing Configurable AI Agents	Jim Laredo, Avi Yaeli, Sami Marreed, AYHAN SEBIN, and Merve Unuvar	2025-12-15	1,058	--
Topic 23: What is LLM Inference, it's challenges and solutions for it	Ksenia Se	2025-01-17	1,511	--
Phare LLM benchmark V2: Reasoning models don't guarantee better security	Pierre Le Jeune, David Berenstein, Matteo, and Weixuan Xiao	2025-12-16	2,631	--
Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation	kelseye.xh and Zhongjie Duan	2025-12-16	1,416	--
The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator	Seph Mard, Isabel Hulseman, Besmira Nushi, Piotr Januszewski, Grzegorz Chlebus, VivienneZhang, Wojciech Prazuch, Pablo Ribalta, Nik Spirin, and Ferenc Galko	2025-12-17	2,102	--
Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent …	Chris Alexiuk, Shashank Verma, Chintan, Chris Wing, and Gordana Neskovic	2025-12-15	2,382	--
Everything You Need to Know about Knowledge Distillation	Ksenia Se and Alyona Vert	2025-03-06	3,517	--
EuroLLM-22B	EuroLLM Team, Miguel Moura Ramos, Duarte Alves, and Hippolyte Gisserot-Boukhlef	2025-12-14	1,162	--
Gotchas in Tokenizer Behavior Every Developer Should Know	Quentin Gallouédec	2025-04-18	2,659	--
What is the Hugging Face Community Building?	Avijit Ghosh, Yacine Jernite, and Irene Solaiman	2025-07-15	1,377	--
Open Collaboration in Action: Inside the Open Safeguard Hackathon	Andrew Chang, juliet shen, and Yacine Jernite	2025-12-18	1,248	--
cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use …	Francesco Bonacci and Dillon DuPont	2025-12-16	1,086	--
Spinning Up a CPU-Only Micro-LLM with LoRA for Literary Style	Kashif Salahuddin	2025-12-16	1,000	--
Announcing LiteCoder-Terminal: Lightweight Terminal Agents with <1k Synthesized Trajectories	LiteCoder	2025-12-18	677	--
Tokenization in Transformers v5: Simpler, Clearer, and More Modular	Ita Zaporozhets, Aritra Roy Gosthipaty, Arthur Zucker, Sergio Paniego, merve, and Pedro Cuenca	2025-12-18	3,024	--
Shadow AI - Where are the CIOs?	Jeff Boudier	2025-12-19	616	--
LLM based TTS models	Yatharth Sharma	2025-12-18	871	--
AI Labs Must Resist Age Verification	Adam Molnar and Noah Weinberger	2025-12-17	2,593	--
🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About …	Ksenia Se	2025-03-17	4,266	--
Backbone-Optimizer Coupling Bias: The Hidden Co-Design Principle	Juanxi Tian	2025-12-20	5,279	--
Encoding the World's Medical Knowledge into 970K	David Mezzetti	2025-12-22	934	--
Skill is All You Need: Lessons from Building Marketing Agents at Noumena	liuzeming, Arcobalneo, HUANLIN LUO, wubin, Huan Zhao, Lee, and Noumena-AI	2025-12-25	2,334	--
AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems	Jaykumar Kasundra	2025-12-23	2,080	--
Understanding InstaFlow/Rectified Flow	Isamu Isozaki	2023-10-06	1,802	--
Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries	KuKu	2025-12-22	1,274	--
Decoding Strategies in Large Language Models	Maxime Labonne	2024-10-29	4,166	--
The Optimal Architecture for Small Language Models	Asankhaya Sharma	2025-12-26	2,348	--
Deriving the PPO Loss from First Principles	aayush garg	2025-12-25	12,448	--
Continuity as a First-Class System Property in Artificial Intelligence	Jeremy Felps	2025-12-30	1,462	--
System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience	Asankhaya Sharma	2025-06-02	1,027	--
Deriving the DPO Loss from First Principles	aayush garg	2025-12-30	7,331	--
Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B …	weitaofeng	2026-01-01	1,778	--
OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve	Asankhaya Sharma	2025-05-20	1,959	--
Building Conversational AI: A Deep Dive into Voice Agent Architectures and Best …	abdeljalil_elma	2025-09-02	1,854	--
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for …	Clem 🤗, Steve Nguyen, and Jeremy Laville	2025-07-08	593	--
Create Mixtures of Experts with MergeKit	Maxime Labonne	2024-03-28	2,007	--
The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on …	Yağız Çalık	2026-01-02	5,072	--
What are Embeddings and Vector Databases?	Damien B	2024-08-20	1,392	--
Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳	merve	2023-08-25	1,372	--
Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture	Basma Boussaha, Mohammed Alyafeai, Ahmed Alzubaidi, Leen AlQadi, Shaikha Alsuwaidi, Omar saif alkaabi, Hamza Alobeidli, and Hakim Hacid	2026-01-05	1,838	--
TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell	Konstantin	2026-01-05	3,309	--
Introducing Falcon H1R 7B	Iheb Chaabane, Puneesh Khanna, Suhail M Shah, Slim Frikha, Shi Hu, Abdalgader Abubaker, Reda alami, Mike Lubinets, Mohamed El Amine Seddik, and Hakim Hacid	2026-01-05	1,332	--
Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem	Marco Pavone	2026-01-05	893	--
Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models	Ashish Chadha	2026-01-03	2,023	--
NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI	Tsung-Yi Lin and Debraj Sinha	2026-01-05	1,037	--
Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR	Kunal Dhawan, Adi- margolin, Gordana Neskovic, Maryam Motamedi, and Yasmina Benkhoui	2026-01-05	1,860	--
NVIDIA brings agents to life with DGX Spark and Reachy Mini	Jeff Boudier, Nader Khalil, and Alec Fong	2026-01-05	2,128	--
M2.1: Multilingual and Multi-Task Coding with Strong Generalization	MiniMax	2026-01-05	2,306	--
Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot	Raffaello Bonghi, lior ben horin, Kartik S, and Kalyan Vadrevu	2026-01-05	1,038	--
Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval …	Ronay Ak, Gabriel de Souza Pereira Moreira, and Bo Liu	2026-01-06	1,492	--
OpenMed: Six Months of Open-Source Medical AI and the Road Ahead	Maziyar Panahi	2026-01-06	2,424	--
Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads	MiniMax	2026-01-06	736	--
Diversity Vs Density: A data strategy comparison for fine-tuning VLMs	Akhil Theerthala	2026-01-06	2,301	--
🥃 Distilling Tiny Embeddings	David Mezzetti	2026-01-10	1,082	--
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, …	Sherry Chen	2025-09-30	3,271	--
Introducing OptiMind, a research model designed for optimization	Anson Ho, Sirui Li, and Ishai Menache	2026-01-15	395	--
From Image-to-LoRA to In-Context Edit	kelseye.xh and Zhongjie Duan	2025-12-29	936	--
Common AI Model Formats	Xuan-Son Nguyen	2025-02-27	2,109	--
How We Built a Semantic Highlight Model To Save Token Cost for …	Cheney Zhang and Jiang Chen	2026-01-15	2,344	--
Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments	Bingyang Ye and Shan Chen	2026-01-13	2,717	--
Open Responses: What you need to know	shaun smith, ben burtenshaw, merve, and Pedro Cuenca	2026-01-15	1,344	--
Beyond Brute Force: Why LoongFlow is the “Thinking” Evolution of OpenEvolve	Xunan Dai	2026-01-16	1,108	--
ColPali: Efficient Document Retrieval with Vision Language Models 👀	Manuel Faysse	2024-07-05	1,399	--
SmolLM-Smashed: Tiny Giants, Optimized for Speed	David Berenstein	2026-01-13	982	--
VLM-OCR Recipes on GPU Infrastructure	Florent Gbelidji	2026-01-15	2,281	--
The Large Language Model Course	Maxime Labonne	2025-01-16	4,256	--
Reviewer Two (but it's an OpenEnv)	Chris von Csefalvay	2026-01-13	1,653	--
Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments	ben burtenshaw	2026-01-20	1,158	--
LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family	Said Taghadouini, Adrien Cavaillès, and Baptiste Aubertin	2026-01-19	934	--
Differential Transformer V2	Li Dong	2026-01-20	3,136	--
🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models	Fanny Jourdan and Antonin Poché	2026-01-20	2,112	--
New in llama.cpp: Anthropic Messages API	Xuan-Son Nguyen and Victor Mustar	2026-01-19	541	--
One Year Since the “DeepSeek Moment”	Adina Yakefu and Irene Solaiman	2026-01-20	1,617	--
Optimizing GLM4-MoE for Production: 65% Faster TTFT with SGLang	Novita AI	2026-01-22	1,047	--
Security, Governance and Performance for Dell On-Prem AI Builders	Balachandran Rajendran, Juan Julián, Alvaro Bartolome, Enrique Hernández Calabrés, Simon Pagezy, and Jeff Boudier	2026-01-21	1,064	--
RexRerankers: SOTA Rankers for Product Discovery and AI Assistants	Rahul Bajaj, Anuj Garg, and Jaya Nupur	2026-01-24	3,704	--
Challenges of Synthetic Dataset Generation	Rishiraj Acharya	2026-01-21	942	--
Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models	Asankhaya Sharma	2026-01-23	1,825	--
AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality	Dhaval Patel, James Rayfield, Saumya Ahuja, Chathurangi Shyalika, Shuxin Lin, and Zhou	2026-01-21	1,505	--
“DeepSeek R1 时刻” 一周年	vansin	2026-01-20	315	--
Benchmark Smarter: Tailor Your Model Evaluation Suite with EvalScope	kelseye.xh	2026-01-22	1,973	--
Waypoint-1: Real-time Interactive Video Diffusion from Overworld	Andrew Lapp, Louis Castricato, Scott Fox, Shahbuland Matiana, and David Rossi	2026-01-20	853	--
A Beginner-Friendly PyTorch Tutorial: Build and Train Your First Model	Daniel Voigt Godoy	2025-01-20	8,215	--
Why Your AI Strategy Needs Hugging Face Storage	Adrian Lepers	2026-01-26	1,008	--
NVIDIA Earth-2 Open Models Span the Whole Weather Stack	Mike Pritchard, Jaideep Pathak, Jean Kossaifi, and Aayush Gupta	2026-01-26	736	--
Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs	Omar saif alkaabi, Ahmed Alzubaidi, Hamza Alobeidli, Shaikha Alsuwaidi, Mohammed Alyafeai, Leen AlQadi, Basma Boussaha, and Hakim Hacid	2026-01-27	1,585	--
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective	Jason Zhu, Hejian Sang, Arup De, Rohit Jain, and Yanning Chen	2026-01-27	4,160	--
Friends and Grandmothers in Silico	Itay Yona	2026-01-24	4,089	--
Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.	Maziyar Panahi	2025-07-16	2,205	--
Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek	Adina Yakefu and Irene Solaiman	2026-01-27	1,324	--
Nemotron-Personas-Brazil: Co-Designed Data for Sovereign AI	Andre Manoel, Yev Meyer, Shyamala Prayaga, Will Jennings, and bardiya sadeghi	2026-01-28	903	--
The Great Classification Showdown: OSS vs BERT on Consumer Hardware	Ben Toussaint	2026-01-26	1,938	--
We got Claude to teach open models how to write CUDA kernels!	ben burtenshaw, shaun smith, merve, and Pedro Cuenca	2026-01-28	2,350	--
Slashing torch.compile Warmup & LoRA Swapping Times with Pruna	John Rachwan, Johanna Sommer, Bertrand Charpentier, and Sara Han Díaz	2026-01-28	1,513	--
Nemotron-Personas-Singapore: Co-Designed Data for Sovereign AI	Will Jennings, Dane Corneil, Yev Meyer, Verdi March, Shyamala Prayaga, and bardiya sadeghi	2026-01-27	1,041	--
TruthTensor: LLM Evalution in Prediction Markets Under Drift and Market Baseline	Elena Pashkova, shirin Shahabi, Hudson, and Ronald Chan	2026-01-29	1,631	--
Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp	Doctor Shotgun and Geechan	2026-01-30	2,508	--
Introducing NVIDIA Cosmos Policy for Advanced Robot Control	Pranjali Joshi, Tsung-Yi Lin, Jinwei Gu, and Prachi Mishra	2026-01-29	1,333	--
Introducing Daggr: Chain apps programmatically, inspect visually	merve, yuvraj sharma, Abubakar Abid, hysts, and Pedro Cuenca	2026-01-29	1,559	--
MamayLM, передова мовна модель для української мови	Hanna Yukhymenko, Anton Alexandrov, and Martin Vechev	2025-04-23	1,941	--
Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 …	Alvaro Moran	2026-02-02	2,906	--
Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance	Jun Zhang, Jason Zheng, Boxi Cao, and ReasoningLens	2026-02-03	693	--
Training Design for Text-to-Image Models: Lessons from Ablations	David Bertoin, Roman Frigg, and Jon Almazán	2026-02-03	7,420	--
The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+	Adina Yakefu and Irene Solaiman	2026-02-03	1,602	--
H Company's new Holo2 model takes the lead in UI Localization	Ramzi De Coster, Hamza Benchekroun, and Aurélien Lac	2026-02-03	214	--
Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s …	Ronay Ak and Gabriel de Souza Pereira Moreira	2026-02-04	1,048	--
Nvidia Agentic Smart Router on Dell Enterprise Hub : Deepdive on Architecture,Design …	Khushboo Rathi and Balachandran Rajendran	2026-02-03	995	--
Getting Started With Hugging Face in 10 Minutes	Vladislav Guzey	2025-03-10	1,514	--
CRAFT: Continuous Reasoning and Agentic Feedback Tuning	Valentin, Denis Timonin, Alexandr, and Alexey	2026-02-05	813	--
Introducing SyGra Studio	Surajit Dasgupta, Bidyapati Pradhan, Amit Kumar Saha, Vipul Mittal, and Sriram Puttagunta	2026-02-05	747	--
🚀 SyGra V2.0.0	Sriram Puttagunta, Surajit Dasgupta, Bidyapati Pradhan, Amit Kumar Saha, and Vipul Mittal	2026-02-05	724	--
Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth	Maxime Labonne	2024-07-29	2,923	--
From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails …	Maziyar Panahi	2026-02-07	5,766	--
Transformers.js v4 Preview: Now Available on NPM!	Joshua and Nico Martin	2026-02-09	1,185	--
Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning	D K	2024-02-20	1,793	--
Training Qwen3 VL to label bbox : synthetic data, environment and training …	Ulrick BLE	2026-02-09	2,544	--
🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs	Guanchu	2026-02-11	616	--
Building a Mood-Based Movie Recommendation Engine with Voyage-4-nano, Hugging Face, and MongoDB …	Arkadiusz Borucki	2026-02-08	3,315	--
Enabling Large Scale RLHF of GPTOSS with Megatron backend in VeRL	LEI WANG	2026-02-10	5,934	--
Why SGLang is a Game-Changer for LLM Workflows	Makwana Paresh	2025-07-07	1,639	--
OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments	Christian Washington, Ankit Jasuja, Santosh Sah, Lewis Tunstall, and ben burtenshaw	2026-02-12	1,656	--
LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search …	Antoine Chaffin and Raphael	2026-02-12	4,993	--
Transformers	Esmail Atta Gumaan	2024-07-02	2,730	--
Forge: Scalable Agent RL Framework and Algorithm	MiniMax, Hyn, zhi zhang, Jiayuan Song, Da Chen, xkc, Yaoyao, kennyKK, and zpysky1125	2026-02-13	3,387	--
How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs …	Aritra Roy Gosthipaty	2026-02-12	606	--
Custom Kernels for All from Codex and Claude	ben burtenshaw, Sayak Paul, Aritra Roy Gosthipaty, and shaun smith	2026-02-13	1,792	--
Model2Vec: Distill a Small Fast Model from any Sentence Transformer	Thomas van Dongen and Stéphan Tulkens	2024-10-14	2,441	--
What superpower does Kimi-K2.5 bring to the table?	Leco Li	2026-02-13	1,154	--
Unbelievable! Run 70B LLM Inference on a Single 4GB GPU with This …	Gavin Li	2023-11-30	1,279	--
The Chinese GLM-5 Model Now Ranks #2 in Arabic Language Performance	Karim Ouda	2026-02-16	322	--
Compute and Competition in AI: Different FlOPs for Different Folks	Yacine Jernite and Sasha Luccioni	2026-02-12	1,917	--
How to Build a Benchmark with a Private Test Set on Hugging …	Georgia Channing	2026-02-16	1,775	--
Qwen3.5: Nobody Agrees on Attention Anymore	Maxime Labonne	2026-02-17	1,192	--
NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル	Atsunori Fujita, Kotaro Yamamoto, Masaya Ogushi, Vincent Gong, Ameya Sunil Mahabaleshwarkar, and Yoshi Suhara	2026-02-17	297	--
DenseR: Dense Rewards For Free in LLM Reasoning	Hritik Bansal	2026-02-18	3,977	--
De-mystifying Multimodal Learning: Enabiling Vision in Language Models	Matteo Nulli	2026-02-17	2,797	--
One-Shot Any Web App with Gradio's gr.HTML	yuvraj sharma, hysts, and Freddy Boulton	2026-02-18	829	--
Gemma3NPC - A Solution for Live NPC Interactions	Hexi Wang and Keegan Carey	2025-08-14	5,954	--
RynnEC: Bringing MLLMs into Embodied World	Ronghao Dang, YuqianYuan, yunxuan mao, Kehan Li, jiangpin, zhikai wang, and Xin Li	2025-08-14	1,382	--
IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and …	Ayhan Sebin, Rohan Arora, and Saurabh Jha	2026-02-18	2,253	--
Did GPT 5.2 make a breakthrough discovery in theoretical physics?	David Louapre	2026-02-19	4,541	--
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?	Antoine Chaffin, Luca Arnaboldi, Amélie Chatelain, and Florent Krzakala	2026-02-19	2,306	--
「データ不足」の壁を越える：合成ペルソナが日本のAI開発を加速	Atsunori Fujita, Masaya Ogushi, Will Jennings, Yev Meyer, Kotaro Yamamoto, Yoshi Suhara, Vincent Gong, and Dane Corneil	2026-02-19	280	--
I Let a Lobster Run My Jetson: What OpenClaw Taught Me About …	Andres Marafioti	2026-02-19	1,509	--
Train AI models with Unsloth and Hugging Face Jobs for FREE	ben burtenshaw, Daniel (Unsloth), Michael Han, Maxime Labonne, Daniel van Strien, and shaun smith	2026-02-20	944	--
Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)	Aritra Roy Gosthipaty	2025-01-19	4,342	--
GGML and llama.cpp join HF to ensure the long-term progress of Local …	Georgi Gerganov, Xuan-Son Nguyen, Aleksander Grygier, Lysandre, Victor Mustar, and Julien Chaumond	2026-02-20	936	--
Introducing Legal RAG Bench	Umar Butler and Abdur-Rahman Butler	2026-02-20	3,235	--
FINAL Bench: The Real Bottleneck to AGI Is Self-Correction	VIDRAFT_LAB	2026-02-21	1,146	--
How We Learned to Talk to Machines	Tyler Williams	2026-02-20	1,156	--
Kimi K2.5: Still Worth It After Two Weeks?	Maxime Labonne	2026-02-23	1,448	--
Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?	VIDRAFT_LAB	2026-02-24	2,770	--
Follow the White Rabbit: Using Embeddings So You Never Get Lost in …	David Corvoysier	2026-02-23	1,420	--
MAEB: Evaluating Audio Embeddings at Scale	Adnan El Assadi, Solomatin Roman, Kenneth C. Enevoldsen, and Isaac Chung	2026-02-24	1,349	--
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and …	Karina Zadorozhny	2026-01-19	7,738	--
Deploying Open Source Vision Language Models (VLM) on Jetson	Mitesh Patel, Johnny Nuñez Cano, and Raymond Lo	2026-02-24	1,591	--
GEM Image: Building an AI That Actually Gets Educational Diagrams Right	AIPrep	2026-02-21	966	--
Mixture of Experts (MoEs) in Transformers	Aritra Roy Gosthipaty, Pedro Cuenca, merve, Ilyas Moutawwakil, Arthur Zucker, Sergio Paniego, and Pablo Montalvo	2026-02-26	2,054	--
Your MoE Model Does Not Have to Select Fixed Number of Experts	Tong Zhu, Xuyang Hu, Xiaoye Qu, Guanjie Chen, and Yu Cheng	2026-02-26	4,405	--
Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?	Yichen Feng, Yuetai Li, Chunjiang Liu, Yue Huang, Zhengqing Yuan, Fengqing Jiang, Zichen Chen, and Zhangchen Xu	2026-02-25	1,792	--
Bringing Autonomous Driving RL to OpenEnv and TRL	Sergio Paniego	2026-02-26	1,814	--
A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3	Quentin Macé, Gabriel de Souza Pereira Moreira, Antoine EDY, Radek Osmulski, and Bo Liu	2026-02-27	1,886	--
Create, Evaluate, and Connect AI Skills \| SkillNet: A Large-Scale Agentic "Skill …	Yuan Liang, Ningyu Zhang, and Xu Ziwen	2026-02-28	2,039	--
构建、评估与连接 AI 技能 \| SkillNet：大规模智能体“技能图谱”知识库	Yuan Liang, Ningyu Zhang, and Xu Ziwen	2026-02-28	370	--
Getting More from Your Test-Time Compute Budget with Portfolio Beam Search	Dan Elbaz, Oren Salzman, Oren Pereg, Daniel Korat, and Ronen Laperdon	2026-02-24	3,527	--
easytranscriber: Speech Recognition with Accurate Timestamps in the HF Ecosystem	Faton Rekathati	2026-03-03	1,169	--
The ML Engineer's Guide to Protein AI	Maziyar Panahi	2026-03-03	3,612	--
PRX Part 3 — Training a Text-to-Image Model in 24h!	David Bertoin, Roman Frigg, and Jon Almazán	2026-03-03	1,732	--
Introducing Kanon 2 Enricher — the world’s first hierarchical graphitization model	Umar Butler and Abdur-Rahman Butler	2026-03-03	1,571	--
AI Coding Assistants Keep Shipping Vulnerable Code -- Here's What We're Doing …	Scott Thornton	2026-02-26	371	--
LLM Architectures Explained: What Powers Today’s Top Models	Sara Han Díaz and Bertrand Charpentier	2026-03-04	1,628	--
TiRex on the Edge	Robert Weber, Christian Ganhör, and Lukas Fischer	2026-03-05	506	--
Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device …	Gaetan Bahl	2026-03-05	1,851	--
Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines	YiYi Xu, Alvaro Somoza, Dhruv Nair, and Sayak Paul	2026-03-05	1,907	--
NEO-unify: Building Native Multimodal Unified Models End to End	Haiwen Diao, Lewei Lu, and Ziwei Liu	2026-03-05	623	--
Building Tucano 2: Open-Source Language Models That Actually Think in Portuguese	Nicholas Kluge Corrêa, Aniket Sen, Shiza Fatimah, Sophia Falk, and Lucie Flek	2026-03-05	2,258	--
De-mystifying Multimodal Learning: The Hidden Inefficiency in Vision Language Modelling	Matteo Nulli	2026-03-04	2,120	--
Konkani LLM: Bringing a Multi-Script Low-Resource Language to the AI Era	Reuben fernandes	2026-03-07	861	--
Structural Problems in AI Benchmarking and the Case for a Unified Evaluation …	VIDRAFT_LAB	2026-03-08	1,171	--
MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning	VIDRAFT_LAB	2026-03-09	1,663	--
LeRobot v0.5.0: Scaling Every Dimension	Steven Palma, Pepijn Kooijmans, Jade Choghari, Caroline Pascal, Khalil Meftah, Martino Russi, Nicolas Rabault, Michel Aractingi, Virgile BATTO, and Thomas Wolf	2026-03-09	1,931	--
Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge	George Saon and Madison Lee	2026-03-09	385	--
Ulysses Sequence Parallelism: Training with Million-Token Contexts	Kashif Rasul and Stas Bekman	2026-03-09	3,003	--
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries	Amine Dirhoussi, Quentin Gallouédec, Kashif Rasul, Lewis Tunstall, Edward Beeching, Albert Villanova del Moral, Nouamane Tazi, and Leandro von Werra	2026-03-10	9,358	--
Kanon 2 Reranker: the most powerful reranker for legal RAG	Umar Butler and Abdur-Rahman Butler	2026-03-10	471	--
How NVIDIA Builds Open Data for AI	Will Jennings, Yev Meyer, Leanna Chraghchian, Rebecca Kao, Jane Polak Scowcroft, and Annie Surla	2026-03-10	1,590	--
🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language …	VIDRAFT_LAB	2026-03-10	2,482	--
Introducing Storage Buckets on the Hugging Face Hub	Lucain Pouget, Eliott Coyac, Adrien Carreira, Victor Mustar, Julien Chaumond, Quentin Lhoest, Pierric Cistac, Sylvestre Bcht, Hugo Larcher, Rajat Arya, Di Xiao, and Assaf Vayner	2026-03-10	1,591	--
ShopRLVE-GYM: Adaptive Verifiable Environments for E-Commerce Conversational Agents	Rahul Bajaj and Jaya Nupur	2026-03-08	4,976	--
Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds	Joseph Jennings and Brandon Norick	2026-03-11	710	--
Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens	Asankhaya Sharma	2026-03-06	4,656	--
How NVIDIA AI-Q Reached #1 on DeepResearch Bench I and II	David Austin	2026-03-12	1,749	--
Build an Agent That Thinks Like a Data Scientist: How We Hit …	Jiwei Liu, Maximilian Jeblick, and Jack Yu	2026-03-13	2,052	--
Arabic TTS Arena: Ranking Voice Models the Way Chess Ranks Grandmasters	Mohamed Rashad	2026-03-12	1,698	--
Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline	Radek Osmulski, Reza Esfandiarpoor, Yauhen Babakhin, Gabriel de Souza Pereira Moreira, and Bo Liu	2026-03-13	1,520	--
Pruna 0.3.2: More OSS Algos, More Ways to Optimize	Minette Kaunismäki, Begüm Çığ, Gaspar Rochette, Sara Han Díaz, and Bertrand Charpentier	2026-03-11	922	--
SILMA TTS: A Lightweight Open Bilingual Text to Speech Model	Karim Ouda	2026-03-15	524	--
The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare …	Sean Huver, Nigel Nelson, Lukas Zbinden, and Mostafa Toloui	2026-03-16	865	--
Tokenization is Killing our Multilingual LLM Dream	Omar Kamali	2026-03-15	3,383	--
Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, …	Marco Pavone	2026-03-16	1,259	--
Holotron-12B - High Throughput Computer Use Agent	Pierre-Louis Cedoz, Hamza Benchekroun, Aurélien Lac, delfosse, Tony Wu, Mats L. Richter, Antoine Bonnet, Kai Yuan, Aleix Cambray (H-AI), and Alexandra	2026-03-17	868	--
Super Analyzer: Combining Reasoning and Coding Capabilities to Improve Code Performance	Girish Ganesan and Balachandran Rajendran	2026-03-13	1,363	--
Efficient LLM Pretraining: Packed Sequences and Masked Attention	Lukas	2024-10-07	1,906	--
LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric	Subash SN, Akshay Nambiar, Milan Gritta, Zhen Cong Chen, Arsalan Anwari, Gianfranco Cordella, and Amril Nurman	2026-03-17	3,124	--
Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI	Vinay Raman, Ameya Sunil Mahabaleshwarkar, Hayley Ross, Bilal Kartal, Aditya Malte, Zijia Chen, Ali Taghibakhshi, Sharath Turuvekere Sreenivas, Saurav Muralidharan, Khalil Ben Khaled, Nima Tajbakhsh, Pavlo Molchanov, Oluwatobi Olabiyi, and Yoshi Suhara	2026-03-17	1,552	--
State of Open Source on Hugging Face: Spring 2026	Avijit Ghosh, Lucie-Aimée Kaffee, Yacine Jernite, and Irene Solaiman	2026-03-17	2,883	--
Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding	Talor Abramovich, Maor Ashkenazi, Izzy Putterman, Benjamin Chislett, Tiyasa Mitra, Bita Rouhani, Ran Zilberstein, and Yonatan Geifman	2026-03-19	2,333	--
ATE-2: State-of-the-Art Armenian Text Embeddings and the ArmBench-TextEmbed Benchmark	Hrant Davtyan, Zaruhi Navasardyan, Spartak Bughdaryan, and bag_min	2026-03-19	438	--
What's New in Mellea 0.4.0 + Granite Libraries Release	Abraham Daniels	2026-03-20	469	--
Build a Domain-Specific Embedding Model in Under a Day	Steve H, Rucha Apte, Sean Sodha, and Oliver Holworthy	2026-03-20	2,729	--
Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to …	Yev Meyer and Dane Corneil	2025-06-10	588	--
Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic …	Yunus Cukran	2026-03-21	986	--
NanoVDR: A 70M Text-Only Model That Retrieves Visual Documents as Well as …	Zhuchenyang Liu	2026-03-16	1,493	--
Pocket Models for iOS: Explore On-Device AI with GGUF Models, Data Memory, …	Hamit Hasanhocaoglu, Arda Dogantemur, Metecan Duyal, and StJohn Deakins	2026-03-18	1,270	--
Introducing AI chunking to semchunk	Umar Butler and Abdur-Rahman Butler	2026-03-23	2,228	--
Canada Must Not Turn AI Chatbots Into a New Surveillance Frontier	Noah Weinberger	2026-03-16	1,934	--
A New Framework for Evaluating Voice Agents (EVA)	Tara Bogavelli, Gabrielle Gauthier Melancon, Katrina Stankiewicz, Nifemi Bamgbose, Hoang Nguyen, Raghav Mehndiratta, Hari Subramani, and Fanny Riols	2026-03-24	2,147	--
SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation	Maziyar Panahi, merve, Jamie@Doubleword, Josh, Seb Ringrose, and Fergus Finn	2026-03-23	3,730	--
Introducing Cohere-transcribe: state-of-the-art speech recognition	Julian Mack, Ekagra Ranjan, Walter Beller-Morales, Bharat venkitesh, and Pierre Richemond	2026-03-26	1,485	--
G2P Shrinks Speech Models	Hexgrad	2025-02-05	1,562	--
Strand-Rust-Coder-v1: Rust Coding Model Fine-Tuned on Peer-Ranked Synthetic Data	Aleksei Ivashov, Vladyslav Larin, Vishesh Tripathi, and Ivan Nikitin	2025-12-11	5,450	--
Liberate your OpenClaw 🦀	Clem 🤗, ben burtenshaw, Pedro Cuenca, Jeff Boudier, merve, Niels Rogge, Victor Mustar, and Mishig Davaadorj	2026-03-27	593	--
White Hat Security Agent Prompts 600K Dataset by Yatin Taneja	Yatin Taneja	2026-03-23	1,181	--
Letter of Superintelligence ~ Yatin Taneja	Yatin Taneja	2026-03-23	1,031	--
ORBA: Orthogonal Reflection Bounded Ablation — A Geometrically Exact Detour in Directional …	Jim Lai	2026-03-25	5,092	--
Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models	VIDRAFT_LAB	2026-03-29	1,563	--
How I contributed a new model to the Transformers library using Codex	Niels Rogge	2026-03-30	2,696	--
Training mRNA Language Models Across 25 Species for $165	Maziyar Panahi	2026-03-31	6,915	--
TRL v1.0: Post-Training Library Built to Move with the Field	Quentin Gallouédec, Steven Liu, Pedro Cuenca, and Sergio Paniego	2026-03-31	3,093	--
Falcon Perception	wamiq para and FalconPerception	2026-04-01	2,955	--
Using Storage Buckets as a Working Layer for Data Pipelines	Daniel van Strien	2026-03-26	1,095	--
Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents	Madison Lee, Rogerio Feris, Eli Schwartz, Dhiraj Joshi, Pengyuan Li, and Isaac Sanchez	2026-03-31	1,316	--
"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"	VIDRAFT_LAB	2026-03-31	2,884	--
🌈 SKT AI LABS 🌈	ѕкт αι ℓαвѕ	2026-03-30	555	--
Holo3: Breaking the Computer Use Frontier	Ramzi De Coster, Pierre-Louis Cedoz, Tony Wu, Hamza Benchekroun, mandreux-hai, delfosse, Aurélien Lac, maxime, Axel Moyal, Antonio Loison, Kai Yuan, and Ronan Riochet	2026-04-01	813	--
Take Control of What Your LLM Knows and Does — with the …	Xu Ziwen, Ningyu Zhang, Jizhan Fang, and Yunzhi Yao	2025-07-15	1,757	--
Run Gemma 4 on Intel® Arc™ GPUs Out-Of-the-Box	Matrix Yao, Chendi Xue, FanZhao, Xinyu Chen, Alex Gu, Wuxun Zhang, Xinyi Li, jianan, Yi Wang, and Yintong Lu	2026-04-01	1,495	--
Illustrated LLM OS: An Implementational Perspective	Anshuman Mishra	2023-12-03	1,302	--
Welcome Gemma 4: Frontier multimodal intelligence on device	merve, Pedro Cuenca, Sergio Paniego, ben burtenshaw, Steven Zheng, Alvaro Bartolome, and Nathan Habib	2026-04-02	6,003	--
ArmBench-LLM 1.0: Benchmarking LLMs on Armenian Language Tasks	Hrant Davtyan, Zaruhi Navasardyan, Spartak Bughdaryan, and bag_min	2026-04-02	1,205	--
YC-Bench: Can Your AI Agent Run a Startup Without Going Bankrupt?	Adit, Riddle He, Vincent Tu, Anand Kumar, and Nazneen Rajani	2026-04-02	169	--
Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their …	Gustavo A Lujan and kedar kolluri	2026-04-03	2,730	--
Run Gemma 4 on Intel® Xeon® Out-Of-the-Box	Jiang Li, Xinyu Chen, Chendi Xue, FanZhao, Yi Wang, Wuxun Zhang, Alex Gu, Xinyi Li, jianan, Yintong Lu, and Matrix Yao	2026-04-01	1,464	--
gradio.Server: Any Custom Frontend with Gradio's Backend	yuvraj sharma and Abubakar Abid	2026-04-01	1,160	--
From doctest to runnable Markdown	Tarek Ziadé	2026-04-04	1,460	--
Darwin V6: Diagnostic-Guided Evolutionary Model Merging	VIDRAFT_LAB	2026-04-08	1,003	--
How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs	Niels Rogge	2026-04-07	1,246	--
BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders	Nicolas-BZRD and Théo Deschamps-Berger	2026-04-07	1,772	--
Safetensors is Joining the PyTorch Foundation	Luc Georges and Lysandre	2026-04-08	807	--
ALTK‑Evolve: On‑the‑Job Learning for AI Agents	Vatche Isahagian, Vinod Muthusamy, Jayaram Radhakrishnan, Gaodan Fang, Punleuk Oum, and G Thomas	2026-04-08	1,180	--
Building Harvey-style tabular review from scratch, but better	Abdur-Rahman Butler	2026-04-09	4,508	--
Multimodal Embedding & Reranker Models with Sentence Transformers	Tom Aarsen	2026-04-09	2,886	--
Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs	Andrew Lapp, Louis Castricato, Scott Fox, Shahbuland Matiana, and David Rossi	2026-04-09	857	--
Magpie Speech — Applying an LLM Data Synthesis Method to an LLM-Based …	Aratako	2025-08-14	3,032	--
Using OCR models with llama.cpp	Xuan-Son Nguyen	2026-04-10	816	--
Anonymizer SLM series: Privacy-first PII replacement models (0.6B/1.7B/4B)	Pratyush Ranjan Tiwari and Eternis Team	2025-08-27	1,962	--
"Darwin-27B-Opus: Surpassing the Foundation Model Without Training"	VIDRAFT_LAB	2026-04-13	1,806	--
Releasing LiteCoder-Terminal-SFT	LiteCoder	2026-04-13	833	--
When Speech AI Meets the Long Tail of Languages: Inside the VAANI …	Sujith Pulikodan, Sanka, Nihar Desai, Suryansh Shukla, and Prasanta Kumar Ghosh	2026-04-14	901	--
Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — …	VIDRAFT_LAB	2026-04-15	1,224	--
Meet HoloTab by HCompany. Your AI browser companion.	Marc Thibault, Pierre-Louis Cedoz, Hamza Benchekroun, Kai Yuan, Aurélien Lac, Tony Wu, Antonio Loison, Axel Moyal, and Emrick Sinitambirivoutin	2026-04-15	516	--
✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use	Ziyang Luo and Kaixin Li	2025-01-03	576	--
Stop benchmarking inference providers	Nathan Habib	2026-04-14	815	--
Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts	Nucleus AI	2026-04-14	1,546	--
Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents	Ankita Naik, danish, Ben, Anupama Murthi, and Praveen	2026-04-15	3,111	--
Understanding Vector Quantization in VQ-VAE	Aritra Roy Gosthipaty	2024-08-28	1,771	--
The PR you would have opened yourself	Pedro Cuenca and Awni Hannun	2026-04-16	2,504	--
easyaligner: Forced alignment of text and audio, made easy	Faton Rekathati	2026-04-16	1,591	--
Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers	Tom Aarsen	2026-04-16	3,791	--
Building a Fast Multilingual OCR Model with Synthetic Data	Ryan Chesler	2026-04-17	2,218	--
Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents	Rahul Bajaj, Jaya Nupur, Anuj Garg, and ben burtenshaw	2026-04-16	2,563	--
NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots	Edith Llontop and Kalyan Vadrevu	2026-04-17	797	--
Vessel Browser: The Open Source Browser Designed for Autonomous Agents	Tyler Williams	2026-04-17	845	--
QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard	Leen AlQadi, Ahmed Alzubaidi, Mohammed Alyafeai, Maitha Alhammadi, Shaikha Alsuwaidi, Omar saif alkaabi, Basma Boussaha, and Hakim Hacid	2026-04-21	1,731	--
How to Ground a Korean AI Agent in Real Demographics with Synthetic …	Will Jennings, Hyunwoo Kim, Jinho Lee, jihyeonRyu, Kiran Praveen, Yev Meyer, Kirit Thadaka, and Shyamala Prayaga	2026-04-21	1,502	--
Save the traces! 🐳	Pedro Cuenca	2026-04-21	461	--
Multilingual Tool Calling in 70+ Languages, On Device	Bronson, Kato Steven Mubiru, Gimei Alex, OJ Onyeagwu, and Adnan El Assadi	2026-04-20	1,636	--
DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models	Raphael Sourty, Antoine Chaffin, Paulo Moura, and Amélie Chatelain	2026-04-21	5,774	--
AI and the Future of Cybersecurity: Why Openness Matters	Margaret Mitchell, Yacine Jernite, and Clem 🤗	2026-04-21	1,245	--
Introducing the Bright Data CLI for Automated Web Data Pipelines	Bright Data	2026-04-20	1,786	--
mlinter: a linter for Transformers modeling files	Tarek Ziadé	2026-04-22	1,827	--
Gemma 4 VLA Demo on Jetson Orin Nano Super	Asier Arranz	2026-04-22	1,575	--
ML Intern Takes Our Post-Training Internship Test	Carlos Miguel Patiño, Aksel Joonas Reedi, and Lewis Tunstall	2026-04-23	924	--
Hy3 preview: A Rebuilt Hunyuan, a 21B-Active MoE, and a New Reasoning …	Leco Li	2026-04-23	1,035	--
How to Use Transformers.js in a Chrome Extension	Nico Martin	2026-04-23	1,774	--
RL: A Structured Human Action & Intent Dataset for Physical AI and …	Gowtham and Marc Hebert	2026-04-21	2,351	--
DeepSeek-V4: a million-token context that agents can actually use	ben burtenshaw	2026-04-24	1,488	--
Measuring What Matters: Objective Metrics for Image Generation Assessment	Begüm Çığ, Bertrand Charpentier, and David Berenstein	2025-05-20	2,255	--
Building long-horizon SWE environments on Hugging Face: Frontier SWE × OpenEnv	swappy and Sourasish Basu	2026-04-26	1,224	--
How to build scalable web apps with OpenAI's Privacy Filter	yuvraj sharma, Freddy Boulton, and Abubakar Abid	2026-04-27	1,641	--
OpenRA-RL: An Open Platform for AI Agents in Real-Time Strategy Games	Xiaochuang Yuan, huixu, Yiyu Tian, momo, Ruiyue Wang, and Kaiser Sun	2026-04-27	3,015	--
An Introduction to AI Model Optimization Techniques	David Berenstein and Bertrand Charpentier	2025-04-18	1,647	--
Adaptive Ultrasound Imaging with Physics-Informed NV-Raw2Insights-US AI	Walter Simson, Jay Carlson, Tom Lassiter, Kevin Woo, and Sean Huver	2026-04-28	929	--
Running AI agents to automate outreach at scale	Niels Rogge	2026-04-27	2,296	--
Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio …	Tuomas Rintamaki, Amala Sanjay Deshmukh, Nabin Mulepati, Collin McCarthy, Pritam Biswas, Arushi Goel, Leili Tavabi, Alexandre Milesi, Danial Mohseni Taheri, Kateryna Chumachenko, Isabel Hulseman, Zhehuai Chen, Karan, and Tao	2026-04-28	3,186	--
BiomedBERT Small: Medical models at 22.7M parameters	David Mezzetti	2026-04-28	912	--
AI evals are becoming the new compute bottleneck	Avijit Ghosh, Yifan Mai, Georgia Channing, and Leshem Choshen	2026-04-29	3,881	--
Pallas for people who know JAX but not kernels yet	Aritra Roy Gosthipaty	2026-04-29	1,581	--
DeepInfra on Hugging Face Inference Providers 🔥	Aray Sultanbekova, Shang-Pin, Utemuratov, Yessen K, Oguz Vuruskaner, Célina Hanouti, Simon Brandeis, and Lucain Pouget	2026-04-29	878	--
Granite 4.1 LLMs: How They’re Built	Yousaf Shah	2026-04-29	2,848	--
The MCP Era Feels Like Déjà Vu	Mohamed Rashad and Hessah Alharbi	2026-04-29	2,023	--
Training low-bit ternary models with Axolotl	wing lian	2026-04-30	1,151	--
Open Source AI Agents \| Github/Repo List \| [2025]	tegridy	2025-02-21	477	--
ChatML vs Harmony: Understanding the new Format from OpenAI 🔍	Jisoo Kim	2025-08-09	1,914	--
Build a legal RAG app that won't be held in contempt	Tabs	2026-05-05	3,115	--
Adding Benchmaxxer Repellant to the Open ASR Leaderboard	Eric Bezzam, Steven Zheng, Eustache Le Bihan, Sergio Bruccoleri, Jeanine Sinanan-Singh, Casey Ford, Guanbo Wang, Yukai Huang, Ke Li, Yufeng Hao, and Liao Xiaoling	2026-05-06	1,400	--
Bringing Fusion Down to Earth: ML for Stellarator Optimization	Georgia Channing	2025-07-02	1,804	--
Learning Maths for the Last Time	Shane, LaneFiedler, Enderchef (Enderchefcoder), LH-Tech AI, Arman Rafiee, poe, and AxionLab	2026-05-06	1,325	--
Introducing the agentic robotics appstore for 10,000 Reachy Minis	Clem 🤗	2026-05-06	1,207	--
vLLM V0 to V1: Correctness Before Corrections in RL	Rafael Pardinas and Ehsan Kamalloo	2026-05-06	1,579	--
🧠 I trained my own French LLM from scratch — alone, with …	vloplok	2026-05-05	2,017	--
QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices	Mathias Buus, Davide Vitabile, Alex Buffa, Akshay Nambiar, and Amril Nurman	2026-05-07	9,495	--
Improving Depth Anything V2 Robustness to Video Compression	Ethan F and Ronen Nissim	2026-05-07	3,407	--
MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required	Harikrishna	2026-05-08	1,520	--
There is no such thing as a tokenizer-free lunch	Catherine Arnett	2025-09-25	3,807	--
EMO: Pretraining mixture of experts for emergent modularity	Kyle Wiggers and Ryan Wang	2026-05-08	1,830	--
CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models	Samuel	2026-05-08	1,783	--
"OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support"	Máximo López Chenlo	2026-05-09	2,938	--
Building Blocks for Foundation Model Training and Inference on AWS	Keita Watanabe, Pavel Belevich, and Aman Shanbhag	2026-05-11	4,362	--
BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡	Xing Han Lù	2024-07-09	1,358	--
Two Years of Local AI on a Laptop: When Open Models Outpaced …	Mishig Davaadorj	2026-05-11	1,653	--
Hugging Face on JFrog Artifactory: An Enterprise Guide (and What Changes in …	Jeff Boudier	2026-05-08	5,080	--
Safety Evals Should Project Test-Time Compute	Tommaso Cerruti	2026-05-11	2,521	--
You do the work. Big Tech takes the model.	Urro	2026-05-11	3,960	--
Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier …	VIDRAFT_LAB	2026-05-15	882	--
Unlocking asynchronicity in continuous batching	Rémi Ouazan Reboul, Pedro Cuenca, and Aritra Roy Gosthipaty	2026-05-14	4,015	--
Self Evolving is the Endgame or final destiny	Rajkumar rawal	2026-05-12	683	--
How to Comply with SOC 2 and ISO 27001 with Hugging Face: …	Jeff Boudier	2026-05-14	3,007	--
Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages	Kavya Manohar, Kush Juvekar, and Kumarmanas Nethil	2026-05-15	3,877	--
Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context …	Radu Florian, Parul Awasthy, Aashka Trivedi, and Madison Lee	2026-05-14	3,411	--
DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone Can Understand—Even Without …	Yihua Zhang	2025-02-28	6,290	--
PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend	AlexZhang, cuicheng, Jun Zhang, and Manhui Lin	2026-05-18	927	--
Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation	Ting-Yun Chang, Miguel Martin, Jonathan Allen, Ke Ding, and Pooya Jannaty	2026-05-18	2,653	--
The Open Agent Leaderboard	Elron Bandel	2026-05-18	1,703	--
Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 …	Daniil Suhoi	2024-07-10	4,455	--
OlmoEarth v1.1: A more efficient family of models	Kyle Wiggers	2026-05-19	898	--
Introducing the Ettin Reranker Family	Tom Aarsen	2026-05-19	5,698	--
Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚	Daniil Suhoi	2024-08-26	8,272	--
Software Forgets: Agent Traces Are the Memory	Caleb Fahlgren	2026-05-19	604	--
Talking to a 4-Year-Old: A Multilingual Benchmark for Children's AI Companions	Batuhan Aktas, Yuvraj, and fatih bugra akdogan	2026-05-03	4,557	--
Vocabulary-Augmented Prompting for Sango — Production African Language AI Without a Parallel …	MICWEN	2026-05-13	3,112	--
LeRobot Humanoid: An Open, Low-Cost, 3D-Printed Humanoid for Robot Learning	Virgile BATTO, Caroline Pascal, Steven Palma, Maxime Ellerbach, Nicolas Rabault, Martino Russi, and haixuan tao	2026-05-21	1,550	--
Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models	Mehran Maghoumi, Yonggan Fu, Pavlo Molchanov, and Khadkevich	2026-05-23	1,167	--
How to run Gemini Nano locally in your browser	Joshua	2024-07-11	1,328	--
An experiment with attention.	poe, Lane Fiedler, Shane, and Enderchef (Enderchefcoder)	2026-05-23	1,061	--
Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook	Erick Lachmann and Pimenta de Freitas Cardoso	2026-05-22	2,753	--
Why Open Models Are the Only Sustainable Way to Teach AI	Pénélope Gittos	2026-05-22	1,325	--
Harness, Scaffold, and the AI Agent Terms Worth Getting Right	Sergio Paniego and Aritra Roy Gosthipaty	2026-05-25	2,117	--
Relaunching PapersWithCode with new features	Niels Rogge	2026-05-24	498	--
Borealis — open data, code, weights recipe for training Audio LLM	Wortega	2026-05-25	2,303	--
Eight Days in China: What I Learned from the AI Labs, Robotics …	Matt White	2026-05-22	12,170	--
SANA-WM Bidirectional on Apple Silicon	Arjun Reddy	2026-05-20	1,105	--
Should we use genetics instead of system prompts for AI Agents & …	Fyx	2026-05-25	2,550	--
ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic …	Ayhan Sebin, Saurabh Jha, and Rohan Arora	2026-05-27	889	--
Give your agents ZeroGPU to ship viral AI apps autonomously	Victor Mustar	2026-05-26	941	--
Reachy Mini goes fully local	Amir Mahla and Andres Marafioti	2026-05-27	1,849	--
Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in …	Amine Dirhoussi, Quentin Gallouédec, Kashif Rasul, Lewis Tunstall, Edward Beeching, Albert Villanova del Moral, and Leandro von Werra	2026-05-27	4,227	--
Introduction to Trimming ✂	Loïck BOURDOIS, Tom Aarsen, Bram Vanroy, Woojun Jung, Manuel Romero, and Prithiv Sakthi	2026-05-28	19,577	--
MONET: Lowering the bar for World-Class Image Generation research.	Benjamin Aubin, Gonzalo Quintana, Onur, sanjeev sreetharan, Czerwinska, Damien Henry, and Clément Chadebec	2026-05-28	1,601	--
Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler	Aritra Roy Gosthipaty, Sayak Paul, Sergio Paniego, Rémi Ouazan Reboul, and Pedro Cuenca	2026-05-29	5,132	--
Dell Enterprise Hub at Dell Tech World 2026: new models, new platforms, …	Simon Pagezy, Enrique Hernández Calabrés, Juan Julián, Bagus Hanindhito, Girish Ganesan, ravikumar, Ian Roche, Jeff Boudier, and Balachandran Rajendran	2026-05-29	1,112	--
Server is at capacity	specimba, Lewis Tunstall, and Aksel Joonas Reedi	2026-05-27	266	--
ClawHub Security Signals: Large Corpus Multi-Scanner Dataset for Agent Skill Security Research	Vincent Koc, Patrick Erichsen, Jacob Tomlinson, Agustin Rivera, Mike Appel, and Nir Paz	2026-06-01	1,400	--
Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning …	Asawaree and Atharva Joshi	2026-06-01	1,960	--
Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic	Nicholas Fuller	2026-06-01	2,177	--
Agentic RL: Token-In, Token-Out Done Right	Quentin Gallouédec and Kashif Rasul	2026-05-29	3,670	--
MiniMax Goes Sparse: Decoding M3's Attention from a Single Diagram	Atlas Cloud	2026-05-29	1,680	--
A Deep Neural Network that turns Any Image into a Playable Game! …	Abhishek Sensharma	2026-06-01	365	--
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains	Nikita Pavlichenko	2026-06-01	600	--
Holo3.1: Fast & Local Computer Use Agents	Maxime Langevin, Hamza Benchekroun, Axel Moyal, Emrick Sinitambirivoutin, Antonio Loison, Avshalom Manevich, Tony Wu, Pierre-Louis Cedoz, Aurélien Lac, and Ronan Riochet	2026-06-02	867	--
Taking Alpamayo to New Heights with Driving Foundation Models and Closed-Loop Training	Marco Pavone and Boris Ivanovic	2026-06-01	1,386	--
From Data Repositories to Production Data Pipelines: Bridging Hugging Face Datasets and …	Parag Ekbote	2026-06-01	1,357	--
AutoResearch on Diffusers' Pipeline for 10 Rounds on JarvisLabs	chansung park	2026-06-03	2,294	--
Adding MCP Tools to Reachy Mini	Alina Lozovskaya	2026-06-03	2,068	--
Direct Preference Optimization Beyond Chatbots	Erick Lachmann and Pimenta de Freitas Cardoso	2026-06-03	2,953	--
How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent	Maryam Motamedi, Adi- margolin, Francesco, Myungjong Kim, Enas Albasiri, and Jinhan Wang	2026-06-04	2,254	--
Fine-tune FLUX.2 [klein] with a LoRA under 60 minutes	Stephen Batifol	2026-06-04	2,617	--
EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios	Tara Bogavelli, Gabrielle Gauthier Melancon, Katrina Stankiewicz, Nifemi Bamgbose, Fanny Riols, Hoang Nguyen, Raghav Mehndiratta, Lindsay Brin, Joseph Marinier, Hari Subramani, and Anil Madamala	2026-06-04	1,990	--
Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining	Dan Su	2026-06-04	1,811	--
Designing the hf CLI as an agent-optimized way to work with the …	Célina Hanouti and Lucain Pouget	2026-06-04	2,856	--
Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI	Varun Singh , Isabel Hulseman, Anuj Doshi, and Shyamala Prayaga	2026-06-04	2,226	--
Does Depth Actually Help Reasoning? A Tiny Experiment on 2× T4	Wop and Lane Fiedler	2026-05-30	708	--
Thousand Token Wood: shipping a multi-agent economy on a 3B model	Lester Leong	2026-06-05	1,029	--
Build Small Hackathon With Cohere Models	Alejandro Rodriguez	2026-06-04	2,018	--
Building Pakistan Notice Helper: A Small AI Tool for a Very Local …	Abid Ali Awan	2026-06-08	2,724	--
Her · हेर — a detective for your Claude Code sessions	Ashish Chalke	2026-06-07	622	--
The Open Source Community is backing OpenEnv for Agentic RL	ben burtenshaw, Joseph Spisak, Lysandre, Davide Testuggine, will brown, Joy Liu, Peyton Walters, Chris Wing, Daniel (Unsloth), Andrew Zhou, Michael Han, Hamid Shojanazeri, Sanyam Bhutani, Zach Wentz, Emre Guven, Lewis Tunstall, and Sergio Paniego	2026-06-08	850	--
Job Searcher	Emre	2026-06-06	872	--
Five labs, five minds: building a multi-model finance drama on small models	Lester Leong	2026-06-06	1,141	--
Arcee Becomes the First Major American AI Lab to Replace AWS S3 …	Clem 🤗, Lucas Atkins, and Mark McQuade	2026-06-09	900	--
Run Claude Code, OpenCode & Frontier Coding Models on Your Own AI …	Girish Ganesan and Balachandran Rajendran	2026-06-06	1,723	--
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging …	Mishig Davaadorj	2026-06-09	907	--
Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech	Shama Gupta, Lindsay Brin, and Fanny Riols	2026-06-09	2,621	--
Migrating Your GitHub CI to Hugging Face Jobs	Abubakar Abid	2026-06-09	1,753	--
Introducing North Mini Code: Cohere’s First Model For Developers	Cohere Code Agents Team	2026-06-09	2,737	--
Lolaby — AI-powered lullabies	André Oliveira and Vasco Oliveira	2026-06-11	1,484	--
Eyes, ears, and a voice: building Reachy Mini's media stack	Fabien Danieau, Alina Lozovskaya, Caroline Pascal, and Antoine Pirrone	2026-06-10	2,694	--
36 Prompts, One Infinite City	Mishig Davaadorj	2026-06-10	805	--
Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP	Aritra Roy Gosthipaty, Rémi Ouazan Reboul, Sergio Paniego, Pedro Cuenca, and Sayak Paul	2026-06-11	3,681	--
MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era	MiniMax	2025-01-15	876	--
MTEB Leaderboard: From a slow demo to feature-rich leaderboard	Solomatin Roman, Kenneth C. Enevoldsen, and Isaac Chung	2026-06-12	955	--
olmo-eval: An evaluation workbench for the model development loop	Tyler Murray and Kyle Wiggers	2026-06-12	1,545	--
Introducing Serge: GitHub-Native AI Code Review	Tarek Ziadé and Sayak Paul	2026-06-12	1,443	--
Mobile Manipulation with LeKiwi and PincOpen	Xingdong Zuo	2026-06-07	3,431	--
PhysicsIntern: from an Autonomous Benchmark-runner to a Research Sidekick	David Louapre	2026-06-11	2,125	--
Optimum Intel 2.0: An OpenVINO-First Toolkit for Running Open Models on Intel	Jeff Boudier and Ella Charlaix	2026-06-11	1,038	--
NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain	Harisabekti Dicky Subrata	2026-06-09	794	--
FINAL-Bench Quantum: An Open, Neutral Benchmark for Quantum-Computing Methods	VIDRAFT_LAB	2026-06-14	679	--
Building an AI Interview Coach for the BuildSmall Hackathon 2026	Ishan Awasthi	2026-06-15	1,163	--
PitchFight AI: Practice the Pitch Before the Real Room	Prakhar Parashar	2026-06-14	743	--
Eyas: AI Security Camera Agent	Seunghyun(Joe), Hanhee Lee, and Javier Huang	2026-06-15	1,983	--
GLM-5.2: Built for Long-Horizon Tasks	Z.AI	2026-06-17	2,853	--
From the Hugging Face Hub to robot hardware with Strands Agents and …	Sundar Raghavan and Cagatay Cali	2026-06-17	3,491	--
Party is over: regularizing ColBERT models to fix efficient ANN methods	Antoine Chaffin	2026-06-16	5,150	--
Closet Twin: Your AI-Powered Personal Stylist Built for the Build Small Hackathon	Nouhaila mfth	2026-06-14	694	--
MosaicLeaks: Can your research agent keep a secret?	Alexander Gurung and Rafael Pardinas	2026-06-18	1,889	--
Is it agentic enough? Benchmarking open models on your own tooling	Lysandre, Nathan Habib, and Pedro Cuenca	2026-06-18	3,363	--
MolmoMotion: Language-guided 3D motion forecasting	Kyle Wiggers	2026-06-17	1,901	--
Beyond LoRA: Can you beat the most popular fine-tuning technique?	Benjamin Bossan, Sayak Paul, Marian, and Kashif Rasul	2026-06-18	2,754	--
Intel XPU Kernel Skill: LLM-driven Triton kernel optimization for the Hugging Face …	Daniel Fleischer and Moshe Wasserblat	2026-06-17	2,201	--
Agentic Resource Discovery: Let agents search for tools, skills, and other agents.	ben burtenshaw and shaun smith	2026-06-17	1,418	--
Enterprise AI benchmarks: head-to-head comparison of Falconer, Notion, Atlassian Rovo, Claude Code, …	Maximiliano Benedetto and Matt Zhao	2026-06-18	1,668	--
QLORA SFT Distillation Effects on Qwen3.6 27B Agentic Coding Harness Fluency	Thomas Kim	2026-06-15	1,939	--
The Office Meets Silicon Valley	Felix	2026-06-15	1,709	--
How We Built OpenMythos: A Cybersecurity LLM Trained from Scratch	Nishith Jain	2026-06-15	1,139	--
I fine-tuned a model for free from one prompt, with TRL and …	Sergio Paniego	2026-06-15	922	--
No Photoshop, No Blender: Multimedia by Agent	Mishig Davaadorj	2026-06-19	1,166	--
PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters	AlexZhang, cuicheng, Jun Zhang, Manhui Lin, Yue Zhang, leo-q8, yubo, and Yi Liu	2026-06-22	1,089	--
Shipping huggingface_hub every week with AI, open tools, and a human in …	Lucain Pouget and Célina Hanouti	2026-06-23	2,343	--
🧬 Carbon-VEPor: Efficient Variant Effect Prediction with Carbon	Vivek Silimkhan	2026-06-15	1,743	--
We got local models to triage the OpenClaw repo for FREE!*	Onur Solmaz, ben burtenshaw, shaun smith, Pedro Cuenca, and Lysandre	2026-06-22	2,891	--
V-Zero	haoxiang sun	2026-06-22	859	--
Continuous batching for GRPO, now in TRL	Sergio Paniego	2026-06-19	712	--
Where Does the Signal Live? A Web Data Recipe for Medical Encoder …	bofeng huang, Sun Jacques, Diane Bouchacourt, Nicolas Barascud, and Fajwel Fogel	2026-06-20	2,019	--
Experimenting with the proposed Cross-Origin Storage API in Transformers.js	Thomas Steiner	2026-06-23	2,915	--
Building Moon Bot: A Slack-Native Coding Agent Backed by HuggingFace Buckets	Eliott Coyac, Caleb Fahlgren, and Franck Abgrall	2026-06-24	2,003	--
Build real agentic apps using CUGA: two dozen working examples on a …	Anupama Murthi, Hamid Adebayo, Sami Marreed, Praveen, and Asaf Adi	2026-06-23	3,392	--
The Best Open Source and Open-Weight LLM Models to Run Locally in …	Daya Shankar	2026-05-13	4,740	--
Interhuman’s Goblin: “Yeah, Friday at Five”	Siddharth Ravi	2026-06-24	2,371	--
Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel	Adil Asif, Alexandros Koumparoulis, Wenwen Gao, Sylendran Arunagiri, David Messina, and Bernard Nguyen	2026-06-24	2,234	--
Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World	Daniel Gert Nielsen, Shivam Saini, Alessia Milo, Georg Götz, and Eric Bezzam	2026-06-24	1,647	--
Kog Laneformer 2B: The Latency-First Model Behind Kog Inference Engine	Morgan Giraud, Gauthier Tallec, and Gaël Delalleau	2026-06-24	3,042	--
Which tokens does a hybrid model predict better?	Kyle Wiggers	2026-06-25	1,364	--
Run a vLLM Server on HF Jobs in One Command	Quentin Gallouédec	2026-06-26	1,611	--
Machine learning for alien climates: Introducing the ThousandWorlds benchmark	Edward Stevenson	2026-06-23	899	--
VLX-Flow: Continuous Video Understanding for Real-Time Multimodal Interaction	Tony Zhao and Yibo Ma	2026-06-26	1,194	--
VLX-Flow: Continuous Video Understanding for Real-Time Multimodal Interaction	Tony Zhao	2026-06-27	1,223	--
VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of Coordinate Generation	Peng Liu and Tony Zhao	2026-06-27	3,375	--
VLX-Go: Vision-Language Short-Horizon Waypoint Prediction for Embodied Navigation	Peng Liu and Tony Zhao	2026-06-28	1,138	--
OlmoLogic: Boosting Reasoning via RLVR with Inductive Logic Programming	Lukas Helff, Sebastian Sztwiertnia, Felix Friedrich, Hikaru Shindo, and Ahmad Omar	2026-06-26	2,702	--
Chitos: From Detection to Proof — An Autonomous Security AI That Actually …	VIDRAFT_LAB	2026-06-29	1,660	--
Featuring Every Eval Ever Results on Hugging Face Model Pages	Sree Harsha Nelaturu, Avijit Ghosh, Nathan Habib, Jan Batzner, Leshem Choshen, Irene Solaiman, and Julien Chaumond	2026-06-30	1,434	--
80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your …	Mike Smith	2026-06-29	2,494	--
DukaanBench: Can AI Run an Indian Grocery Store for 30 Days?	Ekansh Srivastva	2026-06-27	3,871	--
Why Specialization Is Inevitable	Erick Lachmann and Francisco de Almeida Rocha Alves	2026-06-30	2,264	--
DiScoFormer: One transformer for density and score, across distributions	Kyle Wiggers	2026-06-29	894	--
Does Your LLM Know When It's About to Be Wrong?	ginigen-ai	2026-07-01	1,446	--
ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration	Raju Pavuluri, Rahul Krishna, Srikanth Govindaraj Tamilselvam, Bridget M, Ashita Saxena, George Safta, Advait Pavuluri, and Michele Merler	2026-06-30	1,067	--
Hugging Face and Cerebras bring Gemma 4 to real-time voice AI	Amir Mahla, Andres Marafioti, Leandro von Werra, and Saurabh Vyas	2026-07-01	576	--
Heretic Grimoire	Vinay Umrethe	2026-06-30	1,360	--
Pulpie: Pareto-Optimal Models for Cleaning the Web	Shreyash Nigam	2026-07-01	2,232	--
AstroBERT Small: Domain-specialized small models	David Mezzetti	2026-07-01	1,249	--
Adding a GPU Without Building One	VIDRAFT_LAB	2026-07-03	1,374	--
SportsBERT Small: Domain-specialized small models	David Mezzetti	2026-06-26	1,126	--
Claude Fable 5 — Technical Harness Report	NIONGOLO Chrys Fé-Marty	2026-07-01	3,626	--
🤗 Kernels: Major Updates	Sayak Paul, Daniël de Kok, and David Holtz	2026-07-06	1,812	--
LeRobot v0.6.0: Imagine, Evaluate, Improve	Steven Palma, Pepijn Kooijmans, Caroline Pascal, Khalil Meftah, Martino Russi, Nikodem Bartnik, Nicolas Rabault, and Thomas Wolf	2026-07-07	2,614	--
Gemma-4 31B + vLLM on RTX 6000 PRO : A Real-Load Benchmark	Nikhil K.	2026-06-29	786	--
🔁 Apprendre à un LLM français de 15M à penser plus profond …	RDTvlokip	2026-07-03	5,522	--
🔁 Teaching a 15M French LLM to think deeper — and to …	RDTvlokip	2026-07-03	4,769	--
PRX Part 4: Our Data Strategy	Roman Frigg, David Bertoin, and Jon Almazán	2026-07-06	4,298	--
BaseRT: Best-in-Class LLM Inference on Apple Silicon via Native Metal	Prabod, Fabian Waschkowski, and Lukas Wesemann	2026-07-01	1,290	--
Run AI workloads on any cloud, store on Hugging Face: zero-egress storage …	Nikhil Jha, Zhanghao Wu, Hope Wang, Adrien Carreira, and Julien Chaumond	2026-07-07	1,818	--
Teaching a coding agent to deploy production endpoints on Amazon SageMaker	Dario Salvati, Alvaro Bartolome, and Jeff Boudier	2026-07-07	3,582	--
Hugging Face Models on Foundry Managed Compute	Manoj Bableshwar and Osi	2026-07-07	2,222	--
From Hugging Face to Amazon SageMaker Studio in one click	Hazim Qudah	2026-07-07	1,017	--
After the party comes the free lunch: regularizing ColBERT models to enhance …	Antoine Chaffin	2026-07-06	2,819	--
NVIDIA Isaac Teleop and GR00T 1.7 Open VLA Model Available in LeRobot	lior ben horin, Kartik S, Johnny Nuñez Cano, Edith Llontop, Leung, Andrew C Wrenn, and Shane Reetz	2026-07-07	1,495	--
Atom2.7m: Representation-Level Specialization for Arithmetic-Aware Small Language Models	Maksymilian	2026-07-07	2,675	--
Native-speed vLLM transformers modeling backend	Harry Mellor and Lysandre	2026-07-08	955	--
Data for Agents	Will Jennings, Jane Polak Scowcroft, Annie Surla, Yev Meyer, Rebecca Kao, Leanna Chraghchian, Chris Alexiuk, Michelle Xu, and Dhruv Nathawani	2026-07-08	1,312	--
Meet Cohere Transcribe Arabic	Shaun Cassini, Sebastian Vincent, Xiaolu Lu, Julian Mack, Dhruti Joshi, and Pierre Richemond	2026-07-07	1,336	--
Distillation in 2026 (so far): which frontier models use it and how	Sergio Paniego	2026-07-08	1,123	--
Taking Alpamayo to New Heights with Driving Foundation Models and Closed-Loop Training	Marco Pavone and Boris Ivanovic	2026-06-01	1,380	--
Profiling in PyTorch (Part 3): Attention is all you profile	Aritra Roy Gosthipaty, Sergio Paniego, Sayak Paul, and Rémi Ouazan Reboul	2026-07-10	4,196	--
How to visualize any Hugging Face model	Hannes von Essen	2026-07-10	563	--
Can Skills Improve Codex’s Data Analysis Capabilities?	Ningyu Zhang	2026-07-10	3,315	--
Quantum Cryptanalysis on Real Hardware: Pushing Symmetric-Structure Key Recovery Beyond the Published …	VIDRAFT_LAB	2026-07-05	2,183	--
Why Whisper cuts off Indic transcripts after six seconds	Kavya Manohar and Kush Juvekar	2026-07-07	1,452	--
Can Codex Handle Real-World Data Analysis?	Ningyu Zhang	2026-07-10	3,210	--
Distilling OmniVoice into Aegis: Female Urdu TTS at 61 MB ONNX for …	Mahwiz Khalil	2026-07-05	1,151	--
VKUE: No GPU? Runs Anyway — a 34.7B Reasoner on a Laptop …	VIDRAFT_LAB	2026-07-12	1,032	--
Putting DoctoBERT to Work: A Practical Guide	bofeng huang and Emma Scharfmann	2026-07-09	3,937	--
Giving AI Agents 3D Bodies, Real Jobs, and Wallets on three.ws	three.ws	2026-07-13	1,748	--
J-Space: Yet Another LLM Mind Reader?	David Louapre	2026-07-13	4,714	--
Deploy GLM-5.2-FP8 as your open, frontier-level agent	Juan Julián	2026-07-13	1,687	--
Welcome Inkling by Thinking Machines	ben burtenshaw, merve, Pedro Cuenca, and Aritra Roy Gosthipaty	2026-07-15	3,472	--
Introducing Real World VoiceEQ: Measuring the human quality of voice AI	David Ayllon, Alice, Jeff Brooks, Franc Camps Febrer, Jakub Piotr Cłapa, Theo Lebryk, Jens Madsen, Olya Ossipova, Sharath Rao, Hoon Shin, Tigran, Rashish Tandon, and Panagiotis Tzirakis	2026-07-15	1,152	--
Model Routing Is Simple. Until It Isn’t.	Yara Rizk, Eyal Shnarch, Jason Tsay, and Merve Unuvar	2026-07-15	1,052	--
The state-of-the-art in open-source AI for Swiss legal tasks	Joel Niklaus and Daniel	2026-07-14	2,249	--
What building Shippy taught us about building agents	Kyle Wiggers	2026-07-15	1,937	--
Security incident disclosure — July 2026	system	2026-07-16	887	--
NVIDIA Nemotron 3 Embed Ranks #1 Overall on RTEB, Advancing Agentic Retrieval	Yauhen Babakhin, Ronay Ak, Jiarui Cai, Vinay Raman, Radek Osmulski, Jakub Zakrzewski, Anmol Gupta, Oliver Holworthy, Sahel Sharifymoghaddam, Khang Pham, James Rong, Steve Han, Sean Sodha, Isabel Hulseman, and Bo Liu	2026-07-16	2,269	--
One Adapter, Both Modalities: Field Notes from Building and Serving a Multimodal …	Amélie Chatelain and Ishrat Jahan Ananya	2026-07-16	7,302	--
Newer Models, Same Advantage	Erick Lachmann, Gabriel Pimenta de Freitas Cardoso, Francisco de Almeida Rocha Alves, and Victor Gabriel Ferreira Barbosa	2026-07-16	2,359	--
Kimi K3 Model Overview: 2.8T Parameters, MXFP4 Quantization, and What the Open …	Viddi AI	2026-07-17	1,046	--
Fine-tune video and image models at scale with NVIDIA NeMo Automodel and …	Pranav Prashant Thombre, linnan wang, Alexandros Koumparoulis, Wenwen Gao, Sylendran Arunagiri, and Bernard Nguyen	2026-07-17	1,999	--
When will language models be good enough?	Colin Raffel	2026-07-16	812	--
Aether-7B-5Attn: A 100% Open-Source Sovereign Foundation Model — and a Controlled Experiment …	VIDRAFT_LAB	2026-07-19	2,299	--

Plushcap, by Matt Makai. 2021-2026.