Hugging Face Blog
295 posts indexed since 2026
Post Details
| Title | Author(s) | Published | Word count | HN points |
|---|---|---|---|---|
| Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B … | weitaofeng | 2026-01-01 | 1,778 | -- |
| The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on … | Yağız Çalık | 2026-01-02 | 5,072 | -- |
| Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture | Basma Boussaha, Mohammed Alyafeai, Ahmed Alzubaidi, Leen AlQadi, Shaikha Alsuwaidi, Omar saif alkaabi, Hamza Alobeidli, and Hakim Hacid | 2026-01-05 | 1,838 | -- |
| TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell | Konstantin | 2026-01-05 | 3,309 | -- |
| Introducing Falcon H1R 7B | Iheb Chaabane, Puneesh Khanna, Suhail M Shah, Slim Frikha, Shi Hu, Abdalgader Abubaker, Reda alami, Mike Lubinets, Mohamed El Amine Seddik, and Hakim Hacid | 2026-01-05 | 1,332 | -- |
| Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem | Marco Pavone | 2026-01-05 | 893 | -- |
| Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models | Ashish Chadha | 2026-01-03 | 2,023 | -- |
| NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI | Tsung-Yi Lin and Debraj Sinha | 2026-01-05 | 1,037 | -- |
| Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR | Kunal Dhawan, Adi- margolin, Gordana Neskovic, Maryam Motamedi, and Yasmina Benkhoui | 2026-01-05 | 1,860 | -- |
| NVIDIA brings agents to life with DGX Spark and Reachy Mini | Jeff Boudier, Nader Khalil, and Alec Fong | 2026-01-05 | 2,128 | -- |
| M2.1: Multilingual and Multi-Task Coding with Strong Generalization | MiniMax | 2026-01-05 | 2,306 | -- |
| Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot | Raffaello Bonghi, lior ben horin, Kartik S, and Kalyan Vadrevu | 2026-01-05 | 1,038 | -- |
| Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval … | Ronay Ak, Gabriel de Souza Pereira Moreira, and Bo Liu | 2026-01-06 | 1,492 | -- |
| OpenMed: Six Months of Open-Source Medical AI and the Road Ahead | Maziyar Panahi | 2026-01-06 | 2,424 | -- |
| Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads | MiniMax | 2026-01-06 | 736 | -- |
| Diversity Vs Density: A data strategy comparison for fine-tuning VLMs | Akhil Theerthala | 2026-01-06 | 2,301 | -- |
| 🥃 Distilling Tiny Embeddings | David Mezzetti | 2026-01-10 | 1,082 | -- |
| Introducing OptiMind, a research model designed for optimization | Anson Ho, Sirui Li, and Ishai Menache | 2026-01-15 | 395 | -- |
| How We Built a Semantic Highlight Model To Save Token Cost for … | Cheney Zhang and Jiang Chen | 2026-01-15 | 2,344 | -- |
| Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments | Bingyang Ye and Shan Chen | 2026-01-13 | 2,717 | -- |
| Open Responses: What you need to know | shaun smith, ben burtenshaw, merve, and Pedro Cuenca | 2026-01-15 | 1,344 | -- |
| Beyond Brute Force: Why LoongFlow is the “Thinking” Evolution of OpenEvolve | Xunan Dai | 2026-01-16 | 1,108 | -- |
| SmolLM-Smashed: Tiny Giants, Optimized for Speed | David Berenstein | 2026-01-13 | 982 | -- |
| VLM-OCR Recipes on GPU Infrastructure | Florent Gbelidji | 2026-01-15 | 2,281 | -- |
| Reviewer Two (but it's an OpenEnv) | Chris von Csefalvay | 2026-01-13 | 1,653 | -- |
| Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments | ben burtenshaw | 2026-01-20 | 1,158 | -- |
| LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family | Said Taghadouini, Adrien Cavaillès, and Baptiste Aubertin | 2026-01-19 | 934 | -- |
| Differential Transformer V2 | Li Dong | 2026-01-20 | 3,136 | -- |
| 🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models | Fanny Jourdan and Antonin Poché | 2026-01-20 | 2,112 | -- |
| New in llama.cpp: Anthropic Messages API | Xuan-Son Nguyen and Victor Mustar | 2026-01-19 | 541 | -- |
| One Year Since the “DeepSeek Moment” | Adina Yakefu and Irene Solaiman | 2026-01-20 | 1,617 | -- |
| Optimizing GLM4-MoE for Production: 65% Faster TTFT with SGLang | Novita AI | 2026-01-22 | 1,047 | -- |
| Security, Governance and Performance for Dell On-Prem AI Builders | Balachandran Rajendran, Juan Julián, Alvaro Bartolome, Enrique Hernández Calabrés, Simon Pagezy, and Jeff Boudier | 2026-01-21 | 1,064 | -- |
| RexRerankers: SOTA Rankers for Product Discovery and AI Assistants | Rahul Bajaj, Anuj Garg, and Jaya Nupur | 2026-01-24 | 3,704 | -- |
| Challenges of Synthetic Dataset Generation | Rishiraj Acharya | 2026-01-21 | 942 | -- |
| Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models | Asankhaya Sharma | 2026-01-23 | 1,825 | -- |
| AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality | Dhaval Patel, James Rayfield, Saumya Ahuja, Chathurangi Shyalika, Shuxin Lin, and Zhou | 2026-01-21 | 1,505 | -- |
| One Year Since the “DeepSeek R1 Moment” | vansin | 2026-01-20 | 315 | -- |
| Benchmark Smarter: Tailor Your Model Evaluation Suite with EvalScope | kelseye.xh | 2026-01-22 | 1,973 | -- |
| Waypoint-1: Real-time Interactive Video Diffusion from Overworld | Andrew Lapp, Louis Castricato, Scott Fox, Shahbuland Matiana, and David Rossi | 2026-01-20 | 853 | -- |
| Why Your AI Strategy Needs Hugging Face Storage | Adrian Lepers | 2026-01-26 | 1,008 | -- |
| NVIDIA Earth-2 Open Models Span the Whole Weather Stack | Mike Pritchard, Jaideep Pathak, Jean Kossaifi, and Aayush Gupta | 2026-01-26 | 736 | -- |
| Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs | Omar saif alkaabi, Ahmed Alzubaidi, Hamza Alobeidli, Shaikha Alsuwaidi, Mohammed Alyafeai, Leen AlQadi, Basma Boussaha, and Hakim Hacid | 2026-01-27 | 1,585 | -- |
| Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective | Jason Zhu, Hejian Sang, Arup De, Rohit Jain, and Yanning Chen | 2026-01-27 | 4,160 | -- |
| Friends and Grandmothers in Silico | Itay Yona | 2026-01-24 | 4,089 | -- |
| Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek | Adina Yakefu and Irene Solaiman | 2026-01-27 | 1,324 | -- |
| Nemotron-Personas-Brazil: Co-Designed Data for Sovereign AI | Andre Manoel, Yev Meyer, Shyamala Prayaga, Will Jennings, and bardiya sadeghi | 2026-01-28 | 903 | -- |
| The Great Classification Showdown: OSS vs BERT on Consumer Hardware | Ben Toussaint | 2026-01-26 | 1,938 | -- |
| We got Claude to teach open models how to write CUDA kernels! | ben burtenshaw, shaun smith, merve, and Pedro Cuenca | 2026-01-28 | 2,350 | -- |
| Slashing torch.compile Warmup & LoRA Swapping Times with Pruna | John Rachwan, Johanna Sommer, Bertrand Charpentier, and Sara Han Díaz | 2026-01-28 | 1,513 | -- |
| Nemotron-Personas-Singapore: Co-Designed Data for Sovereign AI | Will Jennings, Dane Corneil, Yev Meyer, Verdi March, Shyamala Prayaga, and bardiya sadeghi | 2026-01-27 | 1,041 | -- |
| TruthTensor: LLM Evaluation in Prediction Markets Under Drift and Market Baseline | Elena Pashkova, shirin Shahabi, Hudson, and Ronald Chan | 2026-01-29 | 1,631 | -- |
| Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp | Doctor Shotgun and Geechan | 2026-01-30 | 2,508 | -- |
| Introducing NVIDIA Cosmos Policy for Advanced Robot Control | Pranjali Joshi, Tsung-Yi Lin, Jinwei Gu, and Prachi Mishra | 2026-01-29 | 1,333 | -- |
| Introducing Daggr: Chain apps programmatically, inspect visually | merve, yuvraj sharma, Abubakar Abid, hysts, and Pedro Cuenca | 2026-01-29 | 1,559 | -- |
| Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 … | Alvaro Moran | 2026-02-02 | 2,906 | -- |
| Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance | Jun Zhang, Jason Zheng, Boxi Cao, and ReasoningLens | 2026-02-03 | 693 | -- |
| Training Design for Text-to-Image Models: Lessons from Ablations | David Bertoin, Roman Frigg, and Jon Almazán | 2026-02-03 | 7,420 | -- |
| The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ | Adina Yakefu and Irene Solaiman | 2026-02-03 | 1,602 | -- |
| H Company's new Holo2 model takes the lead in UI Localization | Ramzi De Coster, Hamza Benchekroun, and Aurélien Lac | 2026-02-03 | 214 | -- |
| Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s … | Ronay Ak and Gabriel de Souza Pereira Moreira | 2026-02-04 | 1,048 | -- |
| NVIDIA Agentic Smart Router on Dell Enterprise Hub: Deep Dive on Architecture, Design … | Khushboo Rathi and Balachandran Rajendran | 2026-02-03 | 995 | -- |
| CRAFT: Continuous Reasoning and Agentic Feedback Tuning | Valentin, Denis Timonin, Alexandr, and Alexey | 2026-02-05 | 813 | -- |
| Introducing SyGra Studio | Surajit Dasgupta, Bidyapati Pradhan, Amit Kumar Saha, Vipul Mittal, and Sriram Puttagunta | 2026-02-05 | 747 | -- |
| 🚀 SyGra V2.0.0 | Sriram Puttagunta, Surajit Dasgupta, Bidyapati Pradhan, Amit Kumar Saha, and Vipul Mittal | 2026-02-05 | 724 | -- |
| From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails … | Maziyar Panahi | 2026-02-07 | 5,766 | -- |
| Transformers.js v4 Preview: Now Available on NPM! | Joshua and Nico Martin | 2026-02-09 | 1,185 | -- |
| Training Qwen3 VL to label bbox: synthetic data, environment and training … | Ulrick BLE | 2026-02-09 | 2,544 | -- |
| 🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs | Guanchu | 2026-02-11 | 616 | -- |
| Building a Mood-Based Movie Recommendation Engine with Voyage-4-nano, Hugging Face, and MongoDB … | Arkadiusz Borucki | 2026-02-08 | 3,315 | -- |
| Enabling Large Scale RLHF of GPTOSS with Megatron backend in VeRL | LEI WANG | 2026-02-10 | 5,934 | -- |
| OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments | Christian Washington, Ankit Jasuja, Santosh Sah, Lewis Tunstall, and ben burtenshaw | 2026-02-12 | 1,656 | -- |
| LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search … | Antoine Chaffin and Raphael | 2026-02-12 | 4,993 | -- |
| Forge: Scalable Agent RL Framework and Algorithm | MiniMax, Hyn, zhi zhang, Jiayuan Song, Da Chen, xkc, Yaoyao, kennyKK, and zpysky1125 | 2026-02-13 | 3,387 | -- |
| How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs … | Aritra Roy Gosthipaty | 2026-02-12 | 606 | -- |
| Custom Kernels for All from Codex and Claude | ben burtenshaw, Sayak Paul, Aritra Roy Gosthipaty, and shaun smith | 2026-02-13 | 1,792 | -- |
| What superpower does Kimi-K2.5 bring to the table? | Leco Li | 2026-02-13 | 1,154 | -- |
| The Chinese GLM-5 Model Now Ranks #2 in Arabic Language Performance | Karim Ouda | 2026-02-16 | 322 | -- |
| Compute and Competition in AI: Different FLOPs for Different Folks | Yacine Jernite and Sasha Luccioni | 2026-02-12 | 1,917 | -- |
| How to Build a Benchmark with a Private Test Set on Hugging … | Georgia Channing | 2026-02-16 | 1,775 | -- |
| Qwen3.5: Nobody Agrees on Attention Anymore | Maxime Labonne | 2026-02-17 | 1,192 | -- |
| NVIDIA Nemotron 2 Nano 9B Japanese: A State-of-the-Art Small Language Model Supporting Japan's Sovereign AI | Atsunori Fujita, Kotaro Yamamoto, Masaya Ogushi, Vincent Gong, Ameya Sunil Mahabaleshwarkar, and Yoshi Suhara | 2026-02-17 | 297 | -- |
| DenseR: Dense Rewards For Free in LLM Reasoning | Hritik Bansal | 2026-02-18 | 3,977 | -- |
| De-mystifying Multimodal Learning: Enabling Vision in Language Models | Matteo Nulli | 2026-02-17 | 2,797 | -- |
| One-Shot Any Web App with Gradio's gr.HTML | yuvraj sharma, hysts, and Freddy Boulton | 2026-02-18 | 829 | -- |
| IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and … | Ayhan Sebin, Rohan Arora, and Saurabh Jha | 2026-02-18 | 2,253 | -- |
| Did GPT 5.2 make a breakthrough discovery in theoretical physics? | David Louapre | 2026-02-19 | 4,541 | -- |
| ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models? | Antoine Chaffin, Luca Arnaboldi, Amélie Chatelain, and Florent Krzakala | 2026-02-19 | 2,306 | -- |
| Overcoming the “Data Scarcity” Barrier: Synthetic Personas Accelerate Japan's AI Development | Atsunori Fujita, Masaya Ogushi, Will Jennings, Yev Meyer, Kotaro Yamamoto, Yoshi Suhara, Vincent Gong, and Dane Corneil | 2026-02-19 | 280 | -- |
| I Let a Lobster Run My Jetson: What OpenClaw Taught Me About … | Andres Marafioti | 2026-02-19 | 1,509 | -- |
| Train AI models with Unsloth and Hugging Face Jobs for FREE | ben burtenshaw, Daniel (Unsloth), Michael Han, Maxime Labonne, Daniel van Strien, and shaun smith | 2026-02-20 | 944 | -- |
| GGML and llama.cpp join HF to ensure the long-term progress of Local … | Georgi Gerganov, Xuan-Son Nguyen, Aleksander Grygier, Lysandre, Victor Mustar, and Julien Chaumond | 2026-02-20 | 936 | -- |
| Introducing Legal RAG Bench | Umar Butler and Abdur-Rahman Butler | 2026-02-20 | 3,235 | -- |
| FINAL Bench: The Real Bottleneck to AGI Is Self-Correction | VIDRAFT_LAB | 2026-02-21 | 1,146 | -- |
| How We Learned to Talk to Machines | Tyler Williams | 2026-02-20 | 1,156 | -- |
| Kimi K2.5: Still Worth It After Two Weeks? | Maxime Labonne | 2026-02-23 | 1,448 | -- |
| Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism? | VIDRAFT_LAB | 2026-02-24 | 2,770 | -- |
| Follow the White Rabbit: Using Embeddings So You Never Get Lost in … | David Corvoysier | 2026-02-23 | 1,420 | -- |
| MAEB: Evaluating Audio Embeddings at Scale | Adnan El Assadi, Solomatin Roman, Kenneth C. Enevoldsen, and Isaac Chung | 2026-02-24 | 1,349 | -- |
| A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and … | Karina Zadorozhny | 2026-01-19 | 7,738 | -- |
| Deploying Open Source Vision Language Models (VLM) on Jetson | Mitesh Patel, Johnny Nuñez Cano, and Raymond Lo | 2026-02-24 | 1,591 | -- |
| GEM Image: Building an AI That Actually Gets Educational Diagrams Right | AIPrep | 2026-02-21 | 966 | -- |
| Mixture of Experts (MoEs) in Transformers | Aritra Roy Gosthipaty, Pedro Cuenca, merve, Ilyas Moutawwakil, Arthur Zucker, Sergio Paniego, and Pablo Montalvo | 2026-02-26 | 2,054 | -- |
| Your MoE Model Does Not Have to Select Fixed Number of Experts | Tong Zhu, Xuyang Hu, Xiaoye Qu, Guanjie Chen, and Yu Cheng | 2026-02-26 | 4,405 | -- |
| Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? | Yichen Feng, Yuetai Li, Chunjiang Liu, Yue Huang, Zhengqing Yuan, Fengqing Jiang, Zichen Chen, and Zhangchen Xu | 2026-02-25 | 1,792 | -- |
| Bringing Autonomous Driving RL to OpenEnv and TRL | Sergio Paniego | 2026-02-26 | 1,814 | -- |
| A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3 | Quentin Macé, Gabriel de Souza Pereira Moreira, Antoine EDY, Radek Osmulski, and Bo Liu | 2026-02-27 | 1,886 | -- |
| Create, Evaluate, and Connect AI Skills \| SkillNet: A Large-Scale Agentic "Skill … | Yuan Liang, Ningyu Zhang, and Xu Ziwen | 2026-02-28 | 2,039 | -- |
| Build, Evaluate, and Connect AI Skills \| SkillNet: A Large-Scale Agentic “Skill Graph” Knowledge Base | Yuan Liang, Ningyu Zhang, and Xu Ziwen | 2026-02-28 | 370 | -- |
| Getting More from Your Test-Time Compute Budget with Portfolio Beam Search | Dan Elbaz, Oren Salzman, Oren Pereg, Daniel Korat, and Ronen Laperdon | 2026-02-24 | 3,527 | -- |
| easytranscriber: Speech Recognition with Accurate Timestamps in the HF Ecosystem | Faton Rekathati | 2026-03-03 | 1,169 | -- |
| The ML Engineer's Guide to Protein AI | Maziyar Panahi | 2026-03-03 | 3,612 | -- |
| PRX Part 3 — Training a Text-to-Image Model in 24h! | David Bertoin, Roman Frigg, and Jon Almazán | 2026-03-03 | 1,732 | -- |
| Introducing Kanon 2 Enricher — the world’s first hierarchical graphitization model | Umar Butler and Abdur-Rahman Butler | 2026-03-03 | 1,571 | -- |
| AI Coding Assistants Keep Shipping Vulnerable Code -- Here's What We're Doing … | Scott Thornton | 2026-02-26 | 371 | -- |
| LLM Architectures Explained: What Powers Today’s Top Models | Sara Han Díaz and Bertrand Charpentier | 2026-03-04 | 1,628 | -- |
| TiRex on the Edge | Robert Weber, Christian Ganhör, and Lukas Fischer | 2026-03-05 | 506 | -- |
| Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device … | Gaetan Bahl | 2026-03-05 | 1,851 | -- |
| Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines | YiYi Xu, Alvaro Somoza, Dhruv Nair, and Sayak Paul | 2026-03-05 | 1,907 | -- |
| NEO-unify: Building Native Multimodal Unified Models End to End | Haiwen Diao, Lewei Lu, and Ziwei Liu | 2026-03-05 | 623 | -- |
| Building Tucano 2: Open-Source Language Models That Actually Think in Portuguese | Nicholas Kluge Corrêa, Aniket Sen, Shiza Fatimah, Sophia Falk, and Lucie Flek | 2026-03-05 | 2,258 | -- |
| De-mystifying Multimodal Learning: The Hidden Inefficiency in Vision Language Modelling | Matteo Nulli | 2026-03-04 | 2,120 | -- |
| Konkani LLM: Bringing a Multi-Script Low-Resource Language to the AI Era | Reuben fernandes | 2026-03-07 | 861 | -- |
| Structural Problems in AI Benchmarking and the Case for a Unified Evaluation … | VIDRAFT_LAB | 2026-03-08 | 1,171 | -- |
| MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning | VIDRAFT_LAB | 2026-03-09 | 1,663 | -- |
| LeRobot v0.5.0: Scaling Every Dimension | Steven Palma, Pepijn Kooijmans, Jade Choghari, Caroline Pascal, Khalil Meftah, Martino Russi, Nicolas Rabault, Michel Aractingi, Virgile BATTO, and Thomas Wolf | 2026-03-09 | 1,931 | -- |
| Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge | George Saon and Madison Lee | 2026-03-09 | 385 | -- |
| Ulysses Sequence Parallelism: Training with Million-Token Contexts | Kashif Rasul and Stas Bekman | 2026-03-09 | 3,003 | -- |
| Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries | Amine Dirhoussi, Quentin Gallouédec, Kashif Rasul, Lewis Tunstall, Edward Beeching, Albert Villanova del Moral, Nouamane Tazi, and Leandro von Werra | 2026-03-10 | 9,358 | -- |
| Kanon 2 Reranker: the most powerful reranker for legal RAG | Umar Butler and Abdur-Rahman Butler | 2026-03-10 | 471 | -- |
| How NVIDIA Builds Open Data for AI | Will Jennings, Yev Meyer, Leanna Chraghchian, Rebecca Kao, Jane Polak Scowcroft, and Annie Surla | 2026-03-10 | 1,590 | -- |
| 🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language … | VIDRAFT_LAB | 2026-03-10 | 2,482 | -- |
| Introducing Storage Buckets on the Hugging Face Hub | Lucain Pouget, Eliott Coyac, Adrien Carreira, Victor Mustar, Julien Chaumond, Quentin Lhoest, Pierric Cistac, Sylvestre Bcht, Hugo Larcher, Rajat Arya, Di Xiao, and Assaf Vayner | 2026-03-10 | 1,591 | -- |
| ShopRLVE-GYM: Adaptive Verifiable Environments for E-Commerce Conversational Agents | Rahul Bajaj and Jaya Nupur | 2026-03-08 | 4,976 | -- |
| Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds | Joseph Jennings and Brandon Norick | 2026-03-11 | 710 | -- |
| Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens | Asankhaya Sharma | 2026-03-06 | 4,656 | -- |
| How NVIDIA AI-Q Reached #1 on DeepResearch Bench I and II | David Austin | 2026-03-12 | 1,749 | -- |
| Build an Agent That Thinks Like a Data Scientist: How We Hit … | Jiwei Liu, Maximilian Jeblick, and Jack Yu | 2026-03-13 | 2,052 | -- |
| Arabic TTS Arena: Ranking Voice Models the Way Chess Ranks Grandmasters | Mohamed Rashad | 2026-03-12 | 1,698 | -- |
| Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline | Radek Osmulski, Reza Esfandiarpoor, Yauhen Babakhin, Gabriel de Souza Pereira Moreira, and Bo Liu | 2026-03-13 | 1,520 | -- |
| Pruna 0.3.2: More OSS Algos, More Ways to Optimize | Minette Kaunismäki, Begüm Çığ, Gaspar Rochette, Sara Han Díaz, and Bertrand Charpentier | 2026-03-11 | 922 | -- |
| SILMA TTS: A Lightweight Open Bilingual Text to Speech Model | Karim Ouda | 2026-03-15 | 524 | -- |
| The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare … | Sean Huver, Nigel Nelson, Lukas Zbinden, and Mostafa Toloui | 2026-03-16 | 865 | -- |
| Tokenization is Killing our Multilingual LLM Dream | Omar Kamali | 2026-03-15 | 3,383 | -- |
| Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, … | Marco Pavone | 2026-03-16 | 1,259 | -- |
| Holotron-12B - High Throughput Computer Use Agent | Pierre-Louis Cedoz, Hamza Benchekroun, Aurélien Lac, delfosse, Tony Wu, Mats L. Richter, Antoine Bonnet, Kai Yuan, Aleix Cambray (H-AI), and Alexandra | 2026-03-17 | 868 | -- |
| Super Analyzer: Combining Reasoning and Coding Capabilities to Improve Code Performance | Girish Ganesan and Balachandran Rajendran | 2026-03-13 | 1,363 | -- |
| LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric | Subash SN, Akshay Nambiar, Milan Gritta, Zhen Cong Chen, Arsalan Anwari, Gianfranco Cordella, and Amril Nurman | 2026-03-17 | 3,124 | -- |
| Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI | Vinay Raman, Ameya Sunil Mahabaleshwarkar, Hayley Ross, Bilal Kartal, Aditya Malte, Zijia Chen, Ali Taghibakhshi, Sharath Turuvekere Sreenivas, Saurav Muralidharan, Khalil Ben Khaled, Nima Tajbakhsh, Pavlo Molchanov, Oluwatobi Olabiyi, and Yoshi Suhara | 2026-03-17 | 1,552 | -- |
| State of Open Source on Hugging Face: Spring 2026 | Avijit Ghosh, Lucie-Aimée Kaffee, Yacine Jernite, and Irene Solaiman | 2026-03-17 | 2,883 | -- |
| Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding | Talor Abramovich, Maor Ashkenazi, Izzy Putterman, Benjamin Chislett, Tiyasa Mitra, Bita Rouhani, Ran Zilberstein, and Yonatan Geifman | 2026-03-19 | 2,333 | -- |
| ATE-2: State-of-the-Art Armenian Text Embeddings and the ArmBench-TextEmbed Benchmark | Hrant Davtyan, Zaruhi Navasardyan, Spartak Bughdaryan, and bag_min | 2026-03-19 | 438 | -- |
| What's New in Mellea 0.4.0 + Granite Libraries Release | Abraham Daniels | 2026-03-20 | 469 | -- |
| Build a Domain-Specific Embedding Model in Under a Day | Steve H, Rucha Apte, Sean Sodha, and Oliver Holworthy | 2026-03-20 | 2,729 | -- |
| Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic … | Yunus Cukran | 2026-03-21 | 986 | -- |
| NanoVDR: A 70M Text-Only Model That Retrieves Visual Documents as Well as … | Zhuchenyang Liu | 2026-03-16 | 1,493 | -- |
| Pocket Models for iOS: Explore On-Device AI with GGUF Models, Data Memory, … | Hamit Hasanhocaoglu, Arda Dogantemur, Metecan Duyal, and StJohn Deakins | 2026-03-18 | 1,270 | -- |
| Introducing AI chunking to semchunk | Umar Butler and Abdur-Rahman Butler | 2026-03-23 | 2,228 | -- |
| Canada Must Not Turn AI Chatbots Into a New Surveillance Frontier | Noah Weinberger | 2026-03-16 | 1,934 | -- |
| A New Framework for Evaluating Voice Agents (EVA) | Tara Bogavelli, Gabrielle Gauthier Melancon, Katrina Stankiewicz, Nifemi Bamgbose, Hoang Nguyen, Raghav Mehndiratta, Hari Subramani, and Fanny Riols | 2026-03-24 | 2,147 | -- |
| SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation | Maziyar Panahi, merve, Jamie@Doubleword, Josh, Seb Ringrose, and Fergus Finn | 2026-03-23 | 3,730 | -- |
| Introducing Cohere-transcribe: state-of-the-art speech recognition | Julian Mack, Ekagra Ranjan, Walter Beller-Morales, Bharat venkitesh, and Pierre Richemond | 2026-03-26 | 1,485 | -- |
| Liberate your OpenClaw 🦀 | Clem 🤗, ben burtenshaw, Pedro Cuenca, Jeff Boudier, merve, Niels Rogge, Victor Mustar, and Mishig Davaadorj | 2026-03-27 | 593 | -- |
| White Hat Security Agent Prompts 600K Dataset by Yatin Taneja | Yatin Taneja | 2026-03-23 | 1,181 | -- |
| Letter of Superintelligence ~ Yatin Taneja | Yatin Taneja | 2026-03-23 | 1,031 | -- |
| ORBA: Orthogonal Reflection Bounded Ablation — A Geometrically Exact Detour in Directional … | Jim Lai | 2026-03-25 | 5,092 | -- |
| Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models | VIDRAFT_LAB | 2026-03-29 | 1,563 | -- |
| How I contributed a new model to the Transformers library using Codex | Niels Rogge | 2026-03-30 | 2,696 | -- |
| Training mRNA Language Models Across 25 Species for $165 | Maziyar Panahi | 2026-03-31 | 6,915 | -- |
| TRL v1.0: Post-Training Library Built to Move with the Field | Quentin Gallouédec, Steven Liu, Pedro Cuenca, and Sergio Paniego | 2026-03-31 | 3,093 | -- |
| Falcon Perception | wamiq para and FalconPerception | 2026-04-01 | 2,955 | -- |
| Using Storage Buckets as a Working Layer for Data Pipelines | Daniel van Strien | 2026-03-26 | 1,095 | -- |
| Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents | Madison Lee, Rogerio Feris, Eli Schwartz, Dhiraj Joshi, Pengyuan Li, and Isaac Sanchez | 2026-03-31 | 1,316 | -- |
| The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge | VIDRAFT_LAB | 2026-03-31 | 2,884 | -- |
| 🌈 SKT AI LABS 🌈 | ѕкт αι ℓαвѕ | 2026-03-30 | 555 | -- |
| Holo3: Breaking the Computer Use Frontier | Ramzi De Coster, Pierre-Louis Cedoz, Tony Wu, Hamza Benchekroun, mandreux-hai, delfosse, Aurélien Lac, maxime, Axel Moyal, Antonio Loison, Kai Yuan, and Ronan Riochet | 2026-04-01 | 813 | -- |
| Run Gemma 4 on Intel® Arc™ GPUs Out-Of-the-Box | Matrix Yao, Chendi Xue, FanZhao, Xinyu Chen, Alex Gu, Wuxun Zhang, Xinyi Li, jianan, Yi Wang, and Yintong Lu | 2026-04-01 | 1,495 | -- |
| Welcome Gemma 4: Frontier multimodal intelligence on device | merve, Pedro Cuenca, Sergio Paniego, ben burtenshaw, Steven Zheng, Alvaro Bartolome, and Nathan Habib | 2026-04-02 | 6,003 | -- |
| ArmBench-LLM 1.0: Benchmarking LLMs on Armenian Language Tasks | Hrant Davtyan, Zaruhi Navasardyan, Spartak Bughdaryan, and bag_min | 2026-04-02 | 1,205 | -- |
| YC-Bench: Can Your AI Agent Run a Startup Without Going Bankrupt? | Adit, Riddle He, Vincent Tu, Anand Kumar, and Nazneen Rajani | 2026-04-02 | 169 | -- |
| Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their … | Gustavo A Lujan and kedar kolluri | 2026-04-03 | 2,730 | -- |
| Run Gemma 4 on Intel® Xeon® Out-Of-the-Box | Jiang Li, Xinyu Chen, Chendi Xue, FanZhao, Yi Wang, Wuxun Zhang, Alex Gu, Xinyi Li, jianan, Yintong Lu, and Matrix Yao | 2026-04-01 | 1,464 | -- |
| gradio.Server: Any Custom Frontend with Gradio's Backend | yuvraj sharma and Abubakar Abid | 2026-04-01 | 1,160 | -- |
| From doctest to runnable Markdown | Tarek Ziadé | 2026-04-04 | 1,460 | -- |
| Darwin V6: Diagnostic-Guided Evolutionary Model Merging | VIDRAFT_LAB | 2026-04-08 | 1,003 | -- |
| How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs | Niels Rogge | 2026-04-07 | 1,246 | -- |
| BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders | Nicolas-BZRD and Théo Deschamps-Berger | 2026-04-07 | 1,772 | -- |
| Safetensors is Joining the PyTorch Foundation | Luc Georges and Lysandre | 2026-04-08 | 807 | -- |
| ALTK‑Evolve: On‑the‑Job Learning for AI Agents | Vatche Isahagian, Vinod Muthusamy, Jayaram Radhakrishnan, Gaodan Fang, Punleuk Oum, and G Thomas | 2026-04-08 | 1,180 | -- |
| Building Harvey-style tabular review from scratch, but better | Abdur-Rahman Butler | 2026-04-09 | 4,508 | -- |
| Multimodal Embedding & Reranker Models with Sentence Transformers | Tom Aarsen | 2026-04-09 | 2,886 | -- |
| Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs | Andrew Lapp, Louis Castricato, Scott Fox, Shahbuland Matiana, and David Rossi | 2026-04-09 | 857 | -- |
| Using OCR models with llama.cpp | Xuan-Son Nguyen | 2026-04-10 | 816 | -- |
| Darwin-27B-Opus: Surpassing the Foundation Model Without Training | VIDRAFT_LAB | 2026-04-13 | 1,806 | -- |
| Releasing LiteCoder-Terminal-SFT | LiteCoder | 2026-04-13 | 833 | -- |
| When Speech AI Meets the Long Tail of Languages: Inside the VAANI … | Sujith Pulikodan, Sanka, Nihar Desai, Suryansh Shukla, and Prasanta Kumar Ghosh | 2026-04-14 | 901 | -- |
| Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — … | VIDRAFT_LAB | 2026-04-15 | 1,224 | -- |
| Meet HoloTab by HCompany. Your AI browser companion. | Marc Thibault, Pierre-Louis Cedoz, Hamza Benchekroun, Kai Yuan, Aurélien Lac, Tony Wu, Antonio Loison, Axel Moyal, and Emrick Sinitambirivoutin | 2026-04-15 | 516 | -- |
| Stop benchmarking inference providers | Nathan Habib | 2026-04-14 | 815 | -- |
| Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts | Nucleus AI | 2026-04-14 | 1,546 | -- |
| Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents | Ankita Naik, danish, Ben, Anupama Murthi, and Praveen | 2026-04-15 | 3,111 | -- |
| The PR you would have opened yourself | Pedro Cuenca and Awni Hannun | 2026-04-16 | 2,504 | -- |
| easyaligner: Forced alignment of text and audio, made easy | Faton Rekathati | 2026-04-16 | 1,591 | -- |
| Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers | Tom Aarsen | 2026-04-16 | 3,791 | -- |
| Building a Fast Multilingual OCR Model with Synthetic Data | Ryan Chesler | 2026-04-17 | 2,218 | -- |
| Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents | Rahul Bajaj, Jaya Nupur, Anuj Garg, and ben burtenshaw | 2026-04-16 | 2,563 | -- |
| NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots | Edith Llontop and Kalyan Vadrevu | 2026-04-17 | 797 | -- |
| Vessel Browser: The Open Source Browser Designed for Autonomous Agents | Tyler Williams | 2026-04-17 | 845 | -- |
| QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard | Leen AlQadi, Ahmed Alzubaidi, Mohammed Alyafeai, Maitha Alhammadi, Shaikha Alsuwaidi, Omar saif alkaabi, Basma Boussaha, and Hakim Hacid | 2026-04-21 | 1,731 | -- |
| How to Ground a Korean AI Agent in Real Demographics with Synthetic … | Will Jennings, Hyunwoo Kim, Jinho Lee, jihyeonRyu, Kiran Praveen, Yev Meyer, Kirit Thadaka, and Shyamala Prayaga | 2026-04-21 | 1,502 | -- |
| Save the traces! 🐳 | Pedro Cuenca | 2026-04-21 | 461 | -- |
| Multilingual Tool Calling in 70+ Languages, On Device | Bronson, Kato Steven Mubiru, Gimei Alex, OJ Onyeagwu, and Adnan El Assadi | 2026-04-20 | 1,636 | -- |
| DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models | Raphael Sourty, Antoine Chaffin, Paulo Moura, and Amélie Chatelain | 2026-04-21 | 5,774 | -- |
| AI and the Future of Cybersecurity: Why Openness Matters | Margaret Mitchell, Yacine Jernite, and Clem 🤗 | 2026-04-21 | 1,245 | -- |
| Introducing the Bright Data CLI for Automated Web Data Pipelines | Bright Data | 2026-04-20 | 1,786 | -- |
| mlinter: a linter for Transformers modeling files | Tarek Ziadé | 2026-04-22 | 1,827 | -- |
| Gemma 4 VLA Demo on Jetson Orin Nano Super | Asier Arranz | 2026-04-22 | 1,575 | -- |
| ML Intern Takes Our Post-Training Internship Test | Carlos Miguel Patiño, Aksel Joonas Reedi, and Lewis Tunstall | 2026-04-23 | 924 | -- |
| Hy3 preview: A Rebuilt Hunyuan, a 21B-Active MoE, and a New Reasoning … | Leco Li | 2026-04-23 | 1,035 | -- |
| How to Use Transformers.js in a Chrome Extension | Nico Martin | 2026-04-23 | 1,774 | -- |
| RL: A Structured Human Action & Intent Dataset for Physical AI and … | Gowtham and Marc Hebert | 2026-04-21 | 2,351 | -- |
| DeepSeek-V4: a million-token context that agents can actually use | ben burtenshaw | 2026-04-24 | 1,488 | -- |
| Building long-horizon SWE environments on Hugging Face: Frontier SWE × OpenEnv | swappy and Sourasish Basu | 2026-04-26 | 1,224 | -- |
| How to build scalable web apps with OpenAI's Privacy Filter | yuvraj sharma, Freddy Boulton, and Abubakar Abid | 2026-04-27 | 1,641 | -- |
| OpenRA-RL: An Open Platform for AI Agents in Real-Time Strategy Games | Xiaochuang Yuan, huixu, Yiyu Tian, momo, Ruiyue Wang, and Kaiser Sun | 2026-04-27 | 3,015 | -- |
| Adaptive Ultrasound Imaging with Physics-Informed NV-Raw2Insights-US AI | Walter Simson, Jay Carlson, Tom Lassiter, Kevin Woo, and Sean Huver | 2026-04-28 | 929 | -- |
| Running AI agents to automate outreach at scale | Niels Rogge | 2026-04-27 | 2,296 | -- |
| Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio … | Tuomas Rintamaki, Amala Sanjay Deshmukh, Nabin Mulepati, Collin McCarthy, Pritam Biswas, Arushi Goel, Leili Tavabi, Alexandre Milesi, Danial Mohseni Taheri, Kateryna Chumachenko, Isabel Hulseman, Zhehuai Chen, Karan, and Tao | 2026-04-28 | 3,186 | -- |
| BiomedBERT Small: Medical models at 22.7M parameters | David Mezzetti | 2026-04-28 | 912 | -- |
| AI evals are becoming the new compute bottleneck | Avijit Ghosh, Yifan Mai, Georgia Channing, and Leshem Choshen | 2026-04-29 | 3,881 | -- |
| Pallas for people who know JAX but not kernels yet | Aritra Roy Gosthipaty | 2026-04-29 | 1,581 | -- |
| DeepInfra on Hugging Face Inference Providers 🔥 | Aray Sultanbekova, Shang-Pin, Utemuratov, Yessen K, Oguz Vuruskaner, Célina Hanouti, Simon Brandeis, and Lucain Pouget | 2026-04-29 | 878 | -- |
| Granite 4.1 LLMs: How They’re Built | Yousaf Shah | 2026-04-29 | 2,848 | -- |
| The MCP Era Feels Like Déjà Vu | Mohamed Rashad and Hessah Alharbi | 2026-04-29 | 2,023 | -- |
| Training low-bit ternary models with Axolotl | wing lian | 2026-04-30 | 1,151 | -- |
| Build a legal RAG app that won't be held in contempt | Tabs | 2026-05-05 | 3,115 | -- |
| Adding Benchmaxxer Repellant to the Open ASR Leaderboard | Eric Bezzam, Steven Zheng, Eustache Le Bihan, Sergio Bruccoleri, Jeanine Sinanan-Singh, Casey Ford, Guanbo Wang, Yukai Huang, Ke Li, Yufeng Hao, and Liao Xiaoling | 2026-05-06 | 1,400 | -- |
| Learning Maths for the Last Time | Shane, Lane Fiedler, Enderchef (Enderchefcoder), LH-Tech AI, Arman Rafiee, poe, and AxionLab | 2026-05-06 | 1,325 | -- |
| Introducing the agentic robotics appstore for 10,000 Reachy Minis | Clem 🤗 | 2026-05-06 | 1,207 | -- |
| vLLM V0 to V1: Correctness Before Corrections in RL | Rafael Pardinas and Ehsan Kamalloo | 2026-05-06 | 1,579 | -- |
| 🧠 I trained my own French LLM from scratch — alone, with … | vloplok | 2026-05-05 | 2,017 | -- |
| QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices | Mathias Buus, Davide Vitabile, Alex Buffa, Akshay Nambiar, and Amril Nurman | 2026-05-07 | 9,495 | -- |
| Improving Depth Anything V2 Robustness to Video Compression | Ethan F and Ronen Nissim | 2026-05-07 | 3,407 | -- |
| MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required | Harikrishna | 2026-05-08 | 1,520 | -- |
| EMO: Pretraining mixture of experts for emergent modularity | Kyle Wiggers and Ryan Wang | 2026-05-08 | 1,830 | -- |
| CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models | Samuel | 2026-05-08 | 1,783 | -- |
| OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support | Máximo López Chenlo | 2026-05-09 | 2,938 | -- |
| Building Blocks for Foundation Model Training and Inference on AWS | Keita Watanabe, Pavel Belevich, and Aman Shanbhag | 2026-05-11 | 4,362 | -- |
| Two Years of Local AI on a Laptop: When Open Models Outpaced … | Mishig Davaadorj | 2026-05-11 | 1,653 | -- |
| Hugging Face on JFrog Artifactory: An Enterprise Guide (and What Changes in … | Jeff Boudier | 2026-05-08 | 5,080 | -- |
| Safety Evals Should Project Test-Time Compute | Tommaso Cerruti | 2026-05-11 | 2,521 | -- |
| You do the work. Big Tech takes the model. | Urro | 2026-05-11 | 3,960 | -- |
| Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier … | VIDRAFT_LAB | 2026-05-15 | 882 | -- |
| Unlocking asynchronicity in continuous batching | Rémi Ouazan Reboul, Pedro Cuenca, and Aritra Roy Gosthipaty | 2026-05-14 | 4,015 | -- |
| Self Evolving is the Endgame or final destiny | Rajkumar rawal | 2026-05-12 | 683 | -- |
| How to Comply with SOC 2 and ISO 27001 with Hugging Face: … | Jeff Boudier | 2026-05-14 | 3,007 | -- |
| Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages | Kavya Manohar, Kush Juvekar, and Kumarmanas Nethil | 2026-05-15 | 3,877 | -- |
| Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context … | Radu Florian, Parul Awasthy, Aashka Trivedi, and Madison Lee | 2026-05-14 | 3,411 | -- |
| PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend | AlexZhang, cuicheng, Jun Zhang, and Manhui Lin | 2026-05-18 | 927 | -- |
| Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation | Ting-Yun Chang, Miguel Martin, Jonathan Allen, Ke Ding, and Pooya Jannaty | 2026-05-18 | 2,653 | -- |
| The Open Agent Leaderboard | Elron Bandel | 2026-05-18 | 1,703 | -- |
| OlmoEarth v1.1: A more efficient family of models | Kyle Wiggers | 2026-05-19 | 898 | -- |
| Introducing the Ettin Reranker Family | Tom Aarsen | 2026-05-19 | 5,698 | -- |
| Software Forgets: Agent Traces Are the Memory | Caleb Fahlgren | 2026-05-19 | 604 | -- |
| Talking to a 4-Year-Old: A Multilingual Benchmark for Children's AI Companions | Batuhan Aktas, Yuvraj, and fatih bugra akdogan | 2026-05-03 | 4,557 | -- |
| Vocabulary-Augmented Prompting for Sango — Production African Language AI Without a Parallel … | MICWEN | 2026-05-13 | 3,112 | -- |
| LeRobot Humanoid: An Open, Low-Cost, 3D-Printed Humanoid for Robot Learning | Virgile BATTO, Caroline Pascal, Steven Palma, Maxime Ellerbach, Nicolas Rabault, Martino Russi, and haixuan tao | 2026-05-21 | 1,550 | -- |
| Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models | Mehran Maghoumi, Yonggan Fu, Pavlo Molchanov, and Khadkevich | 2026-05-23 | 1,167 | -- |
| An experiment with attention. | poe, Lane Fiedler, Shane, and Enderchef (Enderchefcoder) | 2026-05-23 | 1,061 | -- |
| Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook | Erick Lachmann and Pimenta de Freitas Cardoso | 2026-05-22 | 2,753 | -- |
| Why Open Models Are the Only Sustainable Way to Teach AI | Pénélope Gittos | 2026-05-22 | 1,325 | -- |
| Harness, Scaffold, and the AI Agent Terms Worth Getting Right | Sergio Paniego and Aritra Roy Gosthipaty | 2026-05-25 | 2,117 | -- |
| Relaunching PapersWithCode with new features | Niels Rogge | 2026-05-24 | 498 | -- |
| Borealis — open data, code, weights recipe for training Audio LLM | Wortega | 2026-05-25 | 2,303 | -- |
| Eight Days in China: What I Learned from the AI Labs, Robotics … | Matt White | 2026-05-22 | 12,170 | -- |
| SANA-WM Bidirectional on Apple Silicon | Arjun Reddy | 2026-05-20 | 1,105 | -- |
| Should we use genetics instead of system prompts for AI Agents & … | Fyx | 2026-05-25 | 2,550 | -- |
| ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic … | Ayhan Sebin, Saurabh Jha, and Rohan Arora | 2026-05-27 | 889 | -- |
| Give your agents ZeroGPU to ship viral AI apps autonomously | Victor Mustar | 2026-05-26 | 941 | -- |
| Reachy Mini goes fully local | Amir Mahla and Andres Marafioti | 2026-05-27 | 1,849 | -- |
| Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in … | Amine Dirhoussi, Quentin Gallouédec, Kashif Rasul, Lewis Tunstall, Edward Beeching, Albert Villanova del Moral, and Leandro von Werra | 2026-05-27 | 4,227 | -- |
| Introduction to Trimming ✂ | Loïck BOURDOIS, Tom Aarsen, Bram Vanroy, Woojun Jung, Manuel Romero, and Prithiv Sakthi | 2026-05-28 | 19,577 | -- |
| MONET: Lowering the bar for World-Class Image Generation research. | Benjamin Aubin, Gonzalo Quintana, Onur, sanjeev sreetharan, Czerwinska, Damien Henry, and Clément Chadebec | 2026-05-28 | 1,601 | -- |
| Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler | Aritra Roy Gosthipaty, Sayak Paul, Sergio Paniego, Rémi Ouazan Reboul, and Pedro Cuenca | 2026-05-29 | 5,132 | -- |
| Dell Enterprise Hub at Dell Tech World 2026: new models, new platforms, … | Simon Pagezy, Enrique Hernández Calabrés, Juan Julián, Bagus Hanindhito, Girish Ganesan, ravikumar, Ian Roche, Jeff Boudier, and Balachandran Rajendran | 2026-05-29 | 1,112 | -- |
| Server is at capacity | specimba, Lewis Tunstall, and Aksel Joonas Reedi | 2026-05-27 | 266 | -- |
| ClawHub Security Signals: Large Corpus Multi-Scanner Dataset for Agent Skill Security Research | Vincent Koc, Patrick Erichsen, Jacob Tomlinson, Agustin Rivera, Mike Appel, and Nir Paz | 2026-06-01 | 1,400 | -- |
| Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning … | Asawaree and Atharva Joshi | 2026-06-01 | 1,960 | -- |
| Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic | Nicholas Fuller | 2026-06-01 | 2,177 | -- |
| Agentic RL: Token-In, Token-Out Done Right | Quentin Gallouédec and Kashif Rasul | 2026-05-29 | 3,670 | -- |
| MiniMax Goes Sparse: Decoding M3's Attention from a Single Diagram | Atlas Cloud | 2026-05-29 | 1,680 | -- |
| A Deep Neural Network that turns Any Image into a Playable Game! … | Abhishek Sensharma | 2026-06-01 | 365 | -- |
| Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains | Nikita Pavlichenko | 2026-06-01 | 600 | -- |
| Holo3.1: Fast & Local Computer Use Agents | Maxime Langevin, Hamza Benchekroun, Axel Moyal, Emrick Sinitambirivoutin, Antonio Loison, Avshalom Manevich, Tony Wu, Pierre-Louis Cedoz, Aurélien Lac, and Ronan Riochet | 2026-06-02 | 867 | -- |
| Taking Alpamayo to New Heights with Driving Foundation Models and Closed-Loop Training | Marco Pavone and Boris Ivanovic | 2026-06-01 | 1,386 | -- |