HuggingFace Blog
196 posts indexed since 2026
Post Details
| Title | Author | Published | Words | HN Pts |
|---|---|---|---|---|
| Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B … | weitaofeng | 2026-01-01 | 1,778 | -- |
| The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on … | Yağız Çalık | 2026-01-02 | 5,072 | -- |
| Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture | Basma Boussaha, Mohammed Alyafeai, Ahmed Alzubaidi, Leen AlQadi, Shaikha Alsuwaidi, Omar saif alkaabi, Hamza Alobeidli, and Hakim Hacid | 2026-01-05 | 1,838 | -- |
| TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell | Konstantin | 2026-01-05 | 3,309 | -- |
| Introducing Falcon H1R 7B | Iheb Chaabane, Puneesh Khanna, Suhail M Shah, Slim Frikha, Shi Hu, Abdalgader Abubaker, Reda alami, Mike Lubinets, Mohamed El Amine Seddik, and Hakim Hacid | 2026-01-05 | 1,332 | -- |
| Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem | Marco Pavone | 2026-01-05 | 893 | -- |
| Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models | Ashish Chadha | 2026-01-03 | 2,023 | -- |
| NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI | Tsung-Yi Lin and Debraj Sinha | 2026-01-05 | 1,037 | -- |
| Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR | Kunal Dhawan, Adi- margolin, Gordana Neskovic, Maryam Motamedi, and Yasmina Benkhoui | 2026-01-05 | 1,860 | -- |
| NVIDIA brings agents to life with DGX Spark and Reachy Mini | Jeff Boudier, Nader Khalil, and Alec Fong | 2026-01-05 | 2,128 | -- |
| M2.1: Multilingual and Multi-Task Coding with Strong Generalization | MiniMax | 2026-01-05 | 2,306 | -- |
| Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot | Raffaello Bonghi, lior ben horin, Kartik S, and Kalyan Vadrevu | 2026-01-05 | 1,038 | -- |
| Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval … | Ronay Ak, Gabriel de Souza Pereira Moreira, and Bo Liu | 2026-01-06 | 1,492 | -- |
| OpenMed: Six Months of Open-Source Medical AI and the Road Ahead | Maziyar Panahi | 2026-01-06 | 2,424 | -- |
| Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads | MiniMax | 2026-01-06 | 736 | -- |
| Diversity Vs Density: A data strategy comparison for fine-tuning VLMs | Akhil Theerthala | 2026-01-06 | 2,301 | -- |
| 🥃 Distilling Tiny Embeddings | David Mezzetti | 2026-01-10 | 1,082 | -- |
| Introducing OptiMind, a research model designed for optimization | Anson Ho, Sirui Li, and Ishai Menache | 2026-01-15 | 395 | -- |
| How We Built a Semantic Highlight Model To Save Token Cost for … | Cheney Zhang and Jiang Chen | 2026-01-15 | 2,344 | -- |
| Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments | Bingyang Ye and Shan Chen | 2026-01-13 | 2,717 | -- |
| Open Responses: What you need to know | shaun smith, ben burtenshaw, merve, and Pedro Cuenca | 2026-01-15 | 1,344 | -- |
| Beyond Brute Force: Why LoongFlow is the “Thinking” Evolution of OpenEvolve | Xunan Dai | 2026-01-16 | 1,108 | -- |
| SmolLM-Smashed: Tiny Giants, Optimized for Speed | David Berenstein | 2026-01-13 | 982 | -- |
| VLM-OCR Recipes on GPU Infrastructure | Florent Gbelidji | 2026-01-15 | 2,281 | -- |
| Reviewer Two (but it's an OpenEnv) | Chris von Csefalvay | 2026-01-13 | 1,653 | -- |
| Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments | ben burtenshaw | 2026-01-20 | 1,158 | -- |
| LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family | Said Taghadouini, Adrien Cavaillès, and Baptiste Aubertin | 2026-01-19 | 934 | -- |
| Differential Transformer V2 | Li Dong | 2026-01-20 | 3,136 | -- |
| 🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models | Fanny Jourdan and Antonin Poché | 2026-01-20 | 2,112 | -- |
| New in llama.cpp: Anthropic Messages API | Xuan-Son Nguyen and Victor Mustar | 2026-01-19 | 541 | -- |
| One Year Since the “DeepSeek Moment” | Adina Yakefu and Irene Solaiman | 2026-01-20 | 1,617 | -- |
| Optimizing GLM4-MoE for Production: 65% Faster TTFT with SGLang | Novita AI | 2026-01-22 | 1,047 | -- |
| Security, Governance and Performance for Dell On-Prem AI Builders | Balachandran Rajendran, Juan Julián, Alvaro Bartolome, Enrique Hernández Calabrés, Simon Pagezy, and Jeff Boudier | 2026-01-21 | 1,064 | -- |
| RexRerankers: SOTA Rankers for Product Discovery and AI Assistants | Rahul Bajaj, Anuj Garg, and Jaya Nupur | 2026-01-24 | 3,704 | -- |
| Challenges of Synthetic Dataset Generation | Rishiraj Acharya | 2026-01-21 | 942 | -- |
| Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models | Asankhaya Sharma | 2026-01-23 | 1,825 | -- |
| AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality | Dhaval Patel, James Rayfield, Saumya Ahuja, Chathurangi Shyalika, Shuxin Lin, and Zhou | 2026-01-21 | 1,505 | -- |
| “DeepSeek R1 时刻” 一周年 | vansin | 2026-01-20 | 315 | -- |
| Benchmark Smarter: Tailor Your Model Evaluation Suite with EvalScope | kelseye.xh | 2026-01-22 | 1,973 | -- |
| Waypoint-1: Real-time Interactive Video Diffusion from Overworld | Andrew Lapp, Louis Castricato, Scott Fox, Shahbuland Matiana, and David Rossi | 2026-01-20 | 853 | -- |
| Why Your AI Strategy Needs Hugging Face Storage | Adrian Lepers | 2026-01-26 | 1,008 | -- |
| NVIDIA Earth-2 Open Models Span the Whole Weather Stack | Mike Pritchard, Jaideep Pathak, Jean Kossaifi, and Aayush Gupta | 2026-01-26 | 736 | -- |
| Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs | Omar saif alkaabi, Ahmed Alzubaidi, Hamza Alobeidli, Shaikha Alsuwaidi, Mohammed Alyafeai, Leen AlQadi, Basma Boussaha, and Hakim Hacid | 2026-01-27 | 1,585 | -- |
| Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective | Jason Zhu, Hejian Sang, Arup De, Rohit Jain, and Yanning Chen | 2026-01-27 | 4,160 | -- |
| Friends and Grandmothers in Silico | Itay Yona | 2026-01-24 | 4,089 | -- |
| Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek | Adina Yakefu and Irene Solaiman | 2026-01-27 | 1,324 | -- |
| Nemotron-Personas-Brazil: Co-Designed Data for Sovereign AI | Andre Manoel, Yev Meyer, Shyamala Prayaga, Will Jennings, and bardiya sadeghi | 2026-01-28 | 903 | -- |
| The Great Classification Showdown: OSS vs BERT on Consumer Hardware | Ben Toussaint | 2026-01-26 | 1,938 | -- |
| We got Claude to teach open models how to write CUDA kernels! | ben burtenshaw, shaun smith, merve, and Pedro Cuenca | 2026-01-28 | 2,350 | -- |
| Slashing torch.compile Warmup & LoRA Swapping Times with Pruna | John Rachwan, Johanna Sommer, Bertrand Charpentier, and Sara Han Díaz | 2026-01-28 | 1,513 | -- |
| Nemotron-Personas-Singapore: Co-Designed Data for Sovereign AI | Will Jennings, Dane Corneil, Yev Meyer, Verdi March, Shyamala Prayaga, and bardiya sadeghi | 2026-01-27 | 1,041 | -- |
| TruthTensor: LLM Evalution in Prediction Markets Under Drift and Market Baseline | Elena Pashkova, shirin Shahabi, Hudson, and Ronald Chan | 2026-01-29 | 1,631 | -- |
| Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp | Doctor Shotgun and Geechan | 2026-01-30 | 2,508 | -- |
| Introducing NVIDIA Cosmos Policy for Advanced Robot Control | Pranjali Joshi, Tsung-Yi Lin, Jinwei Gu, and Prachi Mishra | 2026-01-29 | 1,333 | -- |
| Introducing Daggr: Chain apps programmatically, inspect visually | merve, yuvraj sharma, Abubakar Abid, hysts, and Pedro Cuenca | 2026-01-29 | 1,559 | -- |
| Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 … | Alvaro Moran | 2026-02-02 | 2,906 | -- |
| Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance | Jun Zhang, Jason Zheng, Boxi Cao, and ReasoningLens | 2026-02-03 | 693 | -- |
| Training Design for Text-to-Image Models: Lessons from Ablations | David Bertoin, Roman Frigg, and Jon Almazán | 2026-02-03 | 7,420 | -- |
| The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ | Adina Yakefu and Irene Solaiman | 2026-02-03 | 1,602 | -- |
| H Company's new Holo2 model takes the lead in UI Localization | Ramzi De Coster, Hamza Benchekroun, and Aurélien Lac | 2026-02-03 | 214 | -- |
| Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s … | Ronay Ak and Gabriel de Souza Pereira Moreira | 2026-02-04 | 1,048 | -- |
| Nvidia Agentic Smart Router on Dell Enterprise Hub : Deepdive on Architecture,Design … | Khushboo Rathi and Balachandran Rajendran | 2026-02-03 | 995 | -- |
| CRAFT: Continuous Reasoning and Agentic Feedback Tuning | Valentin, Denis Timonin, Alexandr, and Alexey | 2026-02-05 | 813 | -- |
| Introducing SyGra Studio | Surajit Dasgupta, Bidyapati Pradhan, Amit Kumar Saha, Vipul Mittal, and Sriram Puttagunta | 2026-02-05 | 747 | -- |
| 🚀 SyGra V2.0.0 | Sriram Puttagunta, Surajit Dasgupta, Bidyapati Pradhan, Amit Kumar Saha, and Vipul Mittal | 2026-02-05 | 724 | -- |
| From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails … | Maziyar Panahi | 2026-02-07 | 5,766 | -- |
| Transformers.js v4 Preview: Now Available on NPM! | Joshua and Nico Martin | 2026-02-09 | 1,185 | -- |
| Training Qwen3 VL to label bbox : synthetic data, environment and training … | Ulrick BLE | 2026-02-09 | 2,544 | -- |
| 🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs | Guanchu | 2026-02-11 | 616 | -- |
| Building a Mood-Based Movie Recommendation Engine with Voyage-4-nano, Hugging Face, and MongoDB … | Arkadiusz Borucki | 2026-02-08 | 3,315 | -- |
| Enabling Large Scale RLHF of GPTOSS with Megatron backend in VeRL | LEI WANG | 2026-02-10 | 5,934 | -- |
| OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments | Christian Washington, Ankit Jasuja, Santosh Sah, Lewis Tunstall, and ben burtenshaw | 2026-02-12 | 1,656 | -- |
| LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search … | Antoine Chaffin and Raphael | 2026-02-12 | 4,993 | -- |
| Forge: Scalable Agent RL Framework and Algorithm | MiniMax, Hyn, zhi zhang, Jiayuan Song, Da Chen, xkc, Yaoyao, kennyKK, and zpysky1125 | 2026-02-13 | 3,387 | -- |
| How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs … | Aritra Roy Gosthipaty | 2026-02-12 | 606 | -- |
| Custom Kernels for All from Codex and Claude | ben burtenshaw, Sayak Paul, Aritra Roy Gosthipaty, and shaun smith | 2026-02-13 | 1,792 | -- |
| What superpower does Kimi-K2.5 bring to the table? | Leco Li | 2026-02-13 | 1,154 | -- |
| The Chinese GLM-5 Model Now Ranks #2 in Arabic Language Performance | Karim Ouda | 2026-02-16 | 322 | -- |
| Compute and Competition in AI: Different FlOPs for Different Folks | Yacine Jernite and Sasha Luccioni | 2026-02-12 | 1,917 | -- |
| How to Build a Benchmark with a Private Test Set on Hugging … | Georgia Channing | 2026-02-16 | 1,775 | -- |
| Qwen3.5: Nobody Agrees on Attention Anymore | Maxime Labonne | 2026-02-17 | 1,192 | -- |
| NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル | Atsunori Fujita, Kotaro Yamamoto, Masaya Ogushi, Vincent Gong, Ameya Sunil Mahabaleshwarkar, and Yoshi Suhara | 2026-02-17 | 297 | -- |
| DenseR: Dense Rewards For Free in LLM Reasoning | Hritik Bansal | 2026-02-18 | 3,977 | -- |
| De-mystifying Multimodal Learning: Enabiling Vision in Language Models | Matteo Nulli | 2026-02-17 | 2,797 | -- |
| One-Shot Any Web App with Gradio's gr.HTML | yuvraj sharma, hysts, and Freddy Boulton | 2026-02-18 | 829 | -- |
| IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and … | Ayhan Sebin, Rohan Arora, and Saurabh Jha | 2026-02-18 | 2,253 | -- |
| Did GPT 5.2 make a breakthrough discovery in theoretical physics? | David Louapre | 2026-02-19 | 4,541 | -- |
| ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models? | Antoine Chaffin, Luca Arnaboldi, Amélie Chatelain, and Florent Krzakala | 2026-02-19 | 2,306 | -- |
| 「データ不足」の壁を越える:合成ペルソナが日本のAI開発を加速 | Atsunori Fujita, Masaya Ogushi, Will Jennings, Yev Meyer, Kotaro Yamamoto, Yoshi Suhara, Vincent Gong, and Dane Corneil | 2026-02-19 | 280 | -- |
| I Let a Lobster Run My Jetson: What OpenClaw Taught Me About … | Andres Marafioti | 2026-02-19 | 1,509 | -- |
| Train AI models with Unsloth and Hugging Face Jobs for FREE | ben burtenshaw, Daniel (Unsloth), Michael Han, Maxime Labonne, Daniel van Strien, and shaun smith | 2026-02-20 | 944 | -- |
| GGML and llama.cpp join HF to ensure the long-term progress of Local … | Georgi Gerganov, Xuan-Son Nguyen, Aleksander Grygier, Lysandre, Victor Mustar, and Julien Chaumond | 2026-02-20 | 936 | -- |
| Introducing Legal RAG Bench | Umar Butler and Abdur-Rahman Butler | 2026-02-20 | 3,235 | -- |
| FINAL Bench: The Real Bottleneck to AGI Is Self-Correction | VIDRAFT_LAB | 2026-02-21 | 1,146 | -- |
| How We Learned to Talk to Machines | Tyler Williams | 2026-02-20 | 1,156 | -- |
| Kimi K2.5: Still Worth It After Two Weeks? | Maxime Labonne | 2026-02-23 | 1,448 | -- |
| Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism? | VIDRAFT_LAB | 2026-02-24 | 2,770 | -- |
| Follow the White Rabbit: Using Embeddings So You Never Get Lost in … | David Corvoysier | 2026-02-23 | 1,420 | -- |
| MAEB: Evaluating Audio Embeddings at Scale | Adnan El Assadi, Solomatin Roman, Kenneth C. Enevoldsen, and Isaac Chung | 2026-02-24 | 1,349 | -- |
| A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and … | Karina Zadorozhny | 2026-01-19 | 7,738 | -- |
| Deploying Open Source Vision Language Models (VLM) on Jetson | Mitesh Patel, Johnny Nuñez Cano, and Raymond Lo | 2026-02-24 | 1,591 | -- |
| GEM Image: Building an AI That Actually Gets Educational Diagrams Right | AIPrep | 2026-02-21 | 966 | -- |
| Mixture of Experts (MoEs) in Transformers | Aritra Roy Gosthipaty, Pedro Cuenca, merve, Ilyas Moutawwakil, Arthur Zucker, Sergio Paniego, and Pablo Montalvo | 2026-02-26 | 2,054 | -- |
| Your MoE Model Does Not Have to Select Fixed Number of Experts | Tong Zhu, Xuyang Hu, Xiaoye Qu, Guanjie Chen, and Yu Cheng | 2026-02-26 | 4,405 | -- |
| Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? | Yichen Feng, Yuetai Li, Chunjiang Liu, Yue Huang, Zhengqing Yuan, Fengqing Jiang, Zichen Chen, and Zhangchen Xu | 2026-02-25 | 1,792 | -- |
| Bringing Autonomous Driving RL to OpenEnv and TRL | Sergio Paniego | 2026-02-26 | 1,814 | -- |
| A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3 | Quentin Macé, Gabriel de Souza Pereira Moreira, Antoine EDY, Radek Osmulski, and Bo Liu | 2026-02-27 | 1,886 | -- |
| Create, Evaluate, and Connect AI Skills | SkillNet: A Large-Scale Agentic "Skill … | Yuan Liang, Ningyu Zhang, and Xu Ziwen | 2026-02-28 | 2,039 | -- |
| 构建、评估与连接 AI 技能 | SkillNet:大规模智能体“技能图谱”知识库 | Yuan Liang, Ningyu Zhang, and Xu Ziwen | 2026-02-28 | 370 | -- |
| Getting More from Your Test-Time Compute Budget with Portfolio Beam Search | Dan Elbaz, Oren Salzman, Oren Pereg, Daniel Korat, and Ronen Laperdon | 2026-02-24 | 3,527 | -- |
| easytranscriber: Speech Recognition with Accurate Timestamps in the HF Ecosystem | Faton Rekathati | 2026-03-03 | 1,169 | -- |
| The ML Engineer's Guide to Protein AI | Maziyar Panahi | 2026-03-03 | 3,612 | -- |
| PRX Part 3 — Training a Text-to-Image Model in 24h! | David Bertoin, Roman Frigg, and Jon Almazán | 2026-03-03 | 1,732 | -- |
| Introducing Kanon 2 Enricher — the world’s first hierarchical graphitization model | Umar Butler and Abdur-Rahman Butler | 2026-03-03 | 1,571 | -- |
| AI Coding Assistants Keep Shipping Vulnerable Code -- Here's What We're Doing … | Scott Thornton | 2026-02-26 | 371 | -- |
| LLM Architectures Explained: What Powers Today’s Top Models | Sara Han Díaz and Bertrand Charpentier | 2026-03-04 | 1,628 | -- |
| TiRex on the Edge | Robert Weber, Christian Ganhör, and Lukas Fischer | 2026-03-05 | 506 | -- |
| Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device … | Gaetan Bahl | 2026-03-05 | 1,851 | -- |
| Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines | YiYi Xu, Alvaro Somoza, Dhruv Nair, and Sayak Paul | 2026-03-05 | 1,907 | -- |
| NEO-unify: Building Native Multimodal Unified Models End to End | Haiwen Diao, Lewei Lu, and Ziwei Liu | 2026-03-05 | 623 | -- |
| Building Tucano 2: Open-Source Language Models That Actually Think in Portuguese | Nicholas Kluge Corrêa, Aniket Sen, Shiza Fatimah, Sophia Falk, and Lucie Flek | 2026-03-05 | 2,258 | -- |
| De-mystifying Multimodal Learning: The Hidden Inefficiency in Vision Language Modelling | Matteo Nulli | 2026-03-04 | 2,120 | -- |
| Konkani LLM: Bringing a Multi-Script Low-Resource Language to the AI Era | Reuben fernandes | 2026-03-07 | 861 | -- |
| Structural Problems in AI Benchmarking and the Case for a Unified Evaluation … | VIDRAFT_LAB | 2026-03-08 | 1,171 | -- |
| MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning | VIDRAFT_LAB | 2026-03-09 | 1,663 | -- |
| LeRobot v0.5.0: Scaling Every Dimension | Steven Palma, Pepijn Kooijmans, Jade Choghari, Caroline Pascal, Khalil Meftah, Martino Russi, Nicolas Rabault, Michel Aractingi, Virgile BATTO, and Thomas Wolf | 2026-03-09 | 1,931 | -- |
| Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge | George Saon and Madison Lee | 2026-03-09 | 385 | -- |
| Ulysses Sequence Parallelism: Training with Million-Token Contexts | Kashif Rasul and Stas Bekman | 2026-03-09 | 3,003 | -- |
| Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries | Amine Dirhoussi, Quentin Gallouédec, Kashif Rasul, Lewis Tunstall, Edward Beeching, Albert Villanova del Moral, Nouamane Tazi, and Leandro von Werra | 2026-03-10 | 9,358 | -- |
| Kanon 2 Reranker: the most powerful reranker for legal RAG | Umar Butler and Abdur-Rahman Butler | 2026-03-10 | 471 | -- |
| How NVIDIA Builds Open Data for AI | Will Jennings, Yev Meyer, Leanna Chraghchian, Rebecca Kao, Jane Polak Scowcroft, and Annie Surla | 2026-03-10 | 1,590 | -- |
| 🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language … | VIDRAFT_LAB | 2026-03-10 | 2,482 | -- |
| Introducing Storage Buckets on the Hugging Face Hub | Lucain Pouget, Eliott Coyac, Adrien Carreira, Victor Mustar, Julien Chaumond, Quentin Lhoest, Pierric Cistac, Sylvestre Bcht, Hugo Larcher, Rajat Arya, Di Xiao, and Assaf Vayner | 2026-03-10 | 1,591 | -- |
| ShopRLVE-GYM: Adaptive Verifiable Environments for E-Commerce Conversational Agents | Rahul Bajaj and Jaya Nupur | 2026-03-08 | 4,976 | -- |
| Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds | Joseph Jennings and Brandon Norick | 2026-03-11 | 710 | -- |
| Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens | Asankhaya Sharma | 2026-03-06 | 4,656 | -- |
| How NVIDIA AI-Q Reached #1 on DeepResearch Bench I and II | David Austin | 2026-03-12 | 1,749 | -- |
| Build an Agent That Thinks Like a Data Scientist: How We Hit … | Jiwei Liu, Maximilian Jeblick, and Jack Yu | 2026-03-13 | 2,052 | -- |
| Arabic TTS Arena: Ranking Voice Models the Way Chess Ranks Grandmasters | Mohamed Rashad | 2026-03-12 | 1,698 | -- |
| Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline | Radek Osmulski, Reza Esfandiarpoor, Yauhen Babakhin, Gabriel de Souza Pereira Moreira, and Bo Liu | 2026-03-13 | 1,520 | -- |
| Pruna 0.3.2: More OSS Algos, More Ways to Optimize | Minette Kaunismäki, Begüm Çığ, Gaspar Rochette, Sara Han Díaz, and Bertrand Charpentier | 2026-03-11 | 922 | -- |
| SILMA TTS: A Lightweight Open Bilingual Text to Speech Model | Karim Ouda | 2026-03-15 | 524 | -- |
| The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare … | Sean Huver, Nigel Nelson, Lukas Zbinden, and Mostafa Toloui | 2026-03-16 | 865 | -- |
| Tokenization is Killing our Multilingual LLM Dream | Omar Kamali | 2026-03-15 | 3,383 | -- |
| Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, … | Marco Pavone | 2026-03-16 | 1,259 | -- |
| Holotron-12B - High Throughput Computer Use Agent | Pierre-Louis Cedoz, Hamza Benchekroun, Aurélien Lac, delfosse, Tony Wu, Mats L. Richter, Antoine Bonnet, Kai Yuan, Aleix Cambray (H-AI), and Alexandra | 2026-03-17 | 868 | -- |
| Super Analyzer: Combining Reasoning and Coding Capabilities to Improve Code Performance | Girish Ganesan and Balachandran Rajendran | 2026-03-13 | 1,363 | -- |
| LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric | Subash SN, Akshay Nambiar, Milan Gritta, Zhen Cong Chen, Arsalan Anwari, Gianfranco Cordella, and Amril Nurman | 2026-03-17 | 3,124 | -- |
| Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI | Vinay Raman, Ameya Sunil Mahabaleshwarkar, Hayley Ross, Bilal Kartal, Aditya Malte, Zijia Chen, Ali Taghibakhshi, Sharath Turuvekere Sreenivas, Saurav Muralidharan, Khalil Ben Khaled, Nima Tajbakhsh, Pavlo Molchanov, Oluwatobi Olabiyi, and Yoshi Suhara | 2026-03-17 | 1,552 | -- |
| State of Open Source on Hugging Face: Spring 2026 | Avijit Ghosh, Lucie-Aimée Kaffee, Yacine Jernite, and Irene Solaiman | 2026-03-17 | 2,883 | -- |
| Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding | Talor Abramovich, Maor Ashkenazi, Izzy Putterman, Benjamin Chislett, Tiyasa Mitra, Bita Rouhani, Ran Zilberstein, and Yonatan Geifman | 2026-03-19 | 2,333 | -- |
| ATE-2: State-of-the-Art Armenian Text Embeddings and the ArmBench-TextEmbed Benchmark | Hrant Davtyan, Zaruhi Navasardyan, Spartak Bughdaryan, and bag_min | 2026-03-19 | 438 | -- |
| What's New in Mellea 0.4.0 + Granite Libraries Release | Abraham Daniels | 2026-03-20 | 469 | -- |
| Build a Domain-Specific Embedding Model in Under a Day | Steve H, Rucha Apte, Sean Sodha, and Oliver Holworthy | 2026-03-20 | 2,729 | -- |
| Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic … | Yunus Cukran | 2026-03-21 | 986 | -- |
| NanoVDR: A 70M Text-Only Model That Retrieves Visual Documents as Well as … | Zhuchenyang Liu | 2026-03-16 | 1,493 | -- |
| Pocket Models for iOS: Explore On-Device AI with GGUF Models, Data Memory, … | Hamit Hasanhocaoglu, Arda Dogantemur, Metecan Duyal, and StJohn Deakins | 2026-03-18 | 1,270 | -- |
| Introducing AI chunking to semchunk | Umar Butler and Abdur-Rahman Butler | 2026-03-23 | 2,228 | -- |
| Canada Must Not Turn AI Chatbots Into a New Surveillance Frontier | Noah Weinberger | 2026-03-16 | 1,934 | -- |
| A New Framework for Evaluating Voice Agents (EVA) | Tara Bogavelli, Gabrielle Gauthier Melancon, Katrina Stankiewicz, Nifemi Bamgbose, Hoang Nguyen, Raghav Mehndiratta, Hari Subramani, and Fanny Riols | 2026-03-24 | 2,147 | -- |
| SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation | Maziyar Panahi, merve, Jamie@Doubleword, Josh, Seb Ringrose, and Fergus Finn | 2026-03-23 | 3,730 | -- |
| Introducing Cohere-transcribe: state-of-the-art speech recognition | Julian Mack, Ekagra Ranjan, Walter Beller-Morales, Bharat venkitesh, and Pierre Richemond | 2026-03-26 | 1,485 | -- |
| Liberate your OpenClaw 🦀 | Clem 🤗, ben burtenshaw, Pedro Cuenca, Jeff Boudier, merve, Niels Rogge, Victor Mustar, and Mishig Davaadorj | 2026-03-27 | 593 | -- |
| White Hat Security Agent Prompts 600K Dataset by Yatin Taneja | Yatin Taneja | 2026-03-23 | 1,181 | -- |
| Letter of Superintelligence ~ Yatin Taneja | Yatin Taneja | 2026-03-23 | 1,031 | -- |
| ORBA: Orthogonal Reflection Bounded Ablation — A Geometrically Exact Detour in Directional … | Jim Lai | 2026-03-25 | 5,092 | -- |
| Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models | VIDRAFT_LAB | 2026-03-29 | 1,563 | -- |
| How I contributed a new model to the Transformers library using Codex | Niels Rogge | 2026-03-30 | 2,696 | -- |
| Training mRNA Language Models Across 25 Species for $165 | Maziyar Panahi | 2026-03-31 | 6,915 | -- |
| TRL v1.0: Post-Training Library Built to Move with the Field | Quentin Gallouédec, Steven Liu, Pedro Cuenca, and Sergio Paniego | 2026-03-31 | 3,093 | -- |
| Falcon Perception | wamiq para and FalconPerception | 2026-04-01 | 2,955 | -- |
| Using Storage Buckets as a Working Layer for Data Pipelines | Daniel van Strien | 2026-03-26 | 1,095 | -- |
| Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents | Madison Lee, Rogerio Feris, Eli Schwartz, Dhiraj Joshi, Pengyuan Li, and Isaac Sanchez | 2026-03-31 | 1,316 | -- |
| "The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge" | VIDRAFT_LAB | 2026-03-31 | 2,884 | -- |
| 🌈 SKT AI LABS 🌈 | ѕкт αι ℓαвѕ | 2026-03-30 | 555 | -- |
| Holo3: Breaking the Computer Use Frontier | Ramzi De Coster, Pierre-Louis Cedoz, Tony Wu, Hamza Benchekroun, mandreux-hai, delfosse, Aurélien Lac, maxime, Axel Moyal, Antonio Loison, Kai Yuan, and Ronan Riochet | 2026-04-01 | 813 | -- |
| Run Gemma 4 on Intel® Arc™ GPUs Out-Of-the-Box | Matrix Yao, Chendi Xue, FanZhao, Xinyu Chen, Alex Gu, Wuxun Zhang, Xinyi Li, jianan, Yi Wang, and Yintong Lu | 2026-04-01 | 1,495 | -- |
| Welcome Gemma 4: Frontier multimodal intelligence on device | merve, Pedro Cuenca, Sergio Paniego, ben burtenshaw, Steven Zheng, Alvaro Bartolome, and Nathan Habib | 2026-04-02 | 6,003 | -- |
| ArmBench-LLM 1.0: Benchmarking LLMs on Armenian Language Tasks | Hrant Davtyan, Zaruhi Navasardyan, Spartak Bughdaryan, and bag_min | 2026-04-02 | 1,205 | -- |
| YC-Bench: Can Your AI Agent Run a Startup Without Going Bankrupt? | Adit, Riddle He, Vincent Tu, Anand Kumar, and Nazneen Rajani | 2026-04-02 | 169 | -- |
| Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their … | Gustavo A Lujan and kedar kolluri | 2026-04-03 | 2,730 | -- |
| Run Gemma 4 on Intel® Xeon® Out-Of-the-Box | Jiang Li, Xinyu Chen, Chendi Xue, FanZhao, Yi Wang, Wuxun Zhang, Alex Gu, Xinyi Li, jianan, Yintong Lu, and Matrix Yao | 2026-04-01 | 1,464 | -- |
| gradio.Server: Any Custom Frontend with Gradio's Backend | yuvraj sharma and Abubakar Abid | 2026-04-01 | 1,160 | -- |
| From doctest to runnable Markdown | Tarek Ziadé | 2026-04-04 | 1,460 | -- |
| Darwin V6: Diagnostic-Guided Evolutionary Model Merging | VIDRAFT_LAB | 2026-04-08 | 1,003 | -- |
| How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs | Niels Rogge | 2026-04-07 | 1,246 | -- |
| BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders | Nicolas-BZRD and Théo Deschamps-Berger | 2026-04-07 | 1,772 | -- |
| Safetensors is Joining the PyTorch Foundation | Luc Georges and Lysandre | 2026-04-08 | 807 | -- |
| ALTK‑Evolve: On‑the‑Job Learning for AI Agents | Vatche Isahagian, Vinod Muthusamy, Jayaram Radhakrishnan, Gaodan Fang, Punleuk Oum, and G Thomas | 2026-04-08 | 1,180 | -- |
| Building Harvey-style tabular review from scratch, but better | Abdur-Rahman Butler | 2026-04-09 | 4,508 | -- |
| Multimodal Embedding & Reranker Models with Sentence Transformers | Tom Aarsen | 2026-04-09 | 2,886 | -- |
| Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs | Andrew Lapp, Louis Castricato, Scott Fox, Shahbuland Matiana, and David Rossi | 2026-04-09 | 857 | -- |
| Using OCR models with llama.cpp | Xuan-Son Nguyen | 2026-04-10 | 816 | -- |
| "Darwin-27B-Opus: Surpassing the Foundation Model Without Training" | VIDRAFT_LAB | 2026-04-13 | 1,806 | -- |
| Releasing LiteCoder-Terminal-SFT | LiteCoder | 2026-04-13 | 833 | -- |
| When Speech AI Meets the Long Tail of Languages: Inside the VAANI … | Sujith Pulikodan, Sanka, Nihar Desai, Suryansh Shukla, and Prasanta Kumar Ghosh | 2026-04-14 | 901 | -- |