HuggingFace Blog
289 posts indexed since 2022
Post Details
| Title | Author | Published | Words | HN Pts |
|---|---|---|---|---|
| Building the Open Agent Ecosystem Together: Introducing OpenEnv | Joseph Spisak, Davide Testuggine, Zach Wentz, Pierre Andrews, Sanyam Bhutani, Hamid Shojanazeri, Pankit Thapar, Emre Guven, Lewis Tunstall, and Vaibhav Srivastav | 2025-10-23 | 1,117 | -- |
| VibeGame: Exploring Vibe Coding Games | Dylan Ebert | 2025-09-29 | 1,777 | -- |
| Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard | Yauhen Babakhin, Radek Osmulski, Ronay Ak, Gabriel de Souza Pereira Moreira, and Mengyao Xu | 2025-10-21 | 706 | -- |
| Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes | Bryan Catanzaro and Jonathan Cohen | 2025-10-22 | 1,684 | -- |
| Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm | You Liang Tan, Fengyuan Hu, Oyindamola Omotuyi, Oluwaseun Doherty, Chitoku Yato, and Shane Reetz | 2025-06-11 | 1,902 | -- |
| Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than … | Solomatin Roman, Kenneth C. Enevoldsen, and Isaac Chung | 2025-10-20 | 2,320 | -- |
| huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning | Lucain Pouget, Célina Hanouti, Lysandre, and Julien Chaumond | 2025-10-27 | 2,139 | -- |
| Supercharge your OCR Pipelines with Open Models | merve, Aritra Roy Gosthipaty, Daniel van Strien, Hynek Kydlicek, Andres Marafioti, Vaibhav Srivastav, and Pedro Cuenca | 2025-10-21 | 3,544 | -- |
| Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for … | Prachi Mishra | 2025-10-28 | 921 | -- |
| Hugging Face and VirusTotal collaborate to strengthen AI security | Adrien Carreira and Bernardo Quintero | 2025-10-22 | 507 | -- |
| Voice Cloning with Consent | Margaret Mitchell and Lucie-Aimée Kaffee | 2025-10-28 | 1,394 | -- |
| Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with … | Jiqing.Feng, Matrix Yao, Ke Ding, and Ilyas Moutawwakil | 2025-10-16 | 1,374 | -- |
| Advancing Predictive ADMET Modeling Through Community-Driven Science: The ExpansionRx-OpenADMET Blind Challenge | Georgia Channing and Hugo MacDermott | 2025-10-27 | 943 | -- |
| Vision Tokens vs Text Tokens: Understanding the 10× Compression | Yi Cui | 2025-10-22 | 535 | -- |
| Projected Abliteration | Jim Lai | 2025-10-25 | 2,218 | -- |
| Streaming datasets: 100x More Efficient | Andres Marafioti, Quentin Lhoest, ben burtenshaw, Pedro Cuenca, and merve | 2025-10-27 | 1,306 | -- |
| Sentence Transformers is joining Hugging Face! | Tom Aarsen | 2025-10-22 | 1,011 | -- |
| Unlock the power of images with AI Sheets | Ame Vi, Daniel Vila, Francisco Aranda, Damián Pumar, Leandro von Werra, and Thomas Wolf | 2025-10-21 | 1,495 | -- |
| Get your VLM running in 3 simple steps on Intel CPUs | Ezequiel Lanza, Helena, Nikita, Ella Charlaix, and Ilyas Moutawwakil | 2025-10-15 | 1,479 | -- |
| Introducing RTEB: A New Standard for Retrieval Evaluation | Frank Liu, Kenneth C. Enevoldsen, Solomatin Roman, Isaac Chung, Tom Aarsen, and Fődi, Zoltán | 2025-10-01 | 2,833 | -- |
| Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac | Steven Palma and Andres Diaz-Pinto | 2025-10-29 | 1,115 | -- |
| Uncensor any LLM with abliteration | Maxime Labonne | 2024-06-13 | 3,144 | -- |
| GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms | Lina Bariah, Antonio De Domenico, Louis Powell, Mohamed Sana, Merouane Debbah, Mark Austin, Farbod Tavakkoli, George George, Nicola Piovesan, Simone Mangiante, cherrared, Sumeyye Bas, GHADA SOLIMAN, Dilara Zeynep Gurer, Laszlo Suto, and Pierre Wang | 2025-10-20 | 3,090 | -- |
| NVIDIA Isaac GR00T in LeRobot | lior ben horin, Kartik S, Aravindh Shan, Asawaree, and You Liang Tan | 2025-10-28 | 1,182 | -- |
| LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR | Said Taghadouini, Baptiste Aubertin, and Adrien Cavaillès | 2025-10-23 | 4,470 | -- |
| Granite 4.0 Nano: Just how small can you go? | Kate Soule and Rameswar Panda | 2025-10-28 | 544 | -- |
| Code a simple RAG from scratch | Xuan-Son Nguyen | 2024-10-29 | 2,933 | -- |
| How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA … | Asawaree | 2025-10-28 | 1,078 | -- |
| Can Your LLM Think Like a Professional? Introducing ProfBench | Zhilin Wang, Jaehun Jung, Ximing Lu, Shizhe Diao, jiaqiz, VivienneZhang, Nik Spirin, and Dong | 2025-10-28 | 1,337 | -- |
| 🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI | Maarten Van Segbroeck | 2025-10-28 | 988 | -- |
| SOTA OCR on-device with Core ML and dots.ocr | Christopher Fleetwood and Pedro Cuenca | 2025-10-02 | 1,910 | -- |
| Australian-made LLM beats OpenAI and Google at legal retrieval | Umar Butler, Abdur-Rahman Butler, and Adrian Lucas Malec | 2025-10-23 | 930 | -- |
| NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image … | Yao Xu, Timo Roman, Lukas Voegtle, Philipp Fischer, Amala Sanjay Deshmukh, Kateryna Chumachenko, and Jarno Seppänen | 2025-10-28 | 1,014 | -- |
| Promoter-GPT: Writing DNA Instructions with Language Models | Adele de Hoffer | 2025-10-22 | 3,509 | -- |
| LeRobot v0.4.0: Super Charging OSS Robotics Learning | Steven Palma, Michel Aractingi, Pepijn Kooijmans, Caroline Pascal, Jade Choghari, Francesco Capuano, Adil Zouitine, Martino Russi, and Thomas Wolf | 2025-10-24 | 1,980 | -- |
| KV Caching Explained: Optimizing Transformer Inference Efficiency | Hafedh Hichri | 2025-01-30 | 1,230 | -- |
| Why Did MiniMax M2 End Up as a Full Attention Model? | MiniMax | 2025-10-30 | 1,640 | -- |
| The World’s First and Best Speed Painting Software | 2025-10-29 | 1,368 | -- | |
| 3+ Years of ML & Society at Hugging Face 🤗🤝🧑🤝🧑 | Yacine Jernite, Giada Pistilli, Lucie-Aimée Kaffee, and Sasha Luccioni | 2025-10-29 | 807 | -- |
| Nemotron-Personas-USA: Synthesized Data for Sovereign AI | Will Jennings, Dane Corneil, and Yev Meyer | 2025-10-28 | 630 | -- |
| svara-TTS — Open Multilingual TTS for India’s Voices | Aditya Chhabra | 2025-10-27 | 1,626 | -- |
| What makes good reasoning data | MiniMax | 2025-10-30 | 629 | -- |
| On the Shifting Global Compute Landscape | Tiezhen WANG and Irene Solaiman | 2025-10-29 | 3,172 | -- |
| Aligning to What? Rethinking Agent Generalization in MiniMax M2 | MiniMax | 2025-10-30 | 1,103 | -- |
| Evaluate Your Own RAG: Why Best Practices Failed Us | Charles AZAM, Antoine Hoorelbeke, Antoine Guyot, Maxence Leclercq, and Jérémy PICOSSON | 2025-11-05 | 3,569 | -- |
| Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation | Exploding Gradients | 2025-09-16 | 3,586 | -- |
| DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge | Yihua Zhang | 2025-02-07 | 2,499 | -- |
| ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases | Quentin Macé, Antonio Loison, Antoine EDY, Victor Xing, and Gautier Viaud | 2025-11-05 | 2,524 | -- |
| Classement compar:IA : des votes des utilisateurs au classement participatif des modèles | compar:IA | 2025-11-03 | 1,821 | -- |
| Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness | Steven Zheng | 2025-11-05 | 1,120 | -- |
| Running Large Transformer Models on Mobile and Edge Devices | MtugrulKaya | 2025-11-03 | 6,026 | -- |
| TorchSim: A new PyTorch-based molecular dynamics engine | Davide Sarpa | 2025-10-31 | 3,592 | -- |
| The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix | Asankhaya Sharma | 2025-11-03 | 1,833 | -- |
| ⚡ Power, Heat, and Intelligence ☁️ - AI Data Centers Explained 🏭 | Boris Gamazaychikov and Sasha Luccioni | 2025-11-05 | 2,952 | -- |
| Small Language Models (SLM): A Comprehensive Overview | John Johnson | 2025-02-22 | 1,456 | -- |
| Toward Community-Governed Safety | Giada Pistilli and Lucie-Aimée Kaffee | 2025-11-03 | 681 | -- |
| From GRPO to DAPO and GSPO: What, Why, and How | Yihua Zhang | 2025-08-09 | 5,841 | -- |
| Budget Alignment: Making Models Reason in the User’s Language | Shan Chen, Jirui Qi, and Zidi Xiong | 2025-11-04 | 3,207 | -- |
| Introduction to State Space Models (SSM) | Loïck BOURDOIS | 2024-07-19 | 6,663 | -- |
| Let's talk about LLM evaluation | Clémentine Fourrier | 2024-05-23 | 3,264 | -- |
| Who Routes LLM Routers? RouterArena: Building the Evaluation Foundation for LLM Routing | Yifan Lu, Riksin, Jiayi Yuan, Bruce Cui, SJ Chang, Hongyi Liu, and Jiarong Xing | 2025-11-11 | 1,552 | -- |
| SYNTH: the new data frontier | Pierre-Carl Langlais | 2025-11-10 | 1,995 | -- |
| Effective Prompting for Generative Vision Models | Sara Han Díaz and Bertrand Charpentier | 2025-11-10 | 1,013 | -- |
| 🌳 QAT: The Art of Growing a Bonsai Model | Yi Cui | 2025-11-09 | 1,267 | -- |
| Norm-Preserving Biprojected Abliteration | Jim Lai | 2025-11-06 | 2,135 | -- |
| Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face | Daniel Voigt Godoy | 2025-02-11 | 3,900 | -- |
| Mastering Tensor Dimensions in Transformers | Hafedh Hichri | 2025-01-12 | 2,555 | -- |
| Text-to-image Architectural Experiments | David Bertoin, Jon Almazán, and Roman | 2025-11-13 | 3,525 | -- |
| Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level … | Tensor-Slayer | 2025-11-07 | 1,843 | -- |
| We’re open-sourcing our text-to-image model and the process behind it | Jon Almazán, David Bertoin, and Roman | 2025-11-12 | 1,110 | -- |
| Building for an Open Future - our new partnership with Google Cloud | Jeff Boudier and Simon Pagezy | 2025-11-13 | 869 | -- |
| Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach | Pere Martra | 2024-11-24 | 3,670 | -- |
| ⛳ Optimizer: What Does It Do and Why We Need It | Yi Cui | 2025-11-12 | 1,313 | -- |
| To Think or Not to Think: A Router for Hybrid LLMs | Amir Mohseni | 2025-11-16 | 2,137 | -- |
| The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs | Xiaoran Liu (SII) | 2025-11-15 | 1,834 | -- |
| The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling | Elaine McVey Houskeeper and Georgia Channing | 2025-11-18 | 1,662 | -- |
| Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models | Torsten Scholak, Oleksiy Ostapenko, Raymond Li, Luke Kumar, and Joel Lamy-Poirier | 2025-11-19 | 1,709 | -- |
| Easily Build and Share ROCm Kernels with Hugging Face | Abdennacer Badaoui, Daniel Huang, colorswind, and Zesen Liu | 2025-11-17 | 3,120 | -- |
| Join the AMD Open Robotics Hackathon | Eric Ma and Guruprasad MP | 2025-11-13 | 506 | -- |
| PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs | Samuel Lima Braz | 2025-01-24 | 8,770 | -- |
| AI Model Optimization More Flexible Than Ever | Johanna Sommer, Sara Han Díaz, and Bertrand Charpentier | 2025-11-17 | 725 | -- |
| Visualizing How VLMs Work | Hafedh Hichri and Ed Daniels | 2025-10-07 | 1,851 | -- |
| 🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset | Cornelius Wolff, Daniel Gomm, and Madelon Hulsebos | 2025-11-19 | 944 | -- |
| Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms | Mattt | 2025-11-20 | 1,326 | -- |
| Introducing Cogito v2.1 | Deep Cogito Team | 2025-11-19 | 1,067 | -- |
| Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks | Eric Bezzam, Steven Zheng, Eustache Le Bihan, and Vaibhav Srivastav | 2025-11-21 | 936 | -- |
| 20x Faster TRL Fine-tuning with RapidFire AI | Kamran Bigdely, Arun Kumar, and Quentin Gallouédec | 2025-11-21 | 1,198 | -- |
| How to make NeuTTS-air generate over 200 seconds of audio in a … | Yatharth Sharma | 2025-11-21 | 792 | -- |
| Building Deep Research: How we Achieved State of the Art | Michael Griff, Dean Sacoransky, and Noah Nefsky | 2025-11-24 | 1,628 | -- |
| OVHcloud on Hugging Face Inference Providers 🔥 | Gilles Closset, Fabien Ric, and Elias Tourneux | 2025-11-24 | 788 | -- |
| Prefill and Decode for Concurrent Requests - Optimizing LLM Performance | Benjamin Merkel | 2025-04-16 | 2,165 | -- |
| Announcing the LLM Open Finance models | Raheel Qader, Gaëtan Caillaut, Jingshu, Mariam Nakhle, Arezki SADOUNE, MASSINISSA AHMIM, and Jean-Gabriel BARTHELEMY | 2025-11-24 | 601 | -- |
| DeLERP: Decomposed Linear Interpolation for Model Merging | Jim Lai | 2025-11-20 | 1,364 | -- |
| How MCP Blockly Makes MCP Server Creation Accessible for Everyone | Owen Kaplinsky | 2025-11-28 | 952 | -- |
| Curating datasets directly on the Hub | Daniel Vila | 2025-11-27 | 504 | -- |
| 10 Best Open-Source LLM Models (2025 Updated): Llama 4, Qwen 3 and … | Daya Shankar | 2025-11-13 | 2,419 | -- |
| Gemini-3 Benchmarkathon | Robert Scholz, Slimane Alaoui Soulimani Valenti, Ernest Beta, Odysseas S. Chlapanis, Adhithya kiran, Matteo Bürgler, Sophie Franco, Chu Fei Luo, Prof. Samuel Dahan, and Joel Niklaus | 2025-11-28 | 4,648 | -- |
| Building Jobly: Semantic Job Matching with RAG and Vector Embeddings | Valentina Nieddu and Giacomo Bandini | 2025-11-28 | 1,878 | -- |
| Continuous batching | Rémi Ouazan Reboul, Arthur Zucker, and Luc Georges | 2025-11-25 | 3,970 | -- |
| Welcome FLUX.2 - BFL’s new open image generation model 🤗 | YiYi Xu, Daniel Gu, Sayak Paul, Alvaro Somoza, Dhruv Nair, Aritra Roy Gosthipaty, Linoy Tsaban, and Apolinário from multimodal AI art | 2025-11-25 | 3,460 | -- |
| A Guide to Hugging Face’s Papers Page | Adina Yakefu | 2025-11-25 | 973 | -- |
| makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch | Avinash Sooriyarachchi | 2024-05-07 | 3,812 | -- |
| Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications | Traian Rebedea, Shyamala Prayaga, Makesh Sreedhar, Chris Parisien, and Isabel Hulseman | 2025-12-02 | 1,648 | -- |
| Transformers v5: Simple model definitions powering the AI ecosystem | Lysandre, Arthur Zucker, Cyril Vallez, and Vaibhav Srivastav | 2025-12-01 | 2,250 | -- |
| Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO … | Yihua Zhang | 2025-02-11 | 18,441 | -- |
| Building and evaluating Multimodal Rerankers | Ulrick BLE | 2025-11-30 | 4,201 | -- |
| An Edge-First Generalized LLM LoRA Fine-Tuning Framework for Heterogeneous GPUs | Subash SN, Akshay Nambiar, Patrik Lambert, Milan Gritta, and Amril Nurman | 2025-12-01 | 4,604 | -- |
| 📌 Rethinking Multimodality from an Industry Perspective: Captioning Is Far More Important … | Bohan Zhai and Shijia Yang | 2025-11-29 | 3,816 | -- |
| SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution | Solène Debuysère, Nicolas Trouvé, and Georgia Channing | 2025-12-01 | 1,551 | -- |
| Bringing Math to Life: Building StepWise Math for the MCP Hackathon | Vikas Gupta | 2025-11-27 | 948 | -- |
| Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement | Asankhaya Sharma | 2025-12-03 | 2,075 | -- |
| We Got Claude to Fine-Tune an Open Source LLM | ben burtenshaw and shaun smith | 2025-12-04 | 2,016 | -- |
| BERTs that chat: turn any BERT into a chatbot with dLLM | Zhanhui Zhou and Lingjie Chen | 2025-11-28 | 943 | -- |
| Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand | Quentin Gallouédec | 2025-12-04 | 1,219 | -- |
| AI Energy Score v2: Refreshed Leaderboard, now with Reasoning 🧠 | Sasha Luccioni and Boris Gamazaychikov | 2025-12-04 | 1,496 | -- |
| Introducing swift-huggingface: The Complete Swift Client for Hugging Face | Mattt | 2025-12-05 | 1,524 | -- |
| DeepFabric: Generate, Train and Evaluate with Datasets curated for Model Behavior Training. | Luke Hinds | 2025-12-04 | 3,284 | -- |
| TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval | Özay Ezerceli, Mahmud ElHuseyni 🇵🇸, SELVA TAŞ, Reyhan Bayraktar, Betül Terzioğlu, Yusuf Çelebi, Yağız Asker, and nmmursit | 2025-12-04 | 3,173 | -- |
| Engineering Notes: Training a LoRA for Z-Image Turbo with the Ostris AI … | Shawn | 2025-12-02 | 1,280 | -- |
| DeepMath: A lightweight math reasoning Agent with SmolAgents | Daniel Fleischer, Moshe Berchansky, and Moshe Wasserblat | 2025-12-04 | 1,123 | -- |
| Making Model Tuning Accessible: This is what we built observing 100s of … | Mehant, Yashasvi Chaurasia, Ashok Pon Kumar, and Praveen Jayachandran | 2025-12-05 | 1,821 | -- |
| A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: … | Yihua Zhang | 2025-02-04 | 7,388 | -- |
| Muon vs MuonClip vs Muon+AdamW for Fine-Tuning | Nishith Jain | 2025-12-09 | 705 | -- |
| How We Use Claude Code Skills to Run 1,000+ ML Experiments a … | Sigrid Jin | 2025-12-08 | 4,707 | -- |
| New in llama.cpp: Model Management | Xuan-Son Nguyen and Victor Mustar | 2025-12-11 | 740 | -- |
| Build Hallucination-Free RAG with Verbatim | Adam Kovacs | 2025-11-18 | 2,281 | -- |
| I Built a RAG System That Listens to Live BBC News and … | Rakshit Aralimatti | 2025-12-09 | 907 | -- |
| Make and publish your Reachy Mini App | Antoine Pirrone and Rouanet | 2025-12-03 | 1,081 | -- |
| Why You Should Care About Partial Differential Equations (PDEs) | Aishwarya Balaji, BryanBradfo, Jose Manuel Nápoles, Prateik Sinha, and Roey Ben Chaim | 2025-12-12 | 1,761 | -- |
| MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier | Surya Kant Sahu and Jaipal Singh | 2025-12-12 | 2,144 | -- |
| Diffusion Language Models: The New Paradigm | Pro Creations | 2025-06-10 | 1,644 | -- |
| Codex is Open Sourcing AI models | ben burtenshaw and shaun smith | 2025-12-11 | 2,426 | -- |
| Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance | Sathwik Tejaswi Madhusudhan, Sagar Davasam, and Torsten Scholak | 2025-12-09 | 1,908 | -- |
| CUGA on Hugging Face: Democratizing Configurable AI Agents | Jim Laredo, Avi Yaeli, Sami Marreed, AYHAN SEBIN, and Merve Unuvar | 2025-12-15 | 1,058 | -- |
| Topic 23: What is LLM Inference, it's challenges and solutions for it | Ksenia Se | 2025-01-17 | 1,511 | -- |
| Phare LLM benchmark V2: Reasoning models don't guarantee better security | Pierre Le Jeune, David Berenstein, Matteo, and Weixuan Xiao | 2025-12-16 | 2,631 | -- |
| Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation | kelseye.xh and Zhongjie Duan | 2025-12-16 | 1,416 | -- |
| The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator | Seph Mard, Isabel Hulseman, Besmira Nushi, Piotr Januszewski, Grzegorz Chlebus, VivienneZhang, Wojciech Prazuch, Pablo Ribalta, Nik Spirin, and Ferenc Galko | 2025-12-17 | 2,102 | -- |
| Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent … | Chris Alexiuk, Shashank Verma, Chintan, Chris Wing, and Gordana Neskovic | 2025-12-15 | 2,382 | -- |
| Everything You Need to Know about Knowledge Distillation | Ksenia Se and Alyona Vert | 2025-03-06 | 3,517 | -- |
| EuroLLM-22B | EuroLLM Team, Miguel Moura Ramos, Duarte Alves, and Hippolyte Gisserot-Boukhlef | 2025-12-14 | 1,162 | -- |
| Gotchas in Tokenizer Behavior Every Developer Should Know | Quentin Gallouédec | 2025-04-18 | 2,659 | -- |
| What is the Hugging Face Community Building? | Avijit Ghosh, Yacine Jernite, and Irene Solaiman | 2025-07-15 | 1,377 | -- |
| Open Collaboration in Action: Inside the Open Safeguard Hackathon | Andrew Chang, juliet shen, and Yacine Jernite | 2025-12-18 | 1,248 | -- |
| cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use … | Francesco Bonacci and Dillon DuPont | 2025-12-16 | 1,086 | -- |
| Spinning Up a CPU-Only Micro-LLM with LoRA for Literary Style | Kashif Salahuddin | 2025-12-16 | 1,000 | -- |
| Announcing LiteCoder-Terminal: Lightweight Terminal Agents with <1k Synthesized Trajectories | LiteCoder | 2025-12-18 | 677 | -- |
| Tokenization in Transformers v5: Simpler, Clearer, and More Modular | Ita Zaporozhets, Aritra Roy Gosthipaty, Arthur Zucker, Sergio Paniego, merve, and Pedro Cuenca | 2025-12-18 | 3,024 | -- |
| Shadow AI - Where are the CIOs? | Jeff Boudier | 2025-12-19 | 616 | -- |
| LLM based TTS models | Yatharth Sharma | 2025-12-18 | 871 | -- |
| AI Labs Must Resist Age Verification | Adam Molnar and Noah Weinberger | 2025-12-17 | 2,593 | -- |
| 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About … | Ksenia Se | 2025-03-17 | 4,266 | -- |
| Backbone-Optimizer Coupling Bias: The Hidden Co-Design Principle | Juanxi Tian | 2025-12-20 | 5,279 | -- |
| Encoding the World's Medical Knowledge into 970K | David Mezzetti | 2025-12-22 | 934 | -- |
| Skill is All You Need: Lessons from Building Marketing Agents at Noumena | liuzeming, Arcobalneo, HUANLIN LUO, wubin, Huan Zhao, Lee, and Noumena-AI | 2025-12-25 | 2,334 | -- |
| AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems | Jaykumar Kasundra | 2025-12-23 | 2,080 | -- |
| Understanding InstaFlow/Rectified Flow | Isamu Isozaki | 2023-10-06 | 1,802 | -- |
| Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries | KuKu | 2025-12-22 | 1,274 | -- |
| Decoding Strategies in Large Language Models | Maxime Labonne | 2024-10-29 | 4,166 | -- |
| The Optimal Architecture for Small Language Models | Asankhaya Sharma | 2025-12-26 | 2,348 | -- |
| Deriving the PPO Loss from First Principles | aayush garg | 2025-12-25 | 12,448 | -- |
| Continuity as a First-Class System Property in Artificial Intelligence | Jeremy Felps | 2025-12-30 | 1,462 | -- |
| System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience | Asankhaya Sharma | 2025-06-02 | 1,027 | -- |
| Deriving the DPO Loss from First Principles | aayush garg | 2025-12-30 | 7,331 | -- |
| Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B … | weitaofeng | 2026-01-01 | 1,778 | -- |
| OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve | Asankhaya Sharma | 2025-05-20 | 1,959 | -- |
| Building Conversational AI: A Deep Dive into Voice Agent Architectures and Best … | abdeljalil_elma | 2025-09-02 | 1,854 | -- |
| We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for … | Clem 🤗, Steve Nguyen, and Jeremy Laville | 2025-07-08 | 593 | -- |
| Create Mixtures of Experts with MergeKit | Maxime Labonne | 2024-03-28 | 2,007 | -- |
| The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on … | Yağız Çalık | 2026-01-02 | 5,072 | -- |
| What are Embeddings and Vector Databases? | Damien B | 2024-08-20 | 1,392 | -- |
| Introduction to Quantization cooked in 🤗 with 💗🧑🍳 | merve | 2023-08-25 | 1,372 | -- |
| Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture | Basma Boussaha, Mohammed Alyafeai, Ahmed Alzubaidi, Leen AlQadi, Shaikha Alsuwaidi, Omar saif alkaabi, Hamza Alobeidli, and Hakim Hacid | 2026-01-05 | 1,838 | -- |
| TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell | Konstantin | 2026-01-05 | 3,309 | -- |
| Introducing Falcon H1R 7B | Iheb Chaabane, Puneesh Khanna, Suhail M Shah, Slim Frikha, Shi Hu, Abdalgader Abubaker, Reda alami, Mike Lubinets, Mohamed El Amine Seddik, and Hakim Hacid | 2026-01-05 | 1,332 | -- |
| Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem | Marco Pavone | 2026-01-05 | 893 | -- |
| Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models | Ashish Chadha | 2026-01-03 | 2,023 | -- |
| NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI | Tsung-Yi Lin and Debraj Sinha | 2026-01-05 | 1,037 | -- |
| Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR | Kunal Dhawan, Adi- margolin, Gordana Neskovic, Maryam Motamedi, and Yasmina Benkhoui | 2026-01-05 | 1,860 | -- |
| NVIDIA brings agents to life with DGX Spark and Reachy Mini | Jeff Boudier, Nader Khalil, and Alec Fong | 2026-01-05 | 2,128 | -- |
| M2.1: Multilingual and Multi-Task Coding with Strong Generalization | MiniMax | 2026-01-05 | 2,306 | -- |
| Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot | Raffaello Bonghi, lior ben horin, Kartik S, and Kalyan Vadrevu | 2026-01-05 | 1,038 | -- |
| Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval … | Ronay Ak, Gabriel de Souza Pereira Moreira, and Bo Liu | 2026-01-06 | 1,492 | -- |
| OpenMed: Six Months of Open-Source Medical AI and the Road Ahead | Maziyar Panahi | 2026-01-06 | 2,424 | -- |
| Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads | MiniMax | 2026-01-06 | 736 | -- |
| Diversity Vs Density: A data strategy comparison for fine-tuning VLMs | Akhil Theerthala | 2026-01-06 | 2,301 | -- |
| 🥃 Distilling Tiny Embeddings | David Mezzetti | 2026-01-10 | 1,082 | -- |
| How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, … | Sherry Chen | 2025-09-30 | 3,271 | -- |
| Introducing OptiMind, a research model designed for optimization | Anson Ho, Sirui Li, and Ishai Menache | 2026-01-15 | 395 | -- |
| From Image-to-LoRA to In-Context Edit | kelseye.xh and Zhongjie Duan | 2025-12-29 | 936 | -- |
| Common AI Model Formats | Xuan-Son Nguyen | 2025-02-27 | 2,109 | -- |
| How We Built a Semantic Highlight Model To Save Token Cost for … | Cheney Zhang and Jiang Chen | 2026-01-15 | 2,344 | -- |
| Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments | Bingyang Ye and Shan Chen | 2026-01-13 | 2,717 | -- |
| Open Responses: What you need to know | shaun smith, ben burtenshaw, merve, and Pedro Cuenca | 2026-01-15 | 1,344 | -- |
| Beyond Brute Force: Why LoongFlow is the “Thinking” Evolution of OpenEvolve | Xunan Dai | 2026-01-16 | 1,108 | -- |
| ColPali: Efficient Document Retrieval with Vision Language Models 👀 | Manuel Faysse | 2024-07-05 | 1,399 | -- |
| SmolLM-Smashed: Tiny Giants, Optimized for Speed | David Berenstein | 2026-01-13 | 982 | -- |
| VLM-OCR Recipes on GPU Infrastructure | Florent Gbelidji | 2026-01-15 | 2,281 | -- |
| The Large Language Model Course | Maxime Labonne | 2025-01-16 | 4,256 | -- |
| Reviewer Two (but it's an OpenEnv) | Chris von Csefalvay | 2026-01-13 | 1,653 | -- |
| Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments | ben burtenshaw | 2026-01-20 | 1,158 | -- |
| LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family | Said Taghadouini, Adrien Cavaillès, and Baptiste Aubertin | 2026-01-19 | 934 | -- |
| Differential Transformer V2 | Li Dong | 2026-01-20 | 3,136 | -- |
| 🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models | Fanny Jourdan and Antonin Poché | 2026-01-20 | 2,112 | -- |
| New in llama.cpp: Anthropic Messages API | Xuan-Son Nguyen and Victor Mustar | 2026-01-19 | 541 | -- |
| One Year Since the “DeepSeek Moment” | Adina Yakefu and Irene Solaiman | 2026-01-20 | 1,617 | -- |
| Optimizing GLM4-MoE for Production: 65% Faster TTFT with SGLang | Novita AI | 2026-01-22 | 1,047 | -- |
| Security, Governance and Performance for Dell On-Prem AI Builders | Balachandran Rajendran, Juan Julián, Alvaro Bartolome, Enrique Hernández Calabrés, Simon Pagezy, and Jeff Boudier | 2026-01-21 | 1,064 | -- |
| RexRerankers: SOTA Rankers for Product Discovery and AI Assistants | Rahul Bajaj, Anuj Garg, and Jaya Nupur | 2026-01-24 | 3,704 | -- |
| Challenges of Synthetic Dataset Generation | Rishiraj Acharya | 2026-01-21 | 942 | -- |
| Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models | Asankhaya Sharma | 2026-01-23 | 1,825 | -- |
| AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality | Dhaval Patel, James Rayfield, Saumya Ahuja, Chathurangi Shyalika, Shuxin Lin, and Zhou | 2026-01-21 | 1,505 | -- |
| “DeepSeek R1 时刻” 一周年 | vansin | 2026-01-20 | 315 | -- |
| Benchmark Smarter: Tailor Your Model Evaluation Suite with EvalScope | kelseye.xh | 2026-01-22 | 1,973 | -- |
| Waypoint-1: Real-time Interactive Video Diffusion from Overworld | Andrew Lapp, Louis Castricato, Scott Fox, Shahbuland Matiana, and David Rossi | 2026-01-20 | 853 | -- |
| A Beginner-Friendly PyTorch Tutorial: Build and Train Your First Model | Daniel Voigt Godoy | 2025-01-20 | 8,215 | -- |
| Why Your AI Strategy Needs Hugging Face Storage | Adrian Lepers | 2026-01-26 | 1,008 | -- |
| NVIDIA Earth-2 Open Models Span the Whole Weather Stack | Mike Pritchard, Jaideep Pathak, Jean Kossaifi, and Aayush Gupta | 2026-01-26 | 736 | -- |
| Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs | Omar saif alkaabi, Ahmed Alzubaidi, Hamza Alobeidli, Shaikha Alsuwaidi, Mohammed Alyafeai, Leen AlQadi, Basma Boussaha, and Hakim Hacid | 2026-01-27 | 1,585 | -- |
| Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective | Jason Zhu, Hejian Sang, Arup De, Rohit Jain, and Yanning Chen | 2026-01-27 | 4,160 | -- |
| Friends and Grandmothers in Silico | Itay Yona | 2026-01-24 | 4,089 | -- |
| Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. | Maziyar Panahi | 2025-07-16 | 2,205 | -- |
| Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek | Adina Yakefu and Irene Solaiman | 2026-01-27 | 1,324 | -- |
| Nemotron-Personas-Brazil: Co-Designed Data for Sovereign AI | Andre Manoel, Yev Meyer, Shyamala Prayaga, Will Jennings, and bardiya sadeghi | 2026-01-28 | 903 | -- |
| The Great Classification Showdown: OSS vs BERT on Consumer Hardware | Ben Toussaint | 2026-01-26 | 1,938 | -- |
| We got Claude to teach open models how to write CUDA kernels! | ben burtenshaw, shaun smith, merve, and Pedro Cuenca | 2026-01-28 | 2,350 | -- |
| Slashing torch.compile Warmup & LoRA Swapping Times with Pruna | John Rachwan, Johanna Sommer, Bertrand Charpentier, and Sara Han Díaz | 2026-01-28 | 1,513 | -- |
| Nemotron-Personas-Singapore: Co-Designed Data for Sovereign AI | Will Jennings, Dane Corneil, Yev Meyer, Verdi March, Shyamala Prayaga, and bardiya sadeghi | 2026-01-27 | 1,041 | -- |
| TruthTensor: LLM Evalution in Prediction Markets Under Drift and Market Baseline | Elena Pashkova, shirin Shahabi, Hudson, and Ronald Chan | 2026-01-29 | 1,631 | -- |
| Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp | Doctor Shotgun and Geechan | 2026-01-30 | 2,508 | -- |
| Introducing NVIDIA Cosmos Policy for Advanced Robot Control | Pranjali Joshi, Tsung-Yi Lin, Jinwei Gu, and Prachi Mishra | 2026-01-29 | 1,333 | -- |
| Introducing Daggr: Chain apps programmatically, inspect visually | merve, yuvraj sharma, Abubakar Abid, hysts, and Pedro Cuenca | 2026-01-29 | 1,559 | -- |
| MamayLM, передова мовна модель для української мови | Hanna Yukhymenko, Anton Alexandrov, and Martin Vechev | 2025-04-23 | 1,941 | -- |
| Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 … | Alvaro Moran | 2026-02-02 | 2,906 | -- |
| Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance | Jun Zhang, Jason Zheng, Boxi Cao, and ReasoningLens | 2026-02-03 | 693 | -- |
| Training Design for Text-to-Image Models: Lessons from Ablations | David Bertoin, Roman Frigg, and Jon Almazán | 2026-02-03 | 7,420 | -- |
| The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ | Adina Yakefu and Irene Solaiman | 2026-02-03 | 1,602 | -- |
| H Company's new Holo2 model takes the lead in UI Localization | Ramzi De Coster, Hamza Benchekroun, and Aurélien Lac | 2026-02-03 | 214 | -- |
| Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s … | Ronay Ak and Gabriel de Souza Pereira Moreira | 2026-02-04 | 1,048 | -- |
| Nvidia Agentic Smart Router on Dell Enterprise Hub : Deepdive on Architecture,Design … | Khushboo Rathi and Balachandran Rajendran | 2026-02-03 | 995 | -- |
| Getting Started With Hugging Face in 10 Minutes | Vladislav Guzey | 2025-03-10 | 1,514 | -- |
| CRAFT: Continuous Reasoning and Agentic Feedback Tuning | Valentin, Denis Timonin, Alexandr, and Alexey | 2026-02-05 | 813 | -- |
| Introducing SyGra Studio | Surajit Dasgupta, Bidyapati Pradhan, Amit Kumar Saha, Vipul Mittal, and Sriram Puttagunta | 2026-02-05 | 747 | -- |
| 🚀 SyGra V2.0.0 | Sriram Puttagunta, Surajit Dasgupta, Bidyapati Pradhan, Amit Kumar Saha, and Vipul Mittal | 2026-02-05 | 724 | -- |
| Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth | Maxime Labonne | 2024-07-29 | 2,923 | -- |
| From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails … | Maziyar Panahi | 2026-02-07 | 5,766 | -- |
| Transformers.js v4 Preview: Now Available on NPM! | Joshua and Nico Martin | 2026-02-09 | 1,185 | -- |
| Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning | D K | 2024-02-20 | 1,793 | -- |
| Training Qwen3 VL to label bbox : synthetic data, environment and training … | Ulrick BLE | 2026-02-09 | 2,544 | -- |
| 🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs | Guanchu | 2026-02-11 | 616 | -- |
| Building a Mood-Based Movie Recommendation Engine with Voyage-4-nano, Hugging Face, and MongoDB … | Arkadiusz Borucki | 2026-02-08 | 3,315 | -- |
| Enabling Large Scale RLHF of GPTOSS with Megatron backend in VeRL | LEI WANG | 2026-02-10 | 5,934 | -- |
| Why SGLang is a Game-Changer for LLM Workflows | Makwana Paresh | 2025-07-07 | 1,639 | -- |
| OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments | Christian Washington, Ankit Jasuja, Santosh Sah, Lewis Tunstall, and ben burtenshaw | 2026-02-12 | 1,656 | -- |
| LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search … | Antoine Chaffin and Raphael | 2026-02-12 | 4,993 | -- |
| Transformers | Esmail Atta Gumaan | 2024-07-02 | 2,730 | -- |
| Forge: Scalable Agent RL Framework and Algorithm | MiniMax, Hyn, zhi zhang, Jiayuan Song, Da Chen, xkc, Yaoyao, kennyKK, and zpysky1125 | 2026-02-13 | 3,387 | -- |
| How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs … | Aritra Roy Gosthipaty | 2026-02-12 | 606 | -- |
| Custom Kernels for All from Codex and Claude | ben burtenshaw, Sayak Paul, Aritra Roy Gosthipaty, and shaun smith | 2026-02-13 | 1,792 | -- |
| Model2Vec: Distill a Small Fast Model from any Sentence Transformer | Thomas van Dongen and Stéphan Tulkens | 2024-10-14 | 2,441 | -- |
| What superpower does Kimi-K2.5 bring to the table? | Leco Li | 2026-02-13 | 1,154 | -- |
| Unbelievable! Run 70B LLM Inference on a Single 4GB GPU with This … | Gavin Li | 2023-11-30 | 1,279 | -- |
| The Chinese GLM-5 Model Now Ranks #2 in Arabic Language Performance | Karim Ouda | 2026-02-16 | 322 | -- |
| Compute and Competition in AI: Different FlOPs for Different Folks | Yacine Jernite and Sasha Luccioni | 2026-02-12 | 1,917 | -- |
| How to Build a Benchmark with a Private Test Set on Hugging … | Georgia Channing | 2026-02-16 | 1,775 | -- |
| Qwen3.5: Nobody Agrees on Attention Anymore | Maxime Labonne | 2026-02-17 | 1,192 | -- |
| NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル | Atsunori Fujita, Kotaro Yamamoto, Masaya Ogushi, Vincent Gong, Ameya Sunil Mahabaleshwarkar, and Yoshi Suhara | 2026-02-17 | 297 | -- |
| DenseR: Dense Rewards For Free in LLM Reasoning | Hritik Bansal | 2026-02-18 | 3,977 | -- |
| De-mystifying Multimodal Learning: Enabiling Vision in Language Models | Matteo Nulli | 2026-02-17 | 2,797 | -- |
| One-Shot Any Web App with Gradio's gr.HTML | yuvraj sharma, hysts, and Freddy Boulton | 2026-02-18 | 829 | -- |
| Gemma3NPC - A Solution for Live NPC Interactions | Hexi Wang and Keegan Carey | 2025-08-14 | 5,954 | -- |
| RynnEC: Bringing MLLMs into Embodied World | Ronghao Dang, YuqianYuan, yunxuan mao, Kehan Li, jiangpin, zhikai wang, and Xin Li | 2025-08-14 | 1,382 | -- |
| IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and … | Ayhan Sebin, Rohan Arora, and Saurabh Jha | 2026-02-18 | 2,253 | -- |
| Did GPT 5.2 make a breakthrough discovery in theoretical physics? | David Louapre | 2026-02-19 | 4,541 | -- |
| ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models? | Antoine Chaffin, Luca Arnaboldi, Amélie Chatelain, and Florent Krzakala | 2026-02-19 | 2,306 | -- |
| 「データ不足」の壁を越える:合成ペルソナが日本のAI開発を加速 | Atsunori Fujita, Masaya Ogushi, Will Jennings, Yev Meyer, Kotaro Yamamoto, Yoshi Suhara, Vincent Gong, and Dane Corneil | 2026-02-19 | 280 | -- |
| I Let a Lobster Run My Jetson: What OpenClaw Taught Me About … | Andres Marafioti | 2026-02-19 | 1,509 | -- |
| Train AI models with Unsloth and Hugging Face Jobs for FREE | ben burtenshaw, Daniel (Unsloth), Michael Han, Maxime Labonne, Daniel van Strien, and shaun smith | 2026-02-20 | 944 | -- |
| Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) | Aritra Roy Gosthipaty | 2025-01-19 | 4,342 | -- |
| GGML and llama.cpp join HF to ensure the long-term progress of Local … | Georgi Gerganov, Xuan-Son Nguyen, Aleksander Grygier, Lysandre, Victor Mustar, and Julien Chaumond | 2026-02-20 | 936 | -- |
| Introducing Legal RAG Bench | Umar Butler and Abdur-Rahman Butler | 2026-02-20 | 3,235 | -- |
| FINAL Bench: The Real Bottleneck to AGI Is Self-Correction | VIDRAFT_LAB | 2026-02-21 | 1,146 | -- |
| How We Learned to Talk to Machines | Tyler Williams | 2026-02-20 | 1,156 | -- |
| Kimi K2.5: Still Worth It After Two Weeks? | Maxime Labonne | 2026-02-23 | 1,448 | -- |
| Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism? | VIDRAFT_LAB | 2026-02-24 | 2,770 | -- |
| Follow the White Rabbit: Using Embeddings So You Never Get Lost in … | David Corvoysier | 2026-02-23 | 1,420 | -- |
| MAEB: Evaluating Audio Embeddings at Scale | Adnan El Assadi, Solomatin Roman, Kenneth C. Enevoldsen, and Isaac Chung | 2026-02-24 | 1,349 | -- |
| A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and … | Karina Zadorozhny | 2026-01-19 | 7,738 | -- |
| Deploying Open Source Vision Language Models (VLM) on Jetson | Mitesh Patel, Johnny Nuñez Cano, and Raymond Lo | 2026-02-24 | 1,591 | -- |