Company
Date Published
Author
Kate Soule and Rameswar Panda
Word count
544
Language
-
Hacker News points
None

Summary

IBM's Granite 4.0 Nano models, the latest addition to the Granite 4.0 family, are the company's smallest and most efficient AI models, designed for edge and on-device applications. They use a hybrid-SSM (state space model) architecture, target strong performance at significantly lower parameter counts, and are released under an Apache 2.0 license. The authors report that the models outperform similarly sized models from competitors such as Alibaba and Google in general knowledge, math, code, and safety, as well as on specific tasks critical for agentic workflows. The release includes both instruct models and their base model counterparts, and is backed by IBM's ISO 42001 certification for responsible model development. With this release, IBM continues to build capable models that do not rely on massive parameter counts, and it signals further work on the Granite 4.0 family.