Company
Date Published
Author
Hemant Jain
Word count
1701
Language
English
Hacker News points
None

Summary

Cohere has introduced North, an enterprise-ready AI platform designed to improve workplace productivity. Its offerings include Compass, an intelligent search and discovery system, and Command, a suite of scalable language models. The focus here is T-Few finetuning, a technique that finetunes a large language model (LLM) by updating only a small fraction of its weights, which reduces training time and resource use. Because each finetune's weights are a small fraction of the baseline model's size, the finetunes are highly portable, and several of them can be stacked on a single GPU and served concurrently, which improves serving scalability and GPU utilization. The approach suits applications that need efficient, high-performance language models, in industries including technology, financial services, healthcare, manufacturing, and the public sector.
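
The stacking idea in the summary above can be illustrated with a toy sketch. It assumes, purely for illustration, that each finetune is a small per-output scaling vector applied on top of a frozen weight matrix shared by all finetunes. The summary does not describe the actual layer design, so the names `BASE`, `ADAPTERS`, `stacked_forward`, and `adapter_fraction`, and the finetune IDs, are all hypothetical.

```python
from typing import Dict, List, Tuple

Vector = List[float]
Matrix = List[Vector]


def base_forward(weights: Matrix, x: Vector) -> Vector:
    """Frozen base layer shared by all finetunes: a matrix-vector product."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def stacked_forward(
    weights: Matrix,
    adapters: Dict[str, Vector],
    batch: List[Tuple[str, Vector]],
) -> List[Vector]:
    """Serve a mixed batch of requests against one copy of the base weights.

    Each request names the finetune it targets; only the small scaling
    vector differs from request to request (an illustrative assumption).
    """
    outputs = []
    for finetune_id, x in batch:
        hidden = base_forward(weights, x)   # same base weights for everyone
        scale = adapters[finetune_id]       # small finetune-specific weights
        outputs.append([s * h for s, h in zip(scale, hidden)])
    return outputs


def adapter_fraction(weights: Matrix, adapter: Vector) -> float:
    """Parameter count of one finetune relative to the frozen base layer."""
    base_params = sum(len(row) for row in weights)
    return len(adapter) / base_params


# Hypothetical toy model: a 2x3 base layer and two finetunes.
BASE: Matrix = [[1.0, 0.0, 2.0], [0.0, 1.0, 1.0]]
ADAPTERS: Dict[str, Vector] = {
    "support-bot": [1.0, 0.5],
    "legal-summarizer": [2.0, 1.0],
}

if __name__ == "__main__":
    batch = [
        ("support-bot", [1.0, 2.0, 3.0]),
        ("legal-summarizer", [1.0, 2.0, 3.0]),
    ]
    for (name, _), out in zip(batch, stacked_forward(BASE, ADAPTERS, batch)):
        print(name, out)
    print("adapter / base size:", adapter_fraction(BASE, ADAPTERS["support-bot"]))
```

In this toy the two requests share one copy of `BASE`, and each finetune adds only two parameters to a six-parameter layer. In a real LLM the ratio would be far smaller, which is what makes storing and co-locating many finetunes on one GPU plausible.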