Company
Date Published
Author
Baseten
Word count
605
Language
English
Hacker News points
None

Summary

There's a new open source LLM, Mistral 7B, which surpasses other models on benchmarks and has strong code-generation capabilities. It was released with an Apache 2.0 license and includes both a chat-tuned instruct variant and a base variant. The LLM's sliding 4k-token context window is an interesting new approach to attention during inference. Additionally, Baseten has introduced model observability features, including error codes, wake from sleep functionality, and a replica chart on the model metrics page. Furthermore, Baseten will host a series of in-person events exploring open source AI, starting with an event in New York and San Francisco.