Company
Date Published
Author
Kyle Corbitt
Word count
300
Language
English
Hacker News points
None

Summary

Mistral, a new model developed by Meta AI, has been found to outperform similar small models on benchmarks and offers an 8K context window, making it suitable for fine-tuning task-specific LLMs under 34B parameters. The performance improvements of Mistral over Llama 2 are significant and generalize across many task types, with users consistently preferring Mistral's output or considering it comparable to Llama 2 13B in non-deterministic tasks. While there are areas yet to be evaluated, such as multilingual tasks, overall, Mistral is an extremely strong model that can save customers significant money by decreasing inference costs while improving accuracy and reducing latency.