MamayLM, передова мовна модель для української мови
Blog post from HuggingFace
MamayLM is a powerful Ukrainian language model developed by researchers at INSAIT and ETH Zurich, designed to outperform models of similar size and even those significantly larger, such as Gemma2 27B and Llama 3.1 70B. With 9 billion parameters, MamayLM is resource-efficient, capable of operating on a single GPU, and excels in both Ukrainian and English language tasks. Built upon Google Gemma 2, it incorporates advanced data collection, model merging, and training techniques to enhance its linguistic capabilities, particularly in understanding and generating Ukrainian text. The model's proficiency offers substantial benefits for local businesses and government institutions, enabling the integration of cutting-edge AI technology without high costs or complex infrastructure. MamayLM's dual-language abilities make it valuable in fields like education and healthcare, where overcoming language barriers is crucial. It is available for use on the HuggingFace platform, with both standard and quantized versions published, offering a versatile tool for various applications.