SILMA TTS: A Lightweight Open Bilingual Text to Speech Model

Post Details

Company

Hugging Face

Date Published

March 15, 2026

Author

Karim Ouda

Word Count

524

Company Posts That Month

63

Language

-

Hacker News Points

-

Post removed?

No

Source URL

huggingface.co/blog/silma-ai/opensource-arabic-english-text-to-speech-model

Summary

SILMA AI has introduced SILMA TTS v1, a lightweight, 150M-parameter bilingual text-to-speech model that supports both Arabic and English, leveraging the F5-TTS diffusion architecture. The model, which is open-source under the Apache 2.0 License, was meticulously pre-trained using a vast dataset of audio to ensure high-fidelity speech synthesis, instant voice cloning, and ultra-low latency, making it suitable for real-time applications. By optimizing the original F5-TTS model and focusing on Arabic language support, SILMA AI aims to address the scarcity of high-quality Arabic audio data and overcome previous licensing constraints, providing a valuable resource for both research and commercial purposes. The development involved significant architectural optimizations, extensive pretraining on high-quality data, and targeted fine-tuning for Arabic, enhancing text handling and audio quality. Users can easily implement the model via simple installation commands, with further resources available on platforms like GitHub and Hugging Face.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
AI Model Fine-tuning	2	906	165	54	-16%
Real-time	1	6,457	1,307	242	+28%
Voice AI	1	2,447	202	43	+13%

Use This Data

Use this post, company, and trend context to find content marketing opportunities, perform competitive analysis, or address product feature gaps via the Plushcap MCP server or the Plushcap API.