Introducing Prem-1B

Post Details

Company

Prem AI

Date Published

Sept. 21, 2024

Author

PremAI

Word Count

2,957

Company Posts That Month

3

Language

English

Hacker News Points

-

Post removed?

No

Source URL

www.premai.io/blog/introducing-prem-1b

Summary

Prem AI has introduced the Prem-1B series of open-source large language models, intended to broaden access to capabilities that have traditionally been available only through closed-model APIs. The models are available on Hugging Face under the Apache License 2.0, are optimized for Retrieval-Augmented Generation (RAG), and support an extended context length of 8,192 tokens so that multi-turn conversations can be handled efficiently. Training ran on 16 H100 GPUs coordinated through Ray for multi-GPU training, and the architecture is a decoder-only transformer similar to Llama 2. Pre-training used the SlimPajama dataset and Llama's tokenizer over a corpus of 600 billion tokens, and chat fine-tuning then adapted the model for conversational use. Direct Preference Optimization (DPO) was applied afterwards to align the model's responses with human preferences, and the post reports competitive performance across various benchmarks. Future plans include improving the model's performance, exploring further alignment techniques, and raising the quality of the data used for training and fine-tuning.
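
The summary names Direct Preference Optimization without showing what it optimizes. The following is a minimal sketch of the standard DPO loss for a single preference pair, not Prem AI's training code. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions chosen for illustration. The inputs are summed log-probabilities of the chosen and rejected responses under the model being trained (the policy) and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) response pair.

    All names and the default beta are illustrative assumptions;
    the source post does not give its hyperparameters.
    """
    # How much more the policy likes each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp

    # Scaled difference of the two margins.
    logits = beta * (chosen_margin - rejected_margin)

    # -log(sigmoid(logits)), written in a numerically stable form.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))


# Policy identical to the reference: the loss equals log(2).
baseline = dpo_loss(-2.0, -2.0, -2.0, -2.0)

# Policy raises the chosen response and lowers the rejected one: lower loss.
improved = dpo_loss(-1.0, -3.0, -2.0, -2.0)
```

The loss falls below `log(2)` only when the policy favors the chosen response over the rejected one more strongly than the reference model does, which is the sense in which DPO aligns outputs with recorded human preferences.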

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
LLM	15	3,889	441	129	+7%
RAG	14	1,936	254	78	-19%
AI Model Fine-tuning	13	628	146	67	-32%
Reinforcement learning	10	No monthly metrics for this publish month.
