Fireworks has announced the integration of Meta's open-source Llama 3 models, available in 8-billion-parameter (8B) and 70-billion-parameter (70B) variants, into its platform, with improved speeds for both inference and fine-tuning. These models, which perform strongly on language-processing benchmarks, are part of a broader initiative by Meta to promote open and responsible AI development, with even larger models planned for future release. Fireworks gives businesses access to these models through a serverless inference stack, enabling cost-effective deployment of numerous LoRA adapters. Companies such as Quora and Sourcegraph already use Fireworks for their AI-driven applications. By providing a streamlined setup process, Fireworks aims to make state-of-the-art AI accessible to developers and enterprises, supporting both dedicated and shared infrastructure for enhanced performance.
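As a rough illustration of how a serverless deployment like this is typically called, the sketch below assembles a chat-completion request body for an OpenAI-compatible endpoint. The endpoint URL and the model identifier are assumptions based on Fireworks' published naming scheme, not details taken from this announcement; the request is constructed but not sent, since sending requires an API key.

```python
import json

# Assumed endpoint and model ID for Fireworks' OpenAI-compatible
# serverless API -- verify both against the current Fireworks docs.
FIREWORKS_URL = "https://api.fireworks.ai/inference/v1/chat/completions"
MODEL = "accounts/fireworks/models/llama-v3-8b-instruct"  # hypothetical ID

def build_request(prompt: str, max_tokens: int = 256) -> dict:
    """Assemble the JSON body for a serverless chat-completion call."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }

body = build_request("Summarize Llama 3 in one sentence.")
print(json.dumps(body, indent=2))
```

In practice the same body would be POSTed to `FIREWORKS_URL` with an `Authorization: Bearer <API key>` header, e.g. via `urllib.request` or an OpenAI-style client configured with Fireworks' base URL.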