Fireworks AI Hacker News

Filters

Since:

Posts by Month (25 total)

Hacker News Posts

Search:

Title	Points	Comments	Date
Fireworks: Function Calling Model and API	53	--	2023-12-21
FireAttention V3: Enabling AMD as a Viable Alternative for GPU Inference	20	--	2024-10-17
Fireworks F1: A Breakthrough in Complex Reasoning with Compound AI	17	--	2024-11-18
FireFunction V1 – GPT-4-level function calling model – 4x faster, open weights	7	--	2024-02-22
How are people training this LLMs? Dont they need lot of money?	4	--	2024-01-19
LLM Eval Driven Development with Claude Code	4	--	2025-08-28
How we fixed prompt injection for all models on Fireworks	4	--	2026-04-23
Serving Open Source Models 4x faster than vLLM by quantizing with ~no …	3	--	2024-01-10
FireAttention – Serving Mixtral and open-source MoE models at 4x speed vs. …	3	--	2024-01-09
Multi-Query Attention Is All You Need	3	--	2023-07-13
DeepSeek V4 Pro: Validating Frontier Models for Production	3	--	2026-04-28
Accelerating Code Completion with Fireworks Fast LLM Inference	2	--	2023-10-11
How Fireworks evaluates quantization precisely and interpretably	2	--	2024-08-03
The Benchmark Gap: What It Takes to Ship Kimi K2.5	2	--	2026-02-16
Deep-Dive into LLM Fine-Tuning	2	--	2026-02-23
Turn Your LLM into a Calibrated Classifier for $2	2	--	2026-02-20
Why Building Mega Clusters Is Wrong	2	--	2026-03-21
Fireworks.ai: Language Model Serving with Custom LoRA Fine-Tuned Models	1	--	2023-08-18
Can DeepSeek R1 Teach Better Than Humans?	1	--	2025-02-05
Document Inlining: Crossing the Modality Gap with Compound AI	1	--	2024-12-23
GPUs on-demand: Not serverless, not reserved, but some third thing	1	--	2024-06-07
Natural Language → SQL with Reinforcement Fine Tuning (RFT)	1	--	2025-08-18
Turning Production Logs into Evaluation Datasets: A Data-Driven Approach	1	--	2026-02-16
DPO, your simplest RL pipeline with two rollouts	1	--	2026-02-18
Frontier RL Is Cheaper Than You Think	1	--	2026-03-26

Fireworks AI on HN