Home / Companies / Fireworks AI / Hacker News

Fireworks AI on HN

23 posts with 1+ points since 2022

Filters
Since:
Posts by Month (23 total)
Hacker News Posts
Title Points Comments Date
Fireworks: Function Calling Model and API 53 -- 2023-12-21
FireAttention V3: Enabling AMD as a Viable Alternative for GPU Inference 20 -- 2024-10-17
Fireworks F1: A Breakthrough in Complex Reasoning with Compound AI 17 -- 2024-11-18
FireFunction V1 – GPT-4-level function calling model – 4x faster, open weights 7 -- 2024-02-22
How are people training this LLMs? Dont they need lot of money? 4 -- 2024-01-19
LLM Eval Driven Development with Claude Code 4 -- 2025-08-28
Serving Open Source Models 4x faster than vLLM by quantizing with ~no … 3 -- 2024-01-10
FireAttention – Serving Mixtral and open-source MoE models at 4x speed vs. … 3 -- 2024-01-09
Multi-Query Attention Is All You Need 3 -- 2023-07-13
Accelerating Code Completion with Fireworks Fast LLM Inference 2 -- 2023-10-11
How Fireworks evaluates quantization precisely and interpretably 2 -- 2024-08-03
The Benchmark Gap: What It Takes to Ship Kimi K2.5 2 -- 2026-02-16
Deep-Dive into LLM Fine-Tuning 2 -- 2026-02-23
Turn Your LLM into a Calibrated Classifier for $2 2 -- 2026-02-20
Why Building Mega Clusters Is Wrong 2 -- 2026-03-21
Fireworks.ai: Language Model Serving with Custom LoRA Fine-Tuned Models 1 -- 2023-08-18
Can DeepSeek R1 Teach Better Than Humans? 1 -- 2025-02-05
Document Inlining: Crossing the Modality Gap with Compound AI 1 -- 2024-12-23
GPUs on-demand: Not serverless, not reserved, but some third thing 1 -- 2024-06-07
Natural Language → SQL with Reinforcement Fine Tuning (RFT) 1 -- 2025-08-18
Turning Production Logs into Evaluation Datasets: A Data-Driven Approach 1 -- 2026-02-16
DPO, your simplest RL pipeline with two rollouts 1 -- 2026-02-18
Frontier RL Is Cheaper Than You Think 1 -- 2026-03-26