• Companies
    All Pre-seed Seed Series A Series B Series C Series D Series E+ Public Acquired Private Equity Bootstrapped Random
    Metadata
    All Careers Pages LinkedIn X / Twitter GitHub Discord Slack YouTube Notes & news
  • Competitive Analysis
    All spaces List by company Random
  • Leaderboards
    All YouTube
    Hacker News Hits
    1000+ Upvotes 500+ Upvotes 250+ Upvotes 100+ Upvotes 50+ Upvotes
    Blog output SDKs
  • Developer Trends
  • About
    Creator Changelog Roadmap
Company Data Deep Dive

Baseten  Hacker News data

11 Hacker News submissions by month with at least 1
  • 1
  • 25
  • 50
  • 100
  • 250
  • 500
 points since the start of 2024
  • 2021
  • 2022
  • 2023
  • 2024
  • 2025

11 submissions with 1 points or greater

HN Points HN Title (Links to original post) Submitted Date
9 Show HN: Baseten Chains – Framework and SDK for Multi-Model AI Products 2024-06-27
2 Open Source Inference Engine Baseten Raises $40M from IVP, Spark and Greylock 2024-03-14
2 How to double tokens per second for Llama 3 with Medusa 2024-08-20
2 Show HN: Automatically Build Nvidia TRT-LLM Engines 2024-08-01
2 FP8: Efficient model inference with 8-bit floating point numbers 2024-03-08
1 How to build function calling and JSON mode for open-source and fine-tuned LLMs 2024-09-12
1 Show HN: 60% higher tokens per second for 70B custom LLMs 2024-07-31
1 Introduction to quantizing machine learning models 2024-02-16
1 Deploying custom ComfyUI workflows as APIs 2024-11-20
1 Continuous vs. dynamic batching for AI inference 2025-08-06
247 Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs 2025-08-07
  • By Matt Makai.
  • 2021-2025.