Simplifying Langfuse for Scale
Blog post from Langfuse
Langfuse, an open-source LLM engineering platform, has transitioned to an observation-centric data model to enhance performance and simplify its product. This shift, live in Beta on Langfuse Cloud, involves moving from a two-entity model using Postgres and ClickHouse to a single, wide ClickHouse table, optimizing for efficient data access without the need for joins or deduplication. This change significantly speeds up data processing, reducing dashboard load times and improving query performance. Previously, high latencies and error rates were issues as the platform scaled, especially with large users experiencing limited data visibility. The new model leverages OpenTelemetry spans, which are immutable, allowing for streamlined data handling and reduced costs, particularly in storage and API calls. Langfuse's updates also include a new observations-first UI, eliminating the need for complex joins and focusing on real-time data propagation using OpenTelemetry's Context and Baggages. This restructuring aims to provide a more robust and efficient experience, with plans for further implementation and migration tools for self-hosters in upcoming releases.