Home / Companies / Vespa / Blog / Post Details
Content Deep Dive

Vespa Newsletter, April 2024

Blog post from Vespa

Post Details
Company
Date Published
Author
Kristian Aune
Word Count
605
Language
English
Hacker News Points
-
Summary

In the April 2024 Vespa Newsletter, significant updates and advancements are highlighted, including the introduction of the SPLADE Embedder, which enhances learned sparse retrieval by using term impact scores from large language models, and the support for half-precision floating-point (float16) ONNX models, improving inference performance on GPUs. New guides are available for using Cohere embedding models, which now support binary and int8 vectors, offering cost savings and performance benefits. The newsletter also discusses enhancements to the ColBERT embedder, allowing multi-paragraph inputs and highlights recent blog posts and case studies showcasing Vespa's application in semantic search and recommendation systems across various companies. Additionally, it announces upcoming meetups and conferences focused on improving the usefulness of large language models and retrieval augmented generation.