Vespa Newsletter, April 2024

Post Details

Company

Vespa

Date Published

April 15, 2024

Author

Kristian Aune

Word Count

605

Language

English

Hacker News Points

-

Source URL

blog.vespa.ai/vespa-newsletter-april-2024

Summary

In the April 2024 Vespa Newsletter, significant updates and advancements are highlighted, including the introduction of the SPLADE Embedder, which enhances learned sparse retrieval by using term impact scores from large language models, and the support for half-precision floating-point (float16) ONNX models, improving inference performance on GPUs. New guides are available for using Cohere embedding models, which now support binary and int8 vectors, offering cost savings and performance benefits. The newsletter also discusses enhancements to the ColBERT embedder, allowing multi-paragraph inputs and highlights recent blog posts and case studies showcasing Vespa's application in semantic search and recommendation systems across various companies. Additionally, it announces upcoming meetups and conferences focused on improving the usefulness of large language models and retrieval augmented generation.