Vespa Newsletter, April 2024
Blog post from Vespa
In the April 2024 Vespa Newsletter, significant updates and advancements are highlighted, including the introduction of the SPLADE Embedder, which enhances learned sparse retrieval by using term impact scores from large language models, and the support for half-precision floating-point (float16) ONNX models, improving inference performance on GPUs. New guides are available for using Cohere embedding models, which now support binary and int8 vectors, offering cost savings and performance benefits. The newsletter also discusses enhancements to the ColBERT embedder, allowing multi-paragraph inputs and highlights recent blog posts and case studies showcasing Vespa's application in semantic search and recommendation systems across various companies. Additionally, it announces upcoming meetups and conferences focused on improving the usefulness of large language models and retrieval augmented generation.