Home / Companies / Vespa / Blog / Post Details
Content Deep Dive

Blog recommendation with neural network models

Blog post from Vespa

Post Details
Company
Date Published
Author
-
Word Count
2,053
Language
English
Hacker News Points
-
Summary

The blog post describes how to deploy neural network models in Vespa using the Tensor Framework, specifically focusing on a two-phase ranking system to optimize document recommendations. Initially, a 2-layer fully connected neural network model is trained with TensorFlow using user and document latent factors, and the training data comprises both positive and negative feedback to improve model accuracy. The post explains the deployment of this model in Vespa, where a simpler ranking expression is used in the first phase to filter documents, followed by a more complex neural network-based ranking in the second phase to rank the top 200 documents more accurately. This two-phase approach enhances the efficiency of CPU usage and improves the recommendation quality, leading to a 7% increase in relevant document retrieval and better ranking of documents in the recommendation list. The process involves exporting model parameters as tensors and defining ranking expressions in Vespa, ultimately illustrating the benefits of integrating neural network models into Vespa's ranking architecture.