Blog recommendation with neural network models
Blog post from Vespa
The blog post describes how to deploy neural network models in Vespa using the Tensor Framework, specifically focusing on a two-phase ranking system to optimize document recommendations. Initially, a 2-layer fully connected neural network model is trained with TensorFlow using user and document latent factors, and the training data comprises both positive and negative feedback to improve model accuracy. The post explains the deployment of this model in Vespa, where a simpler ranking expression is used in the first phase to filter documents, followed by a more complex neural network-based ranking in the second phase to rank the top 200 documents more accurately. This two-phase approach enhances the efficiency of CPU usage and improves the recommendation quality, leading to a 7% increase in relevant document retrieval and better ranking of documents in the recommendation list. The process involves exporting model parameters as tensors and defining ranking expressions in Vespa, ultimately illustrating the benefits of integrating neural network models into Vespa's ranking architecture.