Embedding English Wikipedia in under 15 minutes

Company

Modal

Date Published

Jan. 23, 2024

Author

Jason Liu

Word count

2433

Language

English

Hacker News points

URL

modal.com/blog/embedding-wikipedia

Summary

The text discusses the use of open-source embedding models and Modal, a serverless platform, to create production-ready applications using large language models. The authors highlight the advantages of using open-source models, such as fine-tuning with user data, and demonstrate how to run large-scale batch jobs at scale using Modal's abstractions. They provide an example of running the entire English Wikipedia in just 15 minutes using Hugging Face's Text Embedding Inference service on Modal, showcasing the platform's ability to speed up feedback loops and enable experimentation and deployment. The authors also discuss further customizations, such as deploying on a schedule and uploading datasets to public or private repositories.