Jacob Lee, a JS/TS maintainer at LangChainAI, explores building a web app using only local models and browser-compatible technologies, in response to growing interest in machine learning among JavaScript developers. The project is a web app that performs Retrieval-Augmented Generation (RAG), letting users query their own unstructured data entirely on their machines, with cost efficiency, privacy, and potential speed as the main benefits. Using JavaScript-compatible tools (LangChain for document loading and splitting, a HuggingFace embeddings model for vector representations, and Voy for vector storage), Lee sets up a pipeline for querying the data. Running large language models (LLMs) fully in-browser remains a challenge, however, so the Mistral 7B model is served locally via Ollama instead. The project highlights the rapid advancement of open-source models and the potential for future web apps to leverage local hardware, though a new browser API may be needed before non-technical users can easily connect to locally running LLMs.
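To make the pipeline concrete, here is a minimal TypeScript sketch of the RAG retrieval step described above. It is illustrative only: a toy bag-of-words embedding stands in for the real HuggingFace embeddings model, and a plain in-memory array stands in for the Voy vector store; the chunking, vocabulary, and sample texts are all hypothetical.

```typescript
type Doc = { text: string; vector: number[] };

// Toy embedding: word counts over a tiny fixed vocabulary.
// (A stand-in for a real sentence-embedding model such as the
// HuggingFace model mentioned in the article.)
const VOCAB = ["browser", "model", "privacy", "vector", "ollama", "mistral"];
function embed(text: string): number[] {
  const words = text.toLowerCase().split(/\W+/);
  return VOCAB.map((v) => words.filter((w) => w === v).length);
}

// Cosine similarity between two vectors; 0 if either is all zeros.
function cosine(a: number[], b: number[]): number {
  const dot = a.reduce((s, x, i) => s + x * b[i], 0);
  const na = Math.sqrt(a.reduce((s, x) => s + x * x, 0));
  const nb = Math.sqrt(b.reduce((s, x) => s + x * x, 0));
  return na && nb ? dot / (na * nb) : 0;
}

// "Index" a few document chunks, as a vector store would.
const store: Doc[] = [
  "Ollama serves the Mistral model locally",
  "Voy stores each vector in the browser",
  "Local inference keeps data private",
].map((text) => ({ text, vector: embed(text) }));

// Retrieve the k chunks most similar to the query embedding.
function retrieve(query: string, k = 1): Doc[] {
  const q = embed(query);
  return [...store]
    .sort((a, b) => cosine(b.vector, q) - cosine(a.vector, q))
    .slice(0, k);
}

// The retrieved chunk would then be stuffed into the LLM prompt
// as context before the question is sent to the local model.
console.log(retrieve("which model does ollama run?")[0].text);
// → Ollama serves the Mistral model locally
```

In the actual app, the same three roles are played by LangChain text splitters, the HuggingFace embeddings model, and Voy, with the final prompt answered by Mistral 7B running under Ollama rather than a cloud API.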