Google Vertex AI and SingleStore are being used to build enterprise-grade private Large Language Models (LLMs) applications that are data-aware and custom to a company's needs. Companies can now use Google Vertex AI APIs to choose from available models, fine-tune them with company-specific requirements, and expose them as APIs. They can also use SingleStore database deployed on Google Cloud Platform to store SQL and JSON data, run analytics in split seconds, and do both lexical and semantic search to curate the most relevant and fresh data for LLMs. This enables companies to take full advantage of current advancements in generative AI by storing and querying vector data along with other types of data using SQL in real-time, using Notebooks to chain multiple LLMs, and having connectors and pipelines that enable fast ingest of data from different sources.