Top Large Language Models (LLMs): GPT-4, LLaMA 2, Mistral 7B, ChatGPT, and More

Company

Vectara

Date Published

Oct. 17, 2023

Author

Suleman Kazi & Adel Elmahdy

Word count

2078

Language

English

Hacker News points

None

URL

vectara.com/blog/top-large-language-models-llms-gpt-4-llama-gato-bloom-and-when-to-choose-one-over-the-other

Summary

GPT-4, OpenAI's latest model, was announced in March 2023. It shows impressive performance on various tasks, including professional medical and law exams, and expands the maximum input length to 32,768 tokens, but its architecture and training datasets remain unclear. GPT-4 wins our pick for a fully hosted, API-based LLM due to OpenAI's strong track record, although a subscription to ChatGPT Plus is required for access. ChatGPT is a text-only model released by OpenAI in November 2022 and designed to engage in natural language conversations; basic access is available without a subscription, which makes it suitable for personal projects or experimentation. LLaMA 2, released in July 2023, is Meta AI's next-generation open-source language understanding model. It comes in various sizes, includes variants for code understanding and completion, can be fine-tuned for commercial and research purposes, and has double the context length of its predecessor. The Falcon series of models, developed by the UAE's Technology Innovation Institute, performs strongly among pre-trained open large language models and is available for both research and commercial use. Mistral 7B, announced in September 2023, outperforms LLaMA 2 on many benchmarks at a relatively small size that doesn't require monstrous GPUs to host, making it our pick for the best overall self-hosted model for commercial and research purposes. GPT-3 is OpenAI's pre-trained model that can be fine-tuned on a particular task and exhibits impressive few-shot and zero-shot performance on NLP tasks. BLOOM, released in November 2022, is a multilingual LLM that generates text in 46 natural languages and 13 programming languages, built with the aim of developing a more transparent and interpretable model. LaMDA, announced in May 2021, is a model designed to have more natural and engaging conversations with users, built on an earlier Google chatbot called Meena.
MT-NLG uses the architecture of the transformer-based Megatron to generate coherent and contextually relevant text for various tasks and is available via API. LLaMA, announced in February 2023 by Meta AI, is available in multiple sizes from 7 billion to 65 billion parameters, with access limited to researchers, government affiliates, and those in academia who submit an application to Meta. Stanford Alpaca, announced in March 2023, is fine-tuned from Meta's LLaMA 7B model on 52k instruction-following demonstrations and aims to help the academic community engage with such models by providing an open-source model rivaling OpenAI's GPT-3.5 models. FLAN UL2 is an encoder-decoder model, a souped-up version of the T5 model trained using Flan, which exceeds prior versions' performance and is available for self-hosting or fine-tuning. GATO, announced in May 2022, is DeepMind's multimodal model, capable of working not just on text but on other modalities and of performing multiple tasks such as image captioning and controlling a robotic arm, although it remains unclear whether it will be released. PaLM, announced in April 2022, is based on Google's Pathways AI architecture, which aims to build models that can handle many different tasks and learn new ones quickly; it achieves state-of-the-art performance on many language-related tasks. Claude is described by Anthropic as a "next generation AI assistant" and is available in two modes, Claude and Claude Instant, with limited details published about its training process or model architecture. ChatGLM, announced in March 2023 by Tsinghua University's Knowledge Engineering Group, is a bilingual Chinese-English language model available for download on HuggingFace, optimized for the Chinese language and released under an Apache-2.0 license that allows commercial use.