Large Language Model: Definition and Examples
A Large Language Model (LLM) is an artificial intelligence model trained on massive volumes of text, capable of understanding and generating natural language with near-human fluency.
Full definition
A Large Language Model is a type of artificial neural network designed to process, understand, and produce natural language text. These models are called 'large' because of their enormous number of parameters—often tens or even hundreds of billions—and the massive amount of textual data on which they are trained.
The functioning of an LLM relies on an architecture called Transformer, introduced in 2017 by Google researchers. This architecture allows the model to analyze relationships between all words in a text simultaneously, thanks to an attention mechanism. During training, the model learns to predict the next word in a sequence, enabling it to acquire a deep statistical understanding of language, grammar, facts, and even some forms of reasoning.
Among the most well-known LLMs are OpenAI's GPT-4, Anthropic's Claude, Google's Gemini, and Meta's LLaMA. These models can perform a wide variety of tasks: writing, translation, summarization, code analysis, question answering, and much more, often without being explicitly programmed for each task.
The emergence of LLMs has profoundly transformed the field of prompt engineering. Unlike traditional software where you write code, interacting with an LLM involves formulating instructions in natural language—the prompts. The quality of the response directly depends on the clarity, precision, and structure of the prompt, making prompt engineering an essential skill to fully exploit the potential of these models.
Etymology
The term 'Large Language Model' appeared in the artificial intelligence scientific literature in the early 2020s. 'Large' refers to the size of the model (number of parameters), 'Language' indicates its specialization in natural language processing, and 'Model' denotes the underlying mathematical model. The acronym LLM quickly became common usage from 2022 onwards, with the democratization of ChatGPT.
Concrete examples
Asking an LLM to synthesize a complex document
Summarize this 20-page report into 5 key points, using accessible language for a non-technical audience.
Using an LLM to generate code from a description
Write a Python function that takes a list of prices as input and returns the average price, minimum price, and maximum price as a dictionary.
Leveraging an LLM's multilingual capabilities
Translate this marketing text from French to English and Spanish, adapting the tone for each target market.
Practical usage
Understanding what an LLM is allows you to tailor your prompts to its capabilities and limitations. For example, knowing that an LLM predicts the next word statistically explains why it may sometimes 'hallucinate' plausible but false information. In practice, structure your prompts with clear instructions, provide relevant context, and ask the model to reason step by step to get more reliable results.
Related concepts
FAQ
What is the difference between an LLM and classical AI?
Do LLMs really understand language?
Why is the size of an LLM important?
See also
How to use this prompt
- Copy the prompt with the button above.
- Paste it into ChatGPT, Claude or your favorite AI assistant.
- Replace the bracketed variables with your details, then refine the result.
About Prompt Guide
Prompt Guide is a free library of 2500+ ready-to-use prompts for ChatGPT, Claude and other AIs, with guides to learn prompting and tools to build and optimize your own prompts.
More definitions
Latent Space: Definition and Examples
Latent space is a compressed mathematical representation where an AI model encodes the essential features of data as numerical vectors, capturing semantic relationships between concepts.
Long Context Model: Definition and Examples
A Long Context Model is a language model capable of processing and reasoning over very large amounts of text in a single interaction, with a window...
LoRA: Definition and Examples
LoRA (Low-Rank Adaptation) is an efficient fine-tuning technique that allows adapting a large language model or image generation model to a specific task.
MCP Model Context Protocol: Definition and Examples
The Model Context Protocol (MCP) is an open standard that allows AI models to connect to external data sources, tools, and services.
Million Token Context: Definition and Examples
Capacity of a language model to process up to a million tokens in a single request, enabling analysis of very large documents, codebases
Model Card: Definition and Examples
A model card is a standardized document that accompanies an AI model to describe its performance, limitations, potential biases, and conditions of use
Get new prompts every week
Join our newsletter.