Adapter Tuning: Definition and Examples
Efficient fine-tuning technique that involves inserting small trainable modules (called adapters) into a pre-trained language model, without modifying its original weights.
Full definition
Adapter Tuning is a method for adapting language models that relies on inserting small additional neural networks — adapters — between the existing layers of a pre-trained model. During training, only the parameters of these adapters are updated, while the original model's weights remain frozen. This allows adapting a model to a specific task by training only a tiny fraction of its total parameters.
This approach was popularized by the work of Houlsby et al. (2019) at Google Research. The central idea is that a massive model like GPT or BERT already contains a rich representation of language, and that adding a lightweight adaptation layer is sufficient to specialize it. Each adapter typically consists of a projection to a reduced dimension (bottleneck), a non-linear activation function, and then a projection back to the original dimension.
The main advantage of Adapter Tuning is its resource efficiency. Whereas traditional fine-tuning requires storing a complete copy of the model for each task, Adapter Tuning only requires storing the small added modules, typically less than 5% of the model's parameters. This makes it possible to deploy a single base model with multiple interchangeable adapter sets depending on the task.
Adapter Tuning is part of a broader family of techniques called PEFT (Parameter-Efficient Fine-Tuning), which also includes LoRA, Prefix Tuning, and Prompt Tuning. These methods all address the same challenge: how to customize increasingly large models without exploding computational and storage costs.
Etymology
The term "adapter" comes from English and refers to a device that makes a system compatible with a use different from the one for which it was designed. In machine learning, the concept was borrowed to describe these lightweight modules that "adapt" a general-purpose model to a specific task. The term "tuning" refers to the process of adjusting parameters.
Concrete examples
Adapt a general-purpose language model for sentiment classification in customer reviews, without modifying the base model.
You are a model specialized in sentiment analysis thanks to an adapter trained on French customer reviews. Classify the sentiment of this text as positive, negative, or neutral: "The product is okay but the delivery was catastrophic."
Use multiple adapters on the same base model to handle different languages or business domains.
Activate the 'legal-fr' adapter to analyze this contract. Identify potentially abusive clauses and explain them in plain language.
Explain to a technical team why choose Adapter Tuning over full fine-tuning for a project with limited resources.
Practical usage
In prompt engineering, understanding Adapter Tuning helps better choose your model adaptation strategy. If you need to specialize an LLM for a specific domain (medical, legal, technical) with a limited budget, Adapter Tuning is often the best compromise between performance and cost. Practically, libraries like Hugging Face PEFT allow implementing this technique in a few lines of code on open source models.
Related concepts
FAQ
What is the difference between Adapter Tuning and LoRA?
How many parameters are trained with Adapter Tuning?
Can Adapter Tuning be used with models like GPT-4 or Claude?
See also
How to use this prompt
- Copy the prompt with the button above.
- Paste it into ChatGPT, Claude or your favorite AI assistant.
- Replace the bracketed variables with your details, then refine the result.
About Prompt Guide
Prompt Guide is a free library of 2500+ ready-to-use prompts for ChatGPT, Claude and other AIs, with guides to learn prompting and tools to build and optimize your own prompts.
More definitions
Agent: Definition and Examples
An agent is an AI system capable of acting autonomously to accomplish complex tasks, planning its actions, using tools, and…
Agentic Workflow: Definition and Examples
An agentic workflow is a workflow in which one or more AI agents autonomously make decisions, chain actions, and adapt
AI A/B Testing: Definition and Examples
AI A/B Testing refers to the use of artificial intelligence to design, execute, and analyze A/B tests in an automated way, enabling
AI Accountability: Definition and Examples
AI Accountability refers to the set of principles and mechanisms ensuring that artificial intelligence systems, as well as their designers and users, are held responsible for their decisions, impacts, and outcomes.
AI Alignment: Definition and Examples
AI Alignment refers to the set of research and techniques aimed at ensuring that artificial intelligence systems act in accordance with human intentions, values, and interests.
AI Audit: Definition and Examples
An AI Audit is a systematic evaluation process of an artificial intelligence system aiming to verify its compliance, reliability, fairness, and transparency.
Get new prompts every week
Join our newsletter.