P

Trustworthy AI: Definition and Examples

Trustworthy AI refers to artificial intelligence designed to be reliable, ethical, transparent, and respectful of fundamental rights, ensuring that its decisions and behaviors merit the trust of users and society.

Full definition

The concept of Trustworthy AI rests on the idea that an artificial intelligence system must not only be technically performant but also trustworthy on ethical, social, and legal levels. It is a holistic approach that encompasses algorithm transparency, fairness of outcomes, technical robustness, and respect for privacy.

This framework was widely popularized by the European Commission's guidelines in 2019, which identify seven key requirements: human oversight, technical robustness and safety, privacy, transparency, diversity and non-discrimination, societal and environmental well-being, and accountability. These principles now form the foundation of the European AI Act and influence global regulations.

In prompt engineering, the notion of Trustworthy AI translates into designing prompts that encourage verifiable, unbiased, and transparent responses. This involves explicitly asking the AI to cite its sources, signal its uncertainties, avoid stereotypes, and respect the limits of its knowledge rather than fabricating information.

The stakes are fundamental: as AI integrates into critical domains such as healthcare, justice, and finance, the trust placed in these systems must rest on concrete and measurable guarantees, not merely technological promises. Trustworthy AI thus represents a paradigm shift from AI optimized solely for performance to AI optimized for reliability and accountability.

Etymology

The term combines 'trustworthy' (from Old English 'treowwyrðe') and 'AI' (Artificial Intelligence). It became established in institutional vocabulary around 2018-2019, notably with the publication of the 'Ethics Guidelines for Trustworthy AI' by the European Commission's High-Level Expert Group on AI.

Concrete examples

Source verification and transparency

Analyze this medical report and provide your conclusions. For each assertion, indicate your confidence level (high, medium, low) and explicitly signal if you do not have sufficient data to conclude.

Bias reduction in analysis

Evaluate these 5 resumes for the senior developer position. Focus solely on technical skills and relevant experience. Ignore any information related to gender, ethnic origin, or age. Explain your evaluation criteria before starting.

Audit of an existing AI system

Act as a specialized auditor in responsible AI. Analyze this credit scoring model according to the 7 criteria of the European Commission's Trustworthy AI. For each criterion, assign a compliance score and identify risks.

Practical usage

In prompt engineering, applying Trustworthy AI involves formulating instructions that enforce transparency: asking the AI to distinguish verified facts from assumptions, to explain its reasoning step by step, and to signal its limitations. Systematically integrate safeguards into your prompts, such as 'if you are not sure, say so' or 'cite your sources.' This discipline not only improves the reliability of responses but also makes it easier to detect hallucinations and biases.

Related concepts

AI EthicsExplainability (XAI)AI AlignmentResponsible AIAI GovernanceFairness in AI

FAQ

What is the difference between Trustworthy AI and Responsible AI?
The two concepts are similar but have different angles. Responsible AI focuses on the accountability of creators and operators of AI systems (who is responsible if something goes wrong?). Trustworthy AI encompasses a broader vision centered on the end user: does the system deserve the trust placed in it? It integrates technical criteria (robustness, security) in addition to ethical aspects.
How is Trustworthy AI linked to the European AI Act?
The AI Act, which came into force in 2024, directly builds on the principles of Trustworthy AI defined by the European expert group. It translates these principles into concrete legal obligations, classifying AI systems by risk level (unacceptable, high, limited, minimal) and imposing transparency, documentation, and human oversight requirements proportionate to the risk.
Can we concretely measure whether an AI is 'trustworthy'?
Yes, several evaluation frameworks exist. The NIST AI Risk Management Framework provides quantifiable metrics. Robustness can be assessed through adversarial testing, fairness through bias audits across different demographic groups, transparency through the degree of explainability of decisions, and reliability through error and hallucination rates. The key is not to rely solely on declarations of intent but to implement verifiable and auditable measures.

See also

How to use this prompt

  1. Copy the prompt with the button above.
  2. Paste it into ChatGPT, Claude or your favorite AI assistant.
  3. Replace the bracketed variables with your details, then refine the result.

About Prompt Guide

Prompt Guide is a free library of 2500+ ready-to-use prompts for ChatGPT, Claude and other AIs, with guides to learn prompting and tools to build and optimize your own prompts.

More definitions

Get new prompts every week

Join our newsletter.