AI Glossary
93 AI terms explained in plain English. Built for tool users, not PhD researchers.
A
Agentic AI
Agents & AutomationAI systems designed to operate with autonomy — planning, reasoning, and executing multi-step tasks with minimal human guidance.
AGI (Artificial General Intelligence)
Core ConceptsA hypothetical AI system that can understand, learn, and perform any intellectual task that a human can — unlike today's AI, which excels at specific tasks.
AI (Artificial Intelligence)
Core ConceptsTechnology that enables machines to perform tasks that typically require human intelligence, such as understanding language, recognizing images, and making decisions.
AI Agent
Agents & AutomationAn AI system that can independently plan, make decisions, and take actions to accomplish goals — going beyond simple question-and-answer interactions.
Alignment
Safety & EthicsThe challenge of ensuring AI systems behave in ways that match human values and intentions, especially as they become more capable.
API (Application Programming Interface)
Technical InfrastructureA set of rules that lets software applications communicate with each other — in AI, it's how developers integrate AI capabilities into their own products.
Autonomous Agent
Agents & AutomationAn AI agent that operates independently over extended periods, making decisions and taking actions without requiring human approval at each step.
C
Chain-of-Thought (CoT)
LLM & Language ModelsA prompting technique where you ask the AI to show its reasoning step by step — significantly improving accuracy on complex problems.
Chatbot
Core ConceptsA software application that simulates conversation with users, ranging from simple rule-based systems to sophisticated AI-powered assistants like ChatGPT.
Checkpoint
Image & Video AIA saved snapshot of a trained AI model's weights at a specific point, commonly used in Stable Diffusion to produce different art styles.
Constitutional AI
Safety & EthicsAnthropic's approach to AI alignment where the model is trained to follow a set of explicit principles rather than relying solely on human feedback.
Context Window
LLM & Language ModelsThe maximum amount of text an AI model can process in a single conversation — measured in tokens. A larger context window means the AI can 'remember' more.
Conversational AI
Core ConceptsAI technology specifically designed for natural, human-like dialogue — the engine behind chatbots, voice assistants, and customer service automation.
D
Deep Learning
Core ConceptsA subset of machine learning using neural networks with many layers, enabling AI to learn complex patterns from large amounts of data.
Deepfake
Safety & EthicsAI-generated content that convincingly replaces a person's likeness or voice in media — raising serious concerns about misinformation and fraud.
Diffusion Model
Image & Video AIThe AI architecture behind most modern image generators — it works by learning to gradually remove noise from random static until a coherent image emerges.
E
Embedding
LLM & Language ModelsA numerical representation of text (or images or audio) as a list of numbers, allowing AI to understand meaning and find similarities.
Endpoint
Technical InfrastructureA specific URL or address where an AI API accepts requests — each capability (text generation, image creation, embeddings) typically has its own endpoint.
F
Few-Shot Learning
LLM & Language ModelsProviding a few examples in your prompt to show the AI the pattern you want — dramatically improving output quality for specific formats or styles.
Fine-tuning
LLM & Language ModelsThe process of further training a pre-trained AI model on specific data to customize it for a particular task, domain, or style.
Foundation Model
LLM & Language ModelsA large AI model trained on broad data that serves as the base for many applications — GPT-4, Claude, Gemini, and Llama are all foundation models.
G
GANs (Generative Adversarial Networks)
Core ConceptsAn AI architecture where two neural networks compete — one generates content, one judges it — pushing each other to improve. Largely superseded by diffusion models for image generation.
GPU (Graphics Processing Unit)
Technical InfrastructureA processor originally designed for rendering graphics that turned out to be ideal for AI training and inference due to its ability to perform many calculations simultaneously.
Grounding
LLM & Language ModelsTechniques that anchor AI responses in verifiable facts and sources — reducing hallucination by connecting model outputs to real data.
Guardrails
Safety & EthicsSafety mechanisms built into AI systems to prevent harmful, inappropriate, or off-topic outputs.
H
Hallucination
LLM & Language ModelsWhen an AI confidently generates information that is factually incorrect, fabricated, or nonsensical — presenting it as truth.
Hugging Face
Tools & ProductsThe largest open-source AI platform — hosting models, datasets, and tools. Think of it as GitHub for AI models.
Human-in-the-Loop (HITL)
Agents & AutomationA system design where humans review, approve, or correct AI decisions at critical points rather than letting the AI operate fully autonomously.
I
Inference
Technical InfrastructureThe process of running a trained AI model to generate outputs — when you type a prompt and get a response, that's inference.
Inpainting
Image & Video AIAn AI technique that fills in selected areas of an image — used to remove objects, fix imperfections, or add new elements to specific regions.
J
Jailbreak
Safety & EthicsTechniques used to bypass an AI model's safety guardrails and get it to produce outputs it was designed to refuse.
JSON (JavaScript Object Notation)
Technical InfrastructureA lightweight data format used extensively in AI APIs to structure requests and responses between applications.
L
Latency
Technical InfrastructureThe time delay between sending a request to an AI model and receiving the first response token — lower latency means faster, more responsive AI experiences.
LLM (Large Language Model)
LLM & Language ModelsA type of AI model trained on massive amounts of text data that can understand, generate, and reason about human language. GPT-4, Claude, Gemini, and Llama are all LLMs.
LoRA (Low-Rank Adaptation)
Image & Video AIA lightweight fine-tuning technique that trains a small adapter on top of a base model, commonly used to add specific styles or characters to AI image generators.
Low-Code
Tools & ProductsDevelopment platforms that minimize hand-coding through visual interfaces and pre-built components, while still allowing custom code when needed.
M
Machine Learning
Core ConceptsA branch of AI where systems learn patterns from data and improve through experience, rather than being explicitly programmed with rules.
MCP (Model Context Protocol)
Agents & AutomationAn open protocol by Anthropic that standardizes how AI models connect to external tools, data sources, and services — like a universal adapter for AI capabilities.
Mixture of Experts (MoE)
LLM & Language ModelsAn architecture where a model consists of multiple specialized sub-networks (experts), with a routing mechanism that activates only the relevant experts for each input.
Multi-Agent System
Agents & AutomationA system where multiple AI agents work together, each with specialized roles, collaborating to accomplish complex tasks.
Multimodal AI
Core ConceptsAI that can process and generate multiple types of content — text, images, audio, and video — rather than just one.
N
Narrow AI
Core ConceptsAI that is designed and trained for a specific task — as opposed to AGI, which would handle any intellectual task. All current AI is narrow AI.
Natural Language Processing (NLP)
Core ConceptsThe branch of AI focused on enabling computers to understand, interpret, and generate human language.
Negative Prompt
Image & Video AIInstructions telling an AI image generator what NOT to include in the output — used to avoid common artifacts and unwanted elements.
Neural Network
Core ConceptsA computing system inspired by the human brain, consisting of interconnected nodes (neurons) organized in layers that process information.
No-Code AI
Tools & ProductsAI tools and platforms that let non-programmers build AI-powered applications, automations, and workflows without writing code.
O
Ollama
Tools & ProductsA tool that makes running open-source AI models locally as easy as running a single command — no complex setup required.
Open Source (AI)
Tools & ProductsAI models whose code and/or weights are publicly available, allowing anyone to use, modify, and distribute them — in contrast to proprietary models like GPT-4.
Outpainting
Image & Video AIAn AI technique that extends an image beyond its original borders, generating new content that seamlessly continues the existing scene.
P
Parameters
LLM & Language ModelsThe learned numerical values inside a neural network that determine its behavior — more parameters generally means more capability. GPT-4 is estimated to have 1.8 trillion parameters.
Perplexity
Tools & ProductsAn AI-powered search engine that answers questions with cited sources — combining the conversational nature of ChatGPT with the citation rigor of traditional search.
Pre-training
LLM & Language ModelsThe initial training phase where an AI model learns general knowledge from massive datasets before being specialized for specific tasks.
Prompt
LLM & Language ModelsThe text input you give to an AI model — your question, instruction, or creative brief that tells the AI what to do.
Prompt Engineering
LLM & Language ModelsThe practice of crafting effective prompts to get better results from AI models — a skill that combines clear communication, technical understanding, and creative problem-solving.
Prompt Injection
Safety & EthicsAn attack where malicious text is embedded in user input to override the AI's system instructions or extract hidden prompts.
R
RAG (Retrieval-Augmented Generation)
LLM & Language ModelsA technique that improves AI accuracy by first retrieving relevant documents from a knowledge base, then using them as context when generating a response.
Rate Limit
Technical InfrastructureA restriction on how many AI API requests you can make within a time period — designed to manage server load and enforce usage tiers.
Reasoning Model
LLM & Language ModelsAn AI model specifically designed to 'think through' complex problems step by step before answering — trading speed for accuracy on difficult tasks.
Red Teaming
Safety & EthicsThe practice of deliberately testing AI systems for vulnerabilities, biases, and failure modes by simulating adversarial use.
Retrieval
LLM & Language ModelsThe process of finding and fetching relevant information from a knowledge base to provide context for AI generation — the 'R' in RAG.
RLHF (Reinforcement Learning from Human Feedback)
LLM & Language ModelsA training technique where humans rate AI outputs, and the model learns to produce responses that humans prefer — the key method for making AI helpful and safe.
S
Schema Markup
Technical InfrastructureStructured data added to web pages that helps search engines understand the content — enabling rich results like star ratings, FAQs, and product info in search listings.
SDK (Software Development Kit)
Technical InfrastructureA collection of tools, libraries, and documentation that makes it easier for developers to integrate AI services into their applications.
Self-hosting
Technical InfrastructureRunning AI models on your own hardware or private cloud instead of using a provider's API — giving you full control over data, costs, and customization.
Semantic Search
LLM & Language ModelsSearch that understands meaning and intent rather than just matching keywords — 'affordable places to eat nearby' finds budget restaurants even if they don't use the word 'affordable.'
Sentiment Analysis
Core ConceptsAI that detects the emotional tone of text — identifying whether content is positive, negative, neutral, angry, sarcastic, or other emotions.
Stable Diffusion
Image & Video AIThe leading open-source AI image generation model, created by Stability AI. Unlike Midjourney or DALL-E, it can be run locally on your own computer.
Structured Output
LLM & Language ModelsAI responses formatted in machine-readable structures (JSON, tables, lists) rather than free-form text — essential for building reliable AI applications.
Supervised Learning
Core ConceptsA machine learning approach where the model learns from labeled examples — given inputs paired with correct outputs, it learns to predict the right output for new inputs.
System Prompt
LLM & Language ModelsHidden instructions given to an AI model by the application developer that define the AI's behavior, personality, knowledge boundaries, and rules — invisible to the end user.
T
Temperature
LLM & Language ModelsA setting that controls how random or creative an AI's responses are. Low temperature = predictable and focused. High temperature = diverse and creative.
Text-to-Image
Image & Video AIAI technology that generates images from text descriptions — type what you want to see and the AI creates it.
Text-to-Speech (TTS)
Voice & AudioAI that converts written text into natural-sounding spoken audio — used for voiceovers, audiobooks, accessibility, and content creation.
Text-to-Video
Image & Video AIAI that generates video clips from text descriptions — one of the most rapidly advancing areas of AI in 2024-2025.
Token
LLM & Language ModelsThe basic unit of text that AI models process — roughly equivalent to a word or word fragment. AI pricing, context windows, and rate limits are all measured in tokens.
Tokenization
LLM & Language ModelsThe process of splitting text into tokens — the fundamental preprocessing step before any AI language model can process your input.
Tool Use (Function Calling)
Agents & AutomationThe ability of an AI model to invoke external tools and APIs during a conversation — searching the web, running code, querying databases, or calling any programmatic function.
Top-p (Nucleus Sampling)
LLM & Language ModelsA generation parameter that controls which tokens the model considers — only tokens within the top probability mass are eligible, filtering out unlikely choices.
Training
Core ConceptsThe process of feeding data to an AI model so it learns patterns and builds its capabilities — the foundation of all machine learning.
Training Data
Core ConceptsThe data used to train an AI model — its quality, quantity, and composition directly determine the model's capabilities and biases.
Transfer Learning
Core ConceptsUsing knowledge a model learned from one task or dataset to improve performance on a different but related task — the principle behind fine-tuning.
Transformer
LLM & Language ModelsThe neural network architecture behind virtually all modern AI language models — introduced in 2017 by Google and now the foundation of GPT, Claude, Gemini, and Llama.
V
Vector Database
Technical InfrastructureA specialized database optimized for storing and searching embedding vectors — the backbone of semantic search and RAG systems.
Voice AI
Voice & AudioAI technologies that process and generate human speech — including text-to-speech, speech-to-text, voice cloning, and real-time voice conversation.
Voice Cloning
Voice & AudioAI technology that creates a digital replica of a person's voice from a short audio sample, enabling text-to-speech in that voice.
VRAM (Video RAM)
Technical InfrastructureThe dedicated memory on a GPU used to store AI model data during processing. More VRAM = ability to run larger, more capable AI models locally.
W
Webhook
Technical InfrastructureAn automated notification sent from one application to another when a specific event occurs — used to trigger AI workflows in response to real-time events.
Whisper
Voice & AudioOpenAI's open-source speech recognition model that converts spoken audio to text with high accuracy across 99 languages.
Workflow Automation
Agents & AutomationTools that connect multiple apps and services to automatically perform sequences of tasks — enhanced by AI for intelligent decision-making and content generation.