Checkmark Plagiarism Logo
Checkmark Plagiarism
Menu
Back to Learning
AI BasicsHow It Works~9 min read

The Plain-English AI Glossary for Teachers and Parents

A jargon-free glossary of the AI terms showing up in classrooms, with simple definitions and real usage examples for teachers, administrators, and parents.

The Checkmark Plagiarism Team
The Plain-English AI Glossary for Teachers and Parents

Every few years a new vocabulary sneaks into the staff room. A decade ago it was "flipped classroom" and "growth mindset." Now it is "prompt," "hallucination," and "large language model." The difference this time is that the words are not just pedagogical fashion. They describe tools your students are already using, sometimes in ways that land on your desk as a discipline question or an academic integrity case.

You do not need a computer science degree to keep up. You need a working vocabulary, the kind that lets you read a news article, understand a vendor pitch, or talk to a student without nodding along to words you cannot define. This glossary is built for that. Each term gets a plain-English definition and a short example of how it actually shows up in a school setting. Skim it, bookmark it, share it with a colleague who keeps asking what these words mean.

The foundation: what the technology actually is

Start here, because most of the confusion downstream comes from fuzzy definitions up top.

Artificial intelligence (AI) is the broad umbrella term for software that performs tasks we used to think required human judgment, like recognizing a face, recommending a video, or writing a paragraph. It is a marketing word as much as a technical one. When a product says it is "powered by AI," that could mean almost anything.

Example: A reading app that adjusts difficulty based on a student's answers is using AI, and so is the chatbot that writes their essay. The label alone tells you very little.

Machine learning is the most common kind of AI today. Instead of a programmer writing out every rule by hand, the system learns patterns from large amounts of example data. Show it enough labeled photos of cats and it learns to spot cats it has never seen.

Example: A plagiarism checker that learns to flag suspicious writing was "trained" on millions of documents rather than programmed with a fixed list of rules.

Large language model (LLM) is the engine behind tools like ChatGPT, Claude, and Gemini. It is a machine learning system trained on an enormous amount of text, and its core skill is predicting what word should come next. String enough of those predictions together and you get fluent sentences, essays, and code.

Example: When a student types "write me a 500-word essay on the causes of World War One," an LLM produces it one likely word at a time. It is not looking up a saved essay. It is generating a new one on the spot.

Generative AI is the family of tools that create new content, text, images, audio, or video, rather than just analyzing existing content. LLMs are the text branch of this family.

Example: The same student could use generative AI to write the essay, then use a different generative tool to make an illustration for the cover page.

The words you will hear when students use these tools

This is the vocabulary of daily use, the terms a student would recognize even if they could not define them.

Prompt is the instruction you give an AI tool. The quality of the prompt heavily shapes the quality of the output, which is why "prompting" has quietly become a skill of its own.

Example: "Summarize this article" is a weak prompt. "Summarize this article in three bullet points a sixth grader could understand" is a strong one. Teaching students the difference is becoming part of digital literacy.

Token is the unit an LLM reads and writes in. A token is roughly three quarters of a word. You will mostly run into this term when a tool talks about length limits or pricing.

Example: A tool that allows "8,000 tokens" can handle roughly 6,000 words of input and output combined, which matters if a student tries to paste an entire textbook chapter.

Context window is how much text a model can hold in its working memory at once, measured in tokens. Go past it and the model starts to forget the beginning of the conversation.

Example: In a long back-and-forth, a chatbot may "forget" an instruction you gave twenty messages ago. That is the context window filling up, not the tool being defiant.

Hallucination is the polite industry term for when an AI confidently states something false. Because the model is predicting plausible text rather than retrieving verified facts, it can invent quotes, sources, statistics, and even court cases that never existed.

Example: A student cites a study in their bibliography that does not exist anywhere. They did not necessarily lie. The chatbot hallucinated the citation, and the student trusted it. This is one of the most important terms for any educator to understand.

Fine-tuning is the process of taking a general-purpose model and training it further on a narrower set of examples so it specializes. It is the difference between a general practitioner and a specialist.

Example: A company might fine-tune a model on thousands of graded essays so it can give feedback in a particular rubric's style.

The detection vocabulary

This is the part of the glossary most relevant to academic integrity, and the part most often misunderstood.

AI detector is a tool that estimates whether a piece of writing was produced by a human or a generative AI. It does not "catch" AI the way a metal detector catches metal. It produces a probability based on statistical signals in the text.

Example: A detector might report that a passage is "85 percent likely AI-generated." That is an estimate, not a confession, and it should open a conversation rather than close a case.

Perplexity measures how predictable a piece of text is to a language model. AI-generated writing tends to be smoother and more predictable, so it often scores low. Human writing tends to be more surprising, so it often scores higher. Detectors lean on this signal.

Example: A student who writes in short, formulaic, predictable sentences may trip a detector even when the work is entirely their own, because their natural style happens to be low in perplexity.

Burstiness measures variation in sentence structure and length. Humans tend to mix long sentences with short ones. Some AI text is more uniform. Detectors use this alongside perplexity.

Example: An essay where every sentence is roughly the same length and rhythm may raise a flag, which is one reason detector results should never stand alone as proof.

False positive is when a detector flags human work as AI-generated. False negative is the opposite, when AI work slips through as human. Both happen, which is why responsible policy treats detector scores as evidence to investigate, not verdicts to act on.

Example: A multilingual student writing in careful, textbook-style English is at higher risk of a false positive, because that style can look statistically similar to AI text.

Watermarking is an emerging technique where AI companies embed a hidden statistical signal in generated text so it can be identified later. It is not yet widespread or reliable, but you will hear it pitched as a future solution.

Example: A vendor may claim their tool can detect "watermarked" AI text. Ask which models actually watermark, because most do not yet.

The terms that get misused

A few words get thrown around loosely, and knowing what they really mean protects you from both hype and panic.

Plagiarism versus AI-generated writing are not the same thing, even though they often share a policy page. Plagiarism is copying someone else's work and passing it off as your own. AI-generated writing is text a student did not author themselves, but it is original in the sense that it was not copied from an existing source. A traditional plagiarism checker may pass an AI essay cleanly, because there is no original to match it against.

Example: A student submits an AI-written essay. A plagiarism checker finds zero matches and reports it as original. The work is still not the student's own thinking. This is exactly why the two kinds of detection exist side by side.

Algorithm simply means a set of step-by-step instructions a computer follows. It is not sinister and it is not magic. Every recipe is an algorithm.

Training data is the collection of text, images, or other examples a model learned from. Concerns about bias, privacy, and copyright almost always trace back to what was in the training data.

Example: If a model was trained mostly on formal, published English, it may handle a student's casual dialect poorly. The behavior reflects the diet it was fed.

A short word on staying current

This vocabulary will keep changing, and that is fine. You do not have to chase every new term. The goal is not fluency in computer science. It is enough literacy to ask good questions, read a vendor claim with a healthy eyebrow raised, and talk to a student about their work without either panicking or pretending nothing has changed.

Print this page, hand it to the colleague who keeps asking what a "hallucination" is, and you have done more for your school's AI literacy than most professional development days manage. The jargon is only intimidating until someone defines it plainly. Now someone has.

The Plain-English AI Glossary for Teachers and Parents