AI Cheating Detection in Schools: How It Works, Its Limits, and the Cat-and-Mouse Game

AI detection tools are being used in HK schools to catch AI-written work. A computing teacher explains how they work, why they're unreliable, and what this means.

Mr. NgSTEM & AI Literacy

27 May 20245 min read

#academic-integrity#AI#cheating#assessment#schools

I want to be honest with parents about something that is generating a lot of anxiety in schools right now: AI detection tools are not reliable enough to confidently accuse a student of having used AI to produce their work. Schools that treat these tools as definitive evidence are making a significant mistake, and families should know their rights.

This article explains how AI detection works, why it's limited, and what actually matters for academic integrity in the AI era.

How AI detection tools claim to work

Several commercial tools — Turnitin's AI detection module, GPTZero, Originality.ai, and others — analyse submitted text for patterns associated with AI-generated writing. The theoretical basis is that AI language models produce text with certain statistical regularities: lower "perplexity" (a measure of how predictable each word choice is), more uniform sentence structure, characteristic vocabulary patterns.

Human writing, in theory, is more varied, more idiosyncratic, more likely to include the quirks and imprecisions of a specific person's voice.

In theory, this allows a classifier to estimate the probability that a piece of text was AI-generated.

Why these tools are less reliable than they appear

The published accuracy rates of AI detection tools sound reasonable until you examine the conditions under which they were measured. The problem is that the tools were evaluated on obvious AI-generated text — output directly from ChatGPT without modification. Real student use is far messier.

Students modify AI outputs. Any student who types a ChatGPT response into their work verbatim is taking a risk that any competent teacher's reading instincts would catch anyway. Sophisticated users edit, rephrase, mix their own sentences with AI suggestions, and add personal details. These modified texts are much harder to classify.

The tools disproportionately flag certain groups. Several independent studies have found AI detection tools produce significantly higher false positive rates for non-native English writers. The formal, careful writing patterns of a student writing in their second or third language — avoiding the idioms and contractions that mark fluent native speech — score as more "AI-like." In Hong Kong, where students are writing in English as a second language, this is a serious issue.

AI companies are updating their models. As detection tools improve, AI writing tools adapt. The text produced by the most recent versions of AI systems is increasingly difficult to distinguish from human writing in automated ways.

What the tools can't measure. AI detection tools analyse linguistic patterns. They cannot assess whether a student understands what they submitted. That's a different question, and it's ultimately the more important one.

What this means for accused students

If your child's school claims their work was written by AI based solely on a detection tool, you have a right to ask pointed questions.

What tool was used? What percentage or confidence level was reported? What is the tool's documented false positive rate? Has the student been given an opportunity to demonstrate understanding of the submitted work by discussing it in person?

The last question is the critical one. If a student wrote something — even with significant AI assistance — they should be able to discuss it. Ask them to explain their argument. Ask where specific evidence came from. Ask why they chose a particular phrase. A student who can't engage meaningfully with their own submission has a genuine problem. A student who can is in a different position.

No teacher should take disciplinary action against a student based on an AI detection score without a follow-up conversation. Full stop.

The cat-and-mouse game schools are playing

Every additional layer of AI detection prompts students to find workarounds. This is the nature of enforcement-based approaches to technological change. The workarounds are easily found online and most students who are determined to circumvent detection can do so.

The schools I've seen handle this best are not the ones with the most sophisticated detection technology. They're the ones that have redesigned assessments to make AI assistance less useful — not by banning technology but by asking questions that require personal observation, specific classroom experience, or documented reasoning processes that AI can't fabricate because they're verifiable.

"Write an essay arguing for or against position X" is eminently AI-completable. "Write an essay reflecting on how your understanding of position X has changed over the course of this unit, citing specific moments from our discussions and your own research process" is much harder to fake, because the content is intrinsically personal.

What integrity actually means now

I want to offer parents a framework that I think is more useful than "did they use AI or not."

The question is: does your child understand what they submitted? Can they build on it? Can they defend it?

A student who used AI to generate ideas and then thought carefully about which ones were right, extended them, questioned them, and wrote their own analysis of them — that student did intellectual work. The AI was a tool in the process, like a calculator or a dictionary.

A student who submitted AI output without engaging with it — who can't tell you what their essay actually argued — has not done intellectual work. The AI was a substitute for thinking, not a tool for thinking.

The former is a student learning to use the tools of their era. The latter is a student who has opted out of their own education. Those are meaningfully different situations, and they can't be distinguished by a detection algorithm.

When Tutor Wong gives feedback, the goal is always that your child understands the material — not just that the right answers appear on the page.

Mr. Ng

STEM & AI Literacy

Secondary school science and computing teacher in New Territories. BSc Computer Science (CUHK), PGDE. Early adopter of AI tools in the classroom — and a cautious one. Believes every student needs to understand how algorithms make decisions that affect them.

All articles by Mr. Ng

Get Wong's Tips Weekly

One practical tip every week — no spam, just useful stuff.

We'll only send tips. Unsubscribe anytime.

Back to all articles

Disclaimer: The opinions expressed in this article are those of the author alone and do not represent the views or positions of 補習天王 (Tutor Wong), its founders, staff, or team. This article is provided for informational purposes only and does not constitute professional advice.

Keep Reading

study-tips

The 10-Minute Homework Check That Works Better Than 45 Minutes

Counterintuitive insight: checking less homework more carefully is more effective than checking everything. Here's the method.

Miss Fu6 min

study-tips

The 5-Minute Reset: A Brain Break That Actually Helps Focus

Not all breaks are equal. Based on attention restoration theory, here are specific 5-minute activities that genuinely restore your child's focus.

Miss Fu6 min

study-tips

The Homework Routine That Survives Chinese New Year

Re-establishing your child's homework routine after the CNY break doesn't have to take three weeks. Here's the 3-day reset method.

Miss Fu5 min