About TrainedOnMe

Why we built this

Writers, journalists, researchers, and developers put enormous effort into creating original content, only to have no way of knowing whether that content quietly ended up in an AI model's training set. The companies building these models rarely disclose what they trained on, and there is no straightforward way for individuals to find out.

TrainedOnMe is an attempt to give that power back. It lets anyone paste in text they created and run a statistical test against major AI models to see whether those models show signs of having memorized it.

How it works

TrainedOnMe uses a technique grounded in peer-reviewed AI research to probe whether a model has memorized specific text. The core idea: a model trained on a piece of writing will often continue it in ways that closely echo the original, while a model that has never seen it produces something merely plausible.

We run a series of statistical tests against each model and analyze how closely its behavior matches what we'd expect from memorization versus independent generation.

Limitations

This test can only detect content that was memorized during training. A model may have been trained on your content without strongly memorizing it, particularly if your text is short, stylistically generic, or similar to many other documents. A negative result is not proof that your content was never used. It means we could not find a detectable signal.

The test works best with content that is distinctive and long enough to extract several passages (500+ words is a good target). Highly formulaic text, such as boilerplate, legal language, or code, is harder to distinguish from independently-generated text.

Privacy

Your content is never stored by us. To run the test, passages are sent directly to the AI provider APIs (OpenAI, Anthropic, Google) and discarded immediately after scoring. Each model provider's data policy applies to those calls.

Who we are

We are a team at Stanford University working on questions of AI transparency and accountability. TrainedOnMe started as a research project and grew into a public tool because we kept getting asked the same question by writers, journalists, and researchers: "Is my work in there?"

If you have questions or want to get in touch, use the contact form.