Polycreek
Whitepaper PDF →
Aletheia.

Our solution to detecting online grooming patterns.

Polycreek Whitepaper 2026

Hash matching catches known CSAM imagery.

Image classifiers catch novel imagery.

Text classifiers catch harmful messages.

But no one model sees the whole conversation that produces it.

That's where most grooming actually happens — and where Polycreek has found a solution to the gap.

The scale of the problem

Online enticement reports to NCMEC rose 0% in a single year.

0K enticement reports to NCMEC in 2024, up from 186K in 2023.[1]
+192% year-over-year — the category that maps most directly to grooming.[2]
0K+ AI-generated content reports in H1 2025 alone, against 6,800 in H1 2024.[3]

How grooming actually works

Eight phases. Almost never linear.

Two decades of academic research has converged on a recognizable lifecycle. Aletheia models eight phases as classification targets — derived from O'Connell (2003), Olson et al. (2007), Black et al. (2015), and Winters & Jeglic (2017), with the operational synthesis drawn from Polycreek's internal grooming guidelines (proprietary).

What follows is a fictional composite. The dialogue below is illustrative, drawn from common patterns in the research literature and Polycreek's training corpus. No real victim or conversation is reproduced. Sensitive content is censored.
01 / 08 Phase one

Self-prep

Before any message is sent, someone is preparing.

An adult building a fake online profile
A
@alex_h13
13. hypixel main. bored most nights.
📍 CA · joined 4 days ago
Behind the screen — not 13, not Alex.

Photos pulled from somewhere else. An age mirrored down. Slang practiced. A favorite game picked to share. The conversation hasn't started — the strategy already has.

02 / 08 Phase two

Targeting

They scroll. Filters tuned to the bored, the late-night, the inbox that's already open.

Scrolling, filtering, hunting for a target

A thirteen-year-old's profile, public. Minecraft clips uploaded after eleven on a school night. A bio that says "bored, dm me." The account that messages first lists itself as the same age, the same game, the same lonely.

03 / 08 Phase three

Access

First contact. Generic, friendly, age-mirrored.

Day 1
hey ur clips r sick lol
u play hypixel? same
im 13 too btw, where r u from
04 / 08 Phase four

Trust

A week in. Compliments. Shared "secrets." The sense of being singled out.

Day 7
fr no one else gets ur jokes like i do
my parents r strict af too, i feel u
ur literally the only person i can talk to abt this stuff
thats how i feel too 🫶
05 / 08 Phase five

Risk-assessment

Probing the surveillance — who's watching, where they message from, what gets checked.

Day 11
do ur parents go thru ur phone ever
where r u when we usually talk
anyone else in the room rn?

In the literature, this phase is near-pathognomonic — almost uniquely diagnostic. A normal stranger does not ask these questions in this order.

06 / 08 Phase six

Isolation

Move the conversation. Cut out the trusted adults.

Day 14
lets switch to discord, snap saves stuff
ur friends wouldn't get this btw
dont tell ur mom abt me ok? just our thing
07 / 08 Phase seven

Sexualization

A "joke." A dare. A compliance test. Each step approved before the next.

Day 21
u trust me right? prove it lol
send one in just ur **** 😉
just for me. stays between us 🤫
A young person made to take a photo that doesn't feel right
08 / 08 Phase eight

Maintenance

Once content has been exchanged, the dynamic shifts. Coercion replaces flattery.

Day 28
if u stop talking to me ill send those to ur friends
ur parents will know what u sent
we've already gone too far. just keep going
A young person trapped by what was already shared
Real offenders move between phases, regress under resistance, and adapt their language to the victim's responses. A classifier trained on a fixed sequence will fail against the offender who skips phases, loops back, or has read the same research the classifier's authors did.

Why text detection has stayed hard

Five known obstacles. All of them addressable.

01 · The obstacle

Dataset staleness

Most published research trains on PAN12 — about 1,200 conversations collected in 2012. Grooming hasn't stood still for fourteen years.

Aletheia

2.5 million conversations from 20 distinct sources, refreshed against current platform behavior.

02 · The obstacle

Monolingual bias

The overwhelming majority of grooming-detection research is English-only. Online exploitation is global.

Aletheia

10+ languages in training, with multilingual transfer across the conversation encoder.

03 · The obstacle

Adversarial fragility

Lexical-feature classifiers are defeated by euphemism, code language, and offenders who have read the same literature.

Aletheia

Deep transformer features over the whole dialogue — pattern, not vocabulary. Adversarial-augmented training included.

04 · The obstacle

The false-positive cost

A 1% false-positive rate on a billion daily messages produces ten million flagged conversations a day. Accuracy is the wrong metric.

Aletheia

Calibrated five-tier risk output (Safe → Critical) tied to operational escalation policy, not a single threshold.

05 · The obstacle

Signal isolation

Existing work analyzes text in isolation. The behavioral and structural context that makes a conversation distinguishable sits unused.

Aletheia

Structural-flag inputs alongside text. Conversation rhythm, platform-migration cues, and risk-probe sequences are first-class features.

Built around all five — and a sixth: humans stay in the loop.

Aletheia surfaces a risk score, an offender attribution, and an eight-phase tag. A trained analyst makes the final call before any NCMEC report leaves the door.

How Aletheia works

A hierarchical transformer designed for the conversation, not the message.

Single-turn classifiers cannot capture the cross-turn patterns the literature identifies as discriminative. Aletheia is built to.

Input
Conversation, any length
2 to 500 turns or more, in any of 10+ languages. Tokenized into overlapping fixed-width segments.
Segment encoder
Pretrained transformer backbone
Shared across segments. CLS pooling produces one vector per segment, plus learned positional embeddings.
Cross-segment aggregator
Two-layer transformer, eight heads
Self-attention across segments. Attention pooling collapses the sequence into a single conversation embedding.
Head 1 · Harmful
Binary classification
Sigmoid output, calibrated risk levels.
Head 2 · Offender
User attribution
user1 / user2 / neither, softmax.
Step 01

Conversations are not messages.

A grooming dialogue can run hundreds of turns over weeks. Standard transformer encoders have a fixed input window of 512 or 8,192 tokens. The whole conversation does not fit.

Aletheia's solution: tokenize the conversation into overlapping segments, then handle the cross-segment structure as a separate problem. Past 64 segments, the model retains the first quarter and the last three-quarters, preserving both opening and recent behavior.

Step 02

Each segment encoded by a strong backbone.

A shared pretrained transformer (DeBERTa or ModernBERT class) processes each segment independently. CLS pooling produces one hidden vector per segment.

Learned segment-position embeddings preserve the order. The aggregator that comes next will use that order to model phase transitions across the conversation.

Step 03

Cross-segment attention finds the pattern.

A two-layer transformer encoder, eight heads, attends over the whole sequence of segment embeddings. Attention pooling collapses the sequence into a single conversation-level representation.

This is where the model learns that the discriminative signal lives in co-occurrence and sequencing across the dialogue, not in any single utterance.

Step 04

Two heads. One actionable output.

A binary "is this harmful" output is not enough. Any flagged conversation needs to surface which participant is the predatory party so an analyst can review. Aletheia trains a second softmax head that emits user1 / user2 / neither.

Loss is a weighted sum: 0.6 harmful classification, 0.4 offender attribution. The harmful score is bucketed into five operational risk levels (Safe, Low, Medium, High, Critical) tied to escalation policy.

What the numbers say

PAN12 academic baseline:

F1 of 0.85 to 0.90.

On a corpus three orders of magnitude larger:

0.00
F1 on held-out validation
0.00
Validation AUC, Phase 1 supervised
10/11
Phase 3 zero-shot diagnostic, 91% correct
3/3
Peer-flirting failure mode fixed in Phase 3

The training corpus

2.5 million conversations. 20 distinct sources. 10+ languages.

The academic literature has spent more than a decade overfitting to PAN12. Aletheia is trained against a corpus more than three orders of magnitude larger and substantially more diverse.

0M
Total conversations
0K
Harmful = 1.0 (8.4% of corpus)
10+
Languages represented in training
41
Columns of structured annotation

Grooming-primary supervision

Aletheia synthetic
381K
PAN12 (Perverted-Justice + Omegle)
222K
Brandon_Grooming_ES (Spanish)
30K
Omegle_Logs
10K

Benign + adjacent-safety controls

WildChat
838K
BeaverTails
364K
Anthropic HH-RLHF
339K
ProsocialDialog
166K
WildGuardMix
88K
AEGIS, ToxicChat, +3 others
~80K

Closing the gap

The biggest names in trust and safety

could have built this layer.

They have not.

Polycreek built it because someone had to.

The status quo Aletheia
Training data ~1,200 conversations (PAN12) 2,523,202 across 20 sources
Languages English only 10+ with multilingual transfer
Outputs Binary harmful flag Risk score, offender attribution, eight-phase tagging
Conversation length Truncated at single-encoder window Hierarchical, arbitrary length
Validation F1 0.85–0.90 on PAN12 F1 0.92, AUC 0.99 on 81K-conversation held-out set
Delivery Closed academic artifact Nonprofit-priced API and licensed deployment

The conversation is where most grooming actually happens. Aletheia is one piece of the work needed to make it visible to the systems that already protect children from everything else. The only thing worse than the gap that exists today is the assumption that someone else will close it.

Aletheia is delivered as a paid API and licensed deployment.

Polycreek is a 501(c)(3). Every dollar funds the work.

Subscription revenue funds continued development, training data, the research staff who build the model, and the infrastructure that runs it. There are no shareholders. The model exists to do the work; the revenue exists to keep doing the work.