Challenge

With a fully remote pipeline, the team couldn't tell who was genuinely strong and who was reading answers off a hidden overlay during the call.

Approach

Challenge Mode planted confident-but-wrong AI hints and Deviation Score measured who verified and pushed back — turning AI use into a signal instead of a threat.

Remote-first and remote-global hiring removed the room — and with it, the old assumptions about interview integrity. Modern cheating tools render AI answers as invisible overlays the interviewer can't see.

The problem: you can't ban your way out

Banning AI is unenforceable on a remote call, and it tests the wrong thing anyway — most modern engineering roles now require AI fluency. The team needed to know who could actually think.

The shift: make judgment the test

Challenge Mode let the AI plant confident-but-wrong suggestions. Deviation Score then measured who verified, debugged, and pushed back — turning AI use from a threat into a measurable signal.

The outcome

Average solution independence landed at 0.87, prompt logs were captured for every session, and redundant re-interviews dropped 71%. The question shifted from "did they cheat?" to "can they reason?"