The Direct Answer
HireVue's AI-detection stack in 2026 is asynchronous, audio-centric, and intentionally narrower than its reputation suggests. Facial analysis has been off the books since 2021. The system today evaluates verbal content against competency models, monitors tab switches during the response, runs a generative-AI audio classifier looking for synthesized-speech artifacts, and surfaces statistically similar answers across candidates. None of this produces an automated rejection. Everything routes to a human reviewer who works for the hiring company. The specific risk for candidates using AI assistants is concentrated in a narrow band — text-to-speech playback into the microphone — and that is the band the 2025 audio classifier was built to surface.
What HireVue Is and How It Differs from Live Platforms
HireVue is an asynchronous video assessment platform. Candidates receive a link, log in, see a prompt on screen, and record a response on a fixed time limit. There is no human on the other end at the moment of recording. Responses are uploaded to HireVue's pipeline, scored against a competency model trained on the hiring company's high-performing employees, and packaged into a report for the recruiter to review later.
This format is the entire reason HireVue's detection stack looks the way it does. There is no live interviewer to read body language, no shared editor for keystroke telemetry, and no real-time interruption to probe a suspicious answer. Everything HireVue can detect, it has to detect from the recorded artifact: the audio, the video, the transcript, and a thin layer of browser-side behavioral signals captured during the response window. That constraint shapes both what HireVue catches and what it misses.
The platform is used by a wide cross-section of Fortune 500 hirers, including Unilever, JPMorgan Chase, Hilton, and Vodafone, primarily for high-volume early-funnel screening rather than for late-stage technical loops. It also sells a Coding Assessment product that adds a live-coding environment on top of the video stack, which is where its detection footprint expands into territory that overlaps with Codility and similar coding-assessment platforms.
The Facial-Analysis History That Defines the 2026 Policy
The current HireVue policy is unintelligible without the history that produced it. From 2014 through 2020, HireVue's marketing centered on a facial-analysis layer that scored candidates on micro-expressions, smile frequency, and other visual cues. In November 2019 EPIC, the Electronic Privacy Information Center, filed a federal trade complaint alleging that the system was opaque, unvalidated, and discriminatory. The Illinois AI Video Interview Act, which took effect in January 2020, required explicit consent for any AI analysis of recorded interviews and established a regulatory baseline.
In January 2021 HireVue announced it would discontinue facial-analysis scoring entirely, citing the results of a third-party algorithmic audit that found visual analysis contributed minimally to predictive validity. The company shifted the public version of its system to transcript-only scoring of verbal content against competency models, with the webcam preserved for identity verification and downstream human review but not for automated scoring.
That policy has held through 2026, reinforced by a February 2026 consent-workflow update aligned to the latest Illinois amendments. The history matters because most candidate-facing guidance about HireVue still describes the pre-2021 stack. The detection surface a candidate actually faces in 2026 is meaningfully smaller than what older articles describe.
The 2026 Detection Stack
The current stack runs four layers in parallel during the response window and across the post-submission pipeline. The first layer is transcript scoring. HireVue's speech-to-text pipeline produces a transcript, and the transcript is scored against competency rubrics trained on prior high-performing responses from the same role family. The second layer is behavioral telemetry from the browser: tab switches, window-focus changes, page visibility transitions, and timing analytics on how long the candidate spent in each phase of the prompt. The third layer is the 2025 generative-AI-aware audio classifier. The fourth layer is cross-candidate statistical similarity, which flags responses that look suspiciously alike across the same hiring funnel.
None of these layers automatically rejects a candidate. Each layer contributes to a session-risk score that, above a configured threshold, routes the session to the hiring company's human reviewer along with the explanation of which signals fired. The final decision is always human, and that decision is made by someone at the hiring company, not at HireVue. The model is closer to email spam filtering than to authoritative judgment — high recall, moderate precision, human in the loop.
The 2025 Generative-AI Audio Classifier
The 2025 classifier is the most consequential addition to the stack for candidates who use AI assistants. It is trained to recognize the audio fingerprint of synthesized speech being played back into a microphone, and it produces a per-response score that surfaces high-likelihood sessions for human review.
The classifier looks for three families of signal. The first is spectral: text-to-speech engines produce characteristic artifacts in the high-frequency range and unusually clean formants. The second is prosodic: synthesized speech tends to have inhuman cadence — too even, too consistent, lacking the micro-hesitations and breath patterns of natural speech. The third is acoustic: when a candidate plays AI audio out of speakers and re-captures it through a microphone, the re-capture introduces a measurable loopback signature, including room reflections that do not match the candidate's own voice in the same recording.
The pseudocode below sketches the kind of scoring logic the audio pipeline can run on a recorded response. It is illustrative, not a leaked specification.
def score_response_audio(audio_clip, transcript_segments):
score = 0.0
# Spectral artifacts of common TTS engines
spec = spectrogram(audio_clip)
if matches_tts_fingerprint(spec, engines=["elevenlabs", "openai", "azure"]):
score += 0.30
# Prosodic cadence too even to be human
cadence = measure_phoneme_intervals(audio_clip)
if variance(cadence) < HUMAN_VARIANCE_FLOOR:
score += 0.20
# Loopback re-capture signature
if has_room_double_reflection(audio_clip):
score += 0.15
# Speaker change inside a single response
if count_distinct_voiceprints(audio_clip) > 1:
score += 0.20
# On-screen reading cadence
for segment in transcript_segments:
if reading_pattern_detected(segment):
score += 0.05
return min(score, 1.0)
The output is a score, not a verdict. A score above a customer-configured threshold flags the response for human review and is reported back to the hiring company alongside the response itself. The hiring company decides what to do with the flag.
Any AI tool that reads suggestions aloud into the microphone is the high-risk category on HireVue. Silent overlays do not produce the same fingerprint.
What the Webcam Still Records and What Human Reviewers Still See
The webcam is still recording during every response, even though facial analysis is not running. The recording is used for identity verification at the start of the assessment, and the full video is available to the hiring company's reviewers as part of the response artifact. That means a human reviewer can still see gaze patterns, off-screen reading behavior, and the candidate's general demeanor when reviewing a flagged response.
This matters because the most reliable signal in the post-2021 stack is not algorithmic — it is the human reviewer at the hiring company watching a flagged response. Reading visibly off-screen, with eyes tracking text that is not on the prompt itself, is one of the patterns flagged-response reviewers cite most often. The 2025 audio classifier and the tab-switch logger surface candidates for review; the review itself looks at the video.
Camera-based gaze inference is also a meaningful signal for the broader webcam-proctoring landscape in 2026, where multiple vendors compete on the precision of gaze and head-pose models. HireVue does not run automated gaze scoring as part of its scored stack, but the visual evidence remains in the recording and is available to the human reviewer.
Tab Switching, Timing, and Browser-Side Behavior
The browser-side telemetry layer is the most familiar one and the easiest to summarize. During the response window, the HireVue assessment runtime captures page-visibility transitions (the standard browser API that fires when a tab loses focus), pointer leaves, and window-blur events. Each event is timestamped against the response phase. A response with three tab switches during the thinking window is logged differently from a response with zero. Long pauses followed by polished answers are timing-flagged.
These signals do not catch a candidate who reads from a second monitor, because the browser cannot see the second monitor. They do catch candidates who alt-tab to ChatGPT inside the same browser instance, which is still the most common form of attempted assistance the platform sees. The countermeasure most candidates assume — an OS-level overlay that does not require switching tabs — bypasses the tab-switch layer but does not bypass the audio classifier if the overlay reads aloud, and does not bypass the human review of off-screen gaze patterns.
Detection Mechanism Matrix
The table below maps the detection mechanisms in HireVue's 2026 stack against whether each one fires on a candidate using an invisible visual-only AI overlay versus an overlay with audio output.
| Detection mechanism | Layer | Fires on silent visual overlay? | Fires on TTS audio overlay? |
|---|---|---|---|
| Tab-switch and focus telemetry | Browser runtime | No | No |
| Transcript competency scoring | Post-upload | No (content-only) | No (content-only) |
| Cross-candidate similarity check | Post-upload | Yes if generic | Yes if generic |
| 2025 TTS spectral classifier | Audio pipeline | No | Yes |
| TTS prosody / cadence classifier | Audio pipeline | No | Yes |
| Loopback acoustic signature | Audio pipeline | No | Yes |
| Voiceprint change detection | Audio pipeline | No | Yes |
| Off-screen gaze (human review) | Webcam recording | Partial | Partial |
| Identity verification snapshot | Webcam recording | N/A | N/A |
| Browser lockdown (Coding Assessment) | Browser runtime | Yes if tier enabled | Yes if tier enabled |
| Statistical timing anomaly | Post-upload | Partial | Partial |
The pattern is clear. The 2025 audio classifier is the layer specifically engineered for the generative-AI era, and it is the layer that disproportionately fires on candidates using TTS-enabled tools. The non-audio signals are largely the same ones HireVue ran in 2022. Candidates evaluating AI-assistant risk on HireVue should treat the audio surface as the primary one.
The Coding Assessment Product and Paid Proctoring
HireVue's Coding Assessment product is a separate tier that wraps a coding environment around the asynchronous video format. It adds browser lockdown, language-specific coding telemetry, and timing analytics. HireVue is explicit that this product does not run a Codility-grade keystroke-monitoring layer inside the machine, but it does add per-keystroke event logging within the coding panel and statistical comparison against known-AI fingerprints in the submitted code.
The paid proctoring add-on, often bundled with the Coding Assessment, expands identity verification and adds a manual review queue staffed by HireVue. Employers who buy this tier receive a more aggressive flag rate and more detailed reviewer notes. Most early-funnel screening on HireVue does not have this tier enabled, which is one reason a casual look at the platform's detection footprint understates the variance across customers. The candidate experience on a Hilton early screen looks meaningfully different from the candidate experience on a Goldman Sachs technical assessment, even though both are nominally HireVue.
Read the assessment-opening disclosure carefully. The tier and the proctoring scope are stated up front.
The AI Ethics Policy and What It Actually Constrains
HireVue's published 2026 AI ethics policy is the result of the multi-year regulatory rebuild after the facial-analysis controversy. The policy commits to no automated facial or demographic scoring, transparent disclosure of AI use to candidates, explicit consent under the Illinois AI Video Interview Act as amended in February 2026, annual third-party algorithmic audits, and human review for any score-affecting AI flag.
The policy does real work. The no-automated-rejection commitment is the reason a HireVue session does not behave like a Codility lockdown environment. The annual third-party audit requirement is the reason facial analysis has not quietly returned. The Illinois consent workflow is the reason the assessment opens with an explicit disclosure of AI processing rather than burying it in terms of service. These are not marketing claims; they are operational constraints that shape what HireVue can and cannot ship.
What the policy does not constrain is the hiring company. Once a flagged session is delivered to the hiring company's recruiter, what happens to the candidate is the hiring company's decision and is not bound by HireVue's policy. A confirmed flag at one employer does not propagate to another, but it does typically end the candidate's process at the employer that received it. That asymmetry — strong constraints on the vendor, soft constraints on the employer — is the operational reality of asynchronous AI-assisted assessments in 2026.
What This Means for AI Assistant Users
The actionable summary for candidates considering an AI assistant during a HireVue session is narrower than the platform's reputation suggests. Tab-switch logging is a mature signal but is bypassed by anything that does not require tab switching. Transcript scoring rewards specificity and texture and penalizes generic content; AI-generated answers tend to fail this test through quality, not through detection. The 2025 audio classifier is the layer specifically tuned for AI assistants and is the one that catches the most preventable mistakes — almost all of which involve playing synthesized speech into the room.
The candidate who uses a silent visual overlay to surface notes during a HireVue prompt faces a fundamentally different risk surface than the candidate who uses a TTS-enabled tool. The first is exposed primarily to the human reviewer reading off-screen gaze patterns. The second is exposed to all of that plus the audio pipeline. This is the distinction that matters most when evaluating tools for HireVue specifically, and it is the distinction most general-purpose guides about whether ChatGPT in Zoom interviews is detectable tend to skip past.
The other consideration is that HireVue is rarely the final round. It is an early-funnel filter, and a flag at this stage typically ends the process before a candidate ever speaks to a hiring manager. The risk-adjusted value of AI assistance at the screening stage is therefore lower than at later rounds where the candidate has built more context with the company. The realistic comparison is between the marginal score gain from AI assistance and the marginal probability of a flag that eliminates downstream rounds entirely. Candidate-side analysis of why qualified candidates fail technical interviews suggests preparation and authentic content typically dominate at this funnel stage, even before detection risk enters the calculation.
Where This Leaves the Candidate
HireVue in 2026 is a narrower detection environment than its public reputation suggests, but it is sharper where it does focus. Facial analysis is gone. Transcript content scoring is the primary scoring layer and rewards specificity over polish. The 2025 audio classifier is the layer most likely to surface a candidate using an AI assistant, and it surfaces a specific behavior — synthesized audio in the response — that visual-only tools do not produce. Human review at the hiring company is the final determination, and the patterns reviewers cite most often are off-screen gaze and generic content rather than algorithmic flags.
Understanding which layer fires on which behavior is the entire risk model. Most candidate-facing guidance about HireVue collapses these layers together and either overstates the detection capability or understates it. The honest 2026 picture is in the middle: real but bounded, audio-centric but human-mediated, and meaningfully different from the facial-analysis platform HireVue used to be.
TechScreen is the leading invisible AI assistant for technical interviews. Silent overlay, no audio output, no synthesized speech in the room. Try free with 3 tokens.
Frequently Asked Questions
Does HireVue still use facial analysis in 2026?
No. HireVue discontinued facial analysis in 2021 after the EPIC FTC complaint and the Illinois AI Video Interview Act, and the policy has held through 2026. Current scoring evaluates the transcript of verbal content against competency models, not facial expressions, appearance, or demographic features. The webcam still records the response for identity verification and downstream human review, but it does not feed an automated facial scorer.
Can HireVue detect that a candidate used ChatGPT?
Not reliably as a direct yes-or-no signal. HireVue has not certified a ChatGPT detector and has publicly stated that no significant score drift has appeared in its assessments since large language models became widely available. The system flags answers that match a known-generic AI fingerprint or that are statistically similar across candidates, and a 2025 audio classifier looks for TTS playback artifacts, but final review is human.
What is the HireVue audio classifier introduced in 2025?
The 2025 generative-AI-aware audio classifier scores response audio for signals associated with synthesized speech being played back into the microphone, including spectral artifacts of text-to-speech engines, inhuman cadence and prosody, and the acoustic signature of an audio loopback. It is one input into the suspected-cheating queue, not a standalone verdict, and high scores route sessions to a human reviewer.
What does HireVue's paid proctoring tier actually monitor?
HireVue's proctoring add-on covers identity verification, webcam recording during the response window, browser tab-switch logging, and statistical similarity checks across candidate pools. The Coding Assessment product wraps this with browser lockdown and timing analytics. HireVue explicitly does not run a Codility-grade keystroke or process-monitoring layer inside the candidate's machine.
What is the actual risk for AI assistant users on HireVue?
The largest risk category for AI assistant users on HireVue is text-to-speech playback. Any tool that reads suggestions aloud into a microphone produces audio fingerprints the 2025 classifier is designed to surface. Silent visual overlays do not produce those fingerprints, but visible eye-tracking patterns of a candidate reading off-screen text remain a meaningful signal during human review of flagged responses.
Does HireVue reject candidates automatically based on AI detection?
No. HireVue's documented policy is that no candidate is automatically rejected based on AI-detection signals. Flagged sessions are routed to a human reviewer, typically the hiring company's recruiter or assessment specialist, who makes the final call. The flag is shared as part of the assessment deliverable, which means the hiring company sees the suspicion even if the formal score is unaffected.
How does HireVue compare with Codility for AI proctoring?
HireVue is built for asynchronous video and behavioral assessment; its detection model is audio and transcript-based. Codility is built for live and asynchronous coding and runs deeper telemetry inside the coding environment itself, including keystroke timing, plagiarism comparison against a global corpus, and tab-focus logging. The two products are often used together by the same employer for different stages.
What is HireVue's published AI ethics policy in 2026?
HireVue's 2026 AI ethics policy commits to no automated facial or demographic scoring, transparent disclosure of AI use to candidates, explicit consent under the Illinois AI Video Interview Act as amended in February 2026, annual third-party algorithmic audits, and human review for any score-affecting AI flag. The policy is the result of a multi-year regulatory and reputational rebuild after the 2019 to 2021 facial-analysis backlash.
Ready to use AI assistance in your next interview?
TechScreen is the invisible AI assistant trusted by engineers interviewing at Google, Meta, Amazon, and hundreds of other companies. Start with 3 free tokens — no credit card required.
Ace your next interview →