← All articles
12 min read

Does CodeSignal Detect AI Tools or Cheating? (2026)

CodeSignal has built one of the most aggressive AI-detection stacks in the technical assessment space. Here is what the platform actually catches in 2026, and the categories of behavior it cannot see directly.

Direct Answer

CodeSignal has built one of the most aggressive detection stacks in the technical assessment space, and its 2026 version catches a wide range of cheating behaviors: copy-paste from external sources, tab switching, identical or near-identical solutions, AI-pattern code in the final submission, off-screen gaze in the webcam stream, and typing rhythm that does not match organic composition. The combination is rolled up into a single Suspicion Score visible to recruiters. The stack is deepest on Certified Evaluations and full-proctored General Coding Assessment configurations and lighter on practice assessments. CodeSignal's own reporting puts the attempt rate for proctored assessments at thirty-five percent in 2025, more than double the prior year. The detection layer is designed to flag submissions for human review rather than to auto-reject; the practical risk for any candidate using an undisclosed AI tool is that the flag triggers a manual review of the session.

The CodeSignal Stack: Three Products, One Detection Layer

CodeSignal sells three overlapping products that share the same proctoring and detection infrastructure. The General Coding Assessment, known as the GCA, is the seventy-minute four-question test that companies like Capital One, Netflix, and Meta use for first-round screens; it is scored out of 850 and the cutoff to advance varies by employer and role, with Capital One commonly cited as requiring 750 or higher and software engineering roles often targeting 800. The Industry Coding Framework, or IFC, is the evaluation framework for mid-to-senior engineers that companies use for later rounds or specialized roles. Certified Evaluations is the relatively newer product where candidates earn a credential by completing a single proctored assessment that they can then submit to multiple participating employers.

All three products share the same proctoring infrastructure. The Suspicion Score, the AI-code classifier, the paste detector, the tab focus tracker, and the webcam analysis run uniformly across them when the recruiter or the credential program enables proctoring. The relevant difference for candidates is the strictness of the configuration, not the underlying capability. Certified Evaluations defaults to the strictest configuration because the credential is meant to be reusable; the GCA defaults to strict proctoring because employers expect the score to be defensible; the IFC framework's proctoring is configured by the employer.

Identity Verification and the Pre-Test Funnel

Before any code is written, CodeSignal runs an identity verification flow on proctored assessments. The candidate is prompted to share their camera, microphone, and screen. The candidate is then prompted to hold up a government-issued photo ID for the camera, which CodeSignal's proctoring verification team later checks against the webcam feed for the duration of the session. The published policy states that camera, microphone, screenshare, and ID data are reviewed by CodeSignal only and are deleted within fifteen days. Once verification completes, the assessment loads in a single full-screen browser window with the lockdown configuration applied.

This step matters for the threat model because it transfers identity from a username to a verified photo of a face. Proxy test-takers — a separate human writing the test on the candidate's behalf — are caught at this layer if the face in the webcam stream does not match the face on the ID. CodeSignal reported in 2025 that proxy test-taking is one of the four cheating categories its detection systems are explicitly designed to identify, alongside copy-paste plagiarism, unauthorized AI use, and identity fraud.

Browser Lockdown and Focus Tracking

The browser-lockdown mode restricts the candidate's environment for the duration of the assessment. The assessment runs in a single full-screen window. Tab switching is blocked or, in lighter configurations, logged with timestamps. External navigation is blocked. Common browser extensions known to interfere with assessments are detected and the candidate is asked to disable them. In some configurations the clipboard is partially restricted, blocking external paste while still allowing paste within the editor for legitimate code reformatting.

The focus tracker uses the standard browser visibilitychange and blur events. If the candidate switches away from the assessment window — to a different application, to a different desktop, or to a second monitor with a different window in focus — the event is recorded with a timestamp and duration. Each such event increments the Suspicion Score. Candidates who treat the assessment window as inviolable for the full session avoid this entire category of flags. Candidates who briefly check a chat application or a notes app generate a stream of focus-loss events that show up clearly in the timeline.

Paste Detection and the Description-Copy Pattern

The paste detector logs every paste event into the editor with a timestamp, the length of the pasted content, and a hash of the pasted text. The hash allows the recruiter to see whether the pasted content matches text the candidate copied earlier from the problem description, from a previous solution, or from outside the assessment. The single highest-signal cheating pattern that CodeSignal's documentation calls out is pasting after a period of editor inactivity, especially when the paste immediately follows a Description Copy event — the candidate copies the problem description, switches to an external tool to solve it, comes back, and pastes the result.

This pattern is straightforward to detect because the events are timestamped and ordered in the timeline. The recruiter can see the problem description being copied at minute 02:14, a focus-loss event at 02:15, the candidate returning at 04:47, and a 78-line paste at 04:48. The Suspicion Score weights this combination heavily because the false-positive rate for it is low. The same logic underpins detection on related platforms; the broader landscape of paste, focus, and similarity detection is covered in the comparison piece on whether HackerRank detects AI in 2026.

The AI-Code Classifier

CodeSignal rolled out its AI-code classifier in 2024 and substantially improved it through 2025 and into 2026. The classifier looks at the final submitted code and the edit history that produced it, and assigns a probability that the submission was authored or substantially assisted by a large language model. The features the classifier uses include stylistic markers — unusually consistent naming, idiomatic but textbook-pattern structure, the absence of dead code or false-start fragments, comments written in a style associated with frontier-model output — and structural markers in the edit history, such as long bursts of typing without any backspace or correction.

The classifier is trained on a large corpus of human-written solutions to CodeSignal's question bank, augmented with synthetic solutions generated by major frontier models on the same tasks. CodeSignal reported that tens of thousands of candidates completed AI-assisted coding assessments through 2025 and that roughly one-third of CodeSignal customers adopted some AI-assisted format during the same year, which means the corpus of labeled AI output the classifier trains on is unusually large by industry standards. The classifier outputs a probability rather than a binary label, and that probability rolls into the Suspicion Score alongside the behavioral signals.

Detection typePractice / unproctoredGCA proctoredCertified EvaluationIndustry Coding Framework
Tab focus and blur eventsLoggedLoggedLoggedLogged
Browser lockdownNoYesYesConfigurable
Paste events with hashYesYesYesYes
Webcam recordingNoYesYesConfigurable
Microphone recordingNoYesYesConfigurable
Screen recordingNoYesYesConfigurable
Government ID verificationNoOptionalYesConfigurable
AI-code classifier on submissionPartialYesYesYes
Typing rhythm analysisYesYesYesYes
Similarity check against historical solutionsYesYesYesYes
Suspicion Score compositeYesYesYesYes
Overlay window not in capture bufferNot detected directlyNot detected directlyInferred via typing rhythmInferred via typing rhythm

The Suspicion Score: How the Composite Works

The Suspicion Score is CodeSignal's headline detection metric and the single number that shows up most prominently on the recruiter dashboard. It combines the outputs of every signal in the stack — paste events, focus events, the AI-code classifier probability, the similarity score against historical and AI-generated reference solutions, the off-screen gaze detector running on the webcam stream, the typing rhythm anomaly score, and the result of the identity verification check. Each signal has a weight that CodeSignal tunes based on the false-positive and true-positive rates observed across the platform.

A simplified version of how the composite gets assembled, in pseudocode:

function suspicion_score(session):
    paste_signal = weight_paste * paste_anomaly(session.paste_events)
    focus_signal = weight_focus * focus_anomaly(session.focus_events)
    typing_signal = weight_typing * rhythm_anomaly(session.keystrokes)

    ai_classifier_score = ai_code_classifier(
        code=session.final_submission,
        edit_history=session.edits,
        problem_id=session.problem_id
    )
    ai_signal = weight_ai * ai_classifier_score

    similarity = compare_against_corpus(
        submission=session.final_submission,
        corpus=[
            historical_human_solutions,
            llm_reference_solutions,
            known_leaked_solutions
        ]
    )
    similarity_signal = weight_sim * similarity

    if session.webcam_enabled:
        gaze_signal = weight_gaze * off_screen_gaze_rate(session.video)
    else:
        gaze_signal = 0

    composite = (paste_signal + focus_signal + typing_signal
                 + ai_signal + similarity_signal + gaze_signal)
    return calibrate_to_bucket(composite)
    # -> "low", "medium", "high", or "very high" risk

The composite is then bucketed into a discrete risk level the recruiter sees. CodeSignal's public guidance is explicit that a high Suspicion Score is a prompt for human review rather than a determination of cheating. The bucket also drives the recruiter's downstream action: low-risk submissions are reviewed only on score; medium-risk submissions get a session-replay review; high-risk submissions get a full proctoring-team review with the video, the screen recording, and the edit history examined together.

Any candidate considering a CodeSignal round in 2026 should assume the platform sees the full event timeline of the editor, the focus history of the browser, the rhythm of every keystroke, and the structural fingerprint of the final submission.

Get started free →

Industry Partnerships and the Stakes

The list of companies that use CodeSignal for technical screening in 2026 includes Netflix, Capital One, Meta, Dropbox, Roblox, Uber, and Robinhood, with smaller and mid-sized employers using the platform across a wider range of industries. The GCA is the most commonly used product in this set — Netflix and Capital One both use the GCA at the first-round screen — and Certified Evaluations is increasingly used as a portable credential for engineers applying to multiple participating employers in the same season.

The stakes of a flagged session at this stage are real. A Suspicion Score in the high bucket on a GCA at a company like Capital One does not automatically reject the candidate, but it does trigger a manual review by the recruiter, and the recruiter has wide discretion on the outcome. CodeSignal's 2025 fraud report stated that detection systems identified and stopped a record number of attempts as the attempt rate on proctored assessments climbed from sixteen percent to thirty-five percent year-over-year. Recruiters at companies that pay for the proctoring stack are paying attention to the output. The competitive landscape around AI-tool-aware hiring is moving quickly, and the related piece on best AI coding assistant for interviews in 2026 covers how the candidate-facing tools are evolving in response.

False Positives and the Cost of Being Flagged

CodeSignal's documentation explicitly states that the Suspicion Score is a flag for review, not a verdict. The platform reports that detection accuracy continues to improve and that the recruiter is expected to use judgment on each flagged submission. In practice, false positives do occur. Common causes include candidates who type their solution in an external editor and paste it in for formatting reasons, candidates who learned a canonical solution to a well-known problem and reproduce it cleanly enough to trigger the similarity score, candidates whose typing rhythm is naturally bursty because they think in long blocks before writing, and candidates whose home office setup includes a second monitor that the webcam catches them glancing at for legitimate reasons.

The cost of a false positive depends on the recruiter. At a well-run hiring team, the flag prompts a careful review of the session replay and a fair determination. At a less experienced or higher-volume team, the flag is sometimes treated as a near-rejection. Candidates who genuinely solved the problem on their own can find themselves disadvantaged by the same detection layer that catches the candidates who did not. This dynamic is part of the broader pattern that drives why qualified candidates fail technical interviews and is one reason employers using these platforms increasingly pair the score with a structured live follow-up round.

What the Detection Stack Does Not See

Two categories of behavior fall outside the CodeSignal stack's direct line of sight in 2026. The first is content displayed on a separate physical device — a phone, a tablet, or a second laptop running on a different network. The webcam can flag the candidate looking away from the screen if the gaze detector is configured aggressively, but the contents of a separate device are not visible to CodeSignal. The second is overlay windows on the candidate's primary machine that render outside the standard screen capture buffer used by the WebRTC getDisplayMedia API. The detection layer for this category is indirect: the AI-code classifier on the final submission, the typing-rhythm anomaly detector on the keystroke stream, and the off-screen gaze detector on the webcam stream all produce signals correlated with the use of such tools without seeing the tools themselves.

This is the architectural property that defines "invisible" interview assistants. The platform's response is not to see the overlay window directly — the operating system's graphics pipeline makes that hard by design when an application opts out of capture — but to infer the use of an external assistant from second-order signals. CodeSignal has invested heavily in those second-order signals, and the AI-code classifier in particular has improved substantially since its 2024 launch.

The defensive baseline for any 2026 candidate on CodeSignal is to expect the Suspicion Score, the AI-code classifier, the focus tracker, and the paste detector to all be running by default on any proctored assessment.

Get started free →

Practical Risk Framework for Candidates

The practical risk surface for a candidate considering an undisclosed AI tool on a CodeSignal proctored assessment in 2026 breaks down into three components. The first is the behavioral signal layer: focus events, paste events, typing rhythm, and the webcam gaze detector. This layer is the easiest to trigger by accident and the easiest to score against intentionally. The second is the AI-code classifier on the final submission. This layer is harder to predict because the candidate does not see the classifier's training data and does not know which structural patterns it weights most heavily. The third is the similarity check against historical and AI-generated reference solutions. This layer is the most deterministic — code that closely matches a frontier-model reference solution to the same problem is flagged with high probability.

The composite Suspicion Score combines all three. A high score does not auto-reject, but it does trigger a manual review with full access to the session replay. Candidates who would consider an undisclosed AI tool should weigh the stakes of being in the manual-review pile against the alternative of preparing such that the assessment can be completed cleanly. The broader ethics question is treated in is using AI during a coding interview cheating. The practical reality is that CodeSignal in 2026 catches more than it did in 2024, the customer base treating the Suspicion Score as a real signal is growing, and the cost of being flagged at a top-tier employer is higher than at almost any other stage of the funnel.

TechScreen is the invisible AI assistant for technical interviews, built for live screens on platforms like Codility, HackerRank, and CodeSignal. Start with 3 free tokens, no card required.

Get started free →

Frequently Asked Questions

Does CodeSignal record the candidate's screen during a proctored assessment?

Yes, in Certified Evaluations and any assessment configured with full proctoring. The candidate is prompted before the assessment to share camera, microphone, and screen via the browser, and the resulting feeds are recorded for the duration of the session. CodeSignal's proctoring verification team reviews the recordings post-hoc and retains the data for up to fifteen days under the published policy.

What is the Suspicion Score and how is it calculated?

The Suspicion Score is CodeSignal's composite metric for how likely a submission involved cheating or AI assistance. It combines tab-switching events, paste events, typing rhythm anomalies, similarity-to-known-solutions scores, off-screen gaze detection from the webcam stream, and the output of the AI-code classifier trained on patterns characteristic of LLM-generated code. The score is presented to recruiters as a flag for review rather than a verdict.

Which companies use CodeSignal and at what stage of hiring?

Companies using CodeSignal in 2026 include Netflix, Capital One, Meta, Dropbox, Roblox, Uber, and Robinhood. Most use it for first-round screens via the General Coding Assessment, with Certified Evaluations gaining traction as a credential candidates can earn once and submit to multiple companies. The Industry Coding Framework is used by select customers for senior-engineer evaluations.

Does CodeSignal detect copy-paste from external sources?

Yes. The CodeSignal editor logs every paste event with timestamp, length, and content hash. Paste events that occur after a period of editor inactivity, or that immediately follow a Description Copy event, are weighted heavily in the Suspicion Score. The recruiter can see the exact moment of each paste in the session replay.

Can CodeSignal detect AI-generated code in the final submission?

CodeSignal's AI-code classifier, rolled out in 2024 and substantially improved through 2025 and into 2026, looks for stylistic markers characteristic of large language model output: unusually consistent naming, textbook-pattern structure, the absence of typical human revisions in the edit history, and overlap with reference solutions produced by major frontier models. The classifier flags a probability score, not a verdict.

What is browser lockdown mode and how strict is it?

In browser-lockdown mode, CodeSignal restricts the candidate's environment to block tab switching, external navigation, common extensions, and clipboard access in certain configurations. The candidate sees a single full-screen window for the duration of the assessment, and any attempt to leave that window — including switching to a different application — triggers a focus-loss event and increments the Suspicion Score.

What is the false positive risk on CodeSignal's AI detection?

CodeSignal does not publish a numerical false-positive rate. Public documentation notes that a high Suspicion Score is not proof of cheating. Common benign causes of flags include candidates who type code in a draft editor before pasting, candidates who learned canonical solutions to common problems and reproduce them cleanly, and accessibility tools that simulate input. The detection stack is designed as a flag for human review, not an automatic disqualification.

Are CodeSignal cheating attempts actually increasing?

Yes. CodeSignal reported that assessment fraud attempt rates more than doubled in 2025, rising from 16 percent to 35 percent across proctored assessments, with entry-level assessments climbing from 15 percent to 40 percent. The platform's detection investments throughout 2025 and 2026 are a direct response to this trend, and the volume of flagged submissions has grown in step.

Ready to use AI assistance in your next interview?

TechScreen is the invisible AI assistant trusted by engineers interviewing at Google, Meta, Amazon, and hundreds of other companies. Start with 3 free tokens — no credit card required.

Ace your next interview →