Assessment

When the résumé says agentic systems and you need to know if it’s true.

A scored, evidence-based read on whether a candidate or incumbent engineer is actually operating in the AI-native register the role requires — versus rehearsing the vocabulary. Six dimensions. One recommendation: Hire, Don’t Hire, or Recalibrate.

Why the recommendation is the recommendation.

Assessment is priced independently so the recommendation stays independent. If the evidence says Hire, we say Hire. If it says Don’t Hire, we say Don’t Hire. If it says Recalibrate — meaning the role is structurally wrong before the market is even tested — there is no downstream search fee for us. That’s the test.

Assessment is not a coding screen, a take-home, or a reference check at scale. It is an evidence standard applied to a specific engineer against the version of their role the next 18–36 months will demand. No shipping evidence = no score.

What you receive

A scored report across the six dimensions of the Engineering Signal Lens, with cited shipping evidence per dimension, a synthesized assessment, and one of three recommendations: Hire, Don’t Hire, or Recalibrate. Delivered to the hiring sponsor first. Rollout sequence chosen by the sponsor.

The Engineering Signal Lens

Six dimensions. Each backed by shipping evidence.

  1. 01

    Shipping Evidence

    What they have actually built in production, against what users, at what scale.

  2. 02

    Production Discipline

    How they reason about latency, cost, eval, and failure under load.

  3. 03

    Failure Decomposition

    How they explain what broke, what they tried, what they learned.

  4. 04

    Learning Velocity

    How fast they absorb new model capabilities and integrate them into shipped systems.

  5. 05

    Stack Reality

    What they actually use vs. what they describe — the gap between vocabulary and tooling.

  6. 06

    Cross-Functional Translation

    How they ship with research, PM, and infrastructure as collaborators rather than blockers.

How it runs

Two to three weeks per candidate or cohort. Structured technical interviews focused on what has actually been shipped, against what production constraints. Artifact review — code, architectures, postmortems, eval suites. Cross-functional signal from PM, research, infra. Final report delivered to the hiring sponsor. The candidate is informed of the engagement in a sequence the sponsor approves before kickoff.

What it costs

Scoped to depth and number of candidates. Fee disclosed before engagement and paid independently of any downstream Search engagement.

When buyers commission an Assessment
  • A finalist who looks right on paper but doesn’t feel right after the coding screen.
  • An incumbent engineer whose role is being rewritten by what their team now ships agentically.
  • A founding-engineer hire where the cost of a wrong call sets the team’s direction for a year.
  • A succession decision on a Distinguished Engineer or AI Platform Lead.
  • A PE portco where the engineering org’s AI-native posture is part of the value-creation thesis.
What it is not

Not a coding screen. Not a take-home. Not a personality assessment. Not a reference check at scale. Assessment tests whether a specific engineer can operate in a specific AI-native role under a specific set of production conditions.

Request an Assessment.

Confidentiality is the default. Sensitive engineering hiring questions, replacements, and finalist assessments are handled directly by Jason and shared only with agreed stakeholders.

AI-native engineering search. Hire. Assess. Map.Confidentiality is the default.jason@runretained.com© RunRetained 2026