The three performance review patterns quietly penalizing your highest-output neurodivergent employees

Last updated

Most performance review systems are doing exactly what they were designed to do. The problem is that what they were designed to do does not match what HR leadership thinks the system measures. Three structural patterns appear in most performance review systems, and each one systematically penalizes neurodivergent employees regardless of output. They are: the Style Capture (in the rubric), the Story Gap (in the calibration), and the Forced Distribution Trap (in the math). Fix the manager's language without fixing these three patterns and you have handed the manager better words for the same systemic outcome.

This is the system-design view of a dynamic Spectrum Roadmap's employee-side performance review guide describes from the inside. Both are true. The coded feedback that employees recognize and the structural patterns that produce it are the same phenomenon, viewed from different sides of the conversation. If your retention pillar is bleeding neurodivergent employees through what the year-one retention article calls the Quiet Performance Drift, the three patterns below are the upstream cause.

This isn't (only) a language problem

A previous pillar in this series — the manager-language piece — argued that warm-but-generic manager language is a major contributor to neurodivergent retention failure. That argument is correct. It is also incomplete.

Coded feedback is the surface. Underneath are three system-design choices that produce coded feedback even from managers who genuinely want to be specific. A manager trained to avoid "executive presence" without specifics will reach for something else when the rubric still expects them to rate that dimension. The language fix and the system fix have to happen together.

Companies that train managers on inclusive language without changing the rubric, the calibration discipline, or the distribution model get the same review outcomes with kinder wording. The system is doing the work the language is now being more careful about.

— Debra Solomon

The Style Capture is in the rubric. The Story Gap is in the calibration. The Forced Distribution Trap is in the math. Each is a structural choice. Each can be unchosen.

Pattern 1 — The Style Capture

A typical performance review rubric has rows for outcomes — objectives delivered, KPIs hit, projects completed — and rows for style. The style rows have names like "communication," "executive presence," "stakeholder management," "polish," "leadership," "influence without authority." Each row carries weight. The final rating is a weighted average.

The high-output neurodivergent employee gets a 5 on objectives and a 3 on communication. The math averages to a 4. The neurotypical peer whose output was actually weaker gets a 4 on objectives and a 4 on communication — and the math averages to a 4 as well. They tie. The rubric, which both managers and HR believe is measuring performance, has produced a tied rating for two employees whose outputs were not tied.

This is the Style Capture. It is the structural choice to give style dimensions equal weight to outcome dimensions, while defining "style" with reference to neurotypical defaults. The rubric says it measures executive presence. In practice, the rubric measures fluency with the social norms of the rating manager.

Most "style" dimensions on most rubrics are not measuring a coherent construct. They are measuring whether the employee performs social signals the rating manager finds legible — sustained eye contact, calibrated emotional affect, fluent small-talk transitions, conventional meeting-cadence interruption patterns, the texture of professionalism that varies considerably between subcultures and almost entirely between neurotypes. The rubric is doing neurotypical-default scoring with a professional-sounding name.

The early signal is a math pattern. If strong-output neurodivergent employees consistently score lower on style dimensions than their output would predict — and the pattern holds across multiple employees and multiple managers — it is not idiosyncratic. It is the rubric, doing the work the rubric was designed to do.

The "polish" row on most rubrics is doing the most damage and getting the least scrutiny. It looks neutral. It is not.

— Debra Solomon

What changes the Style Capture is rubric redesign at the system level. Style dimensions either get cut or get re-anchored to observable behaviors that demonstrably connect to job outcomes. "Executive presence," without a behavioral definition, gets removed. "Effective communication" gets redefined with reference to results: did the team understand the brief, did the deliverable land, did the stakeholder make the decision they needed to make. Not "did the employee perform the cadence of professionalism the rater grew up with."

This is rubric work, not manager-training work. Train managers on a rubric that captures style, and the training cannot save the outcome.

Pattern 2 — The Story Gap

The Style Capture is in the rubric. The Story Gap is in the calibration session.

Calibration is the part of the review cycle where managers compare ratings across the team or the org, and ratings get adjusted to ensure consistency. The defense — why each manager believes their proposed rating is the right one — happens out loud. It is fundamentally a rhetorical exercise as well as a numerical one.

The manager whose report is charismatic, well-networked, and easy to narrate produces a vivid story in the calibration room. Here's what they did. Here's how they did it. Here's the moment I noticed. Here's the team's perception. The manager whose report is the high-output neurodivergent employee struggles to produce the same vividness. The employee's wins are documented but not socialized. Their contributions don't propagate through informal channels at the company. Their manager defends the rating with metrics; the metrics are real, but flat compared to the narrative.

The ratings drift toward the better-narrated employees. The math, viewed from outside the room, looks objective. From inside the room, it was the rhetoric.

The high-output employee whose manager can't tell their story in a calibration room is going to be the most under-rated employee in the company. Fix the calibration process, or accept that you are managing them out of the company you are paying to retain them.

— Debra Solomon

Neurodivergent employees commonly present a story gap to their managers. They often don't self-promote in conventional ways. Their wins propagate through different informal channels than the ones their managers monitor. Their contributions are harder to summarize in the social shorthand fluent management storytelling depends on. None of this is a character flaw. It is a mismatch between how the employee surfaces their work and how the calibration process expects that work to be surfaced.

The early signal is in the calibration record. Compare proposed ratings to final ratings, broken out by employee. If neurodivergent employees (where known to HR) show a pattern of being adjusted downward in calibration more often than peers, even when the proposed rating was data-supported, the Story Gap is operating. If proposed ratings are consistently revised down for managers who relied heavily on metrics in defense and revised up for managers who told vivid stories, the calibration is rewarding narrative fluency, not outcomes.

What changes the Story Gap is structured calibration. Defenses must include both data and narrative. Calibrators are trained to weight evidence over rhetoric. The calibration record captures which defense moved which rating, so the rhetorical-vs-evidence split is auditable across cycles. Some companies experiment with anonymized calibration rounds, where the employee identity is masked until the rating is locked. Done well, the technique surfaces rhetorical bias quickly. It is a strong diagnostic tool, even if the company eventually returns to identified calibration for cultural reasons.

Pattern 3 — The Forced Distribution Trap

The Style Capture is in the rubric. The Story Gap is in the calibration. The Forced Distribution Trap is in the math itself.

Companies that use forced distribution — stack ranking, mandatory percentile bands, "10% must be below expectations" rules — produce outcomes that the rubric does not justify. Ten percent of a forty-person team has to land below expectations. The actual performance distribution may not have 10% below. The math forces it.

When forced distribution interacts with the previous two patterns, the math has a predictable victim. The neurodivergent employee scoring slightly lower on style dimensions, slightly under-rated in calibration because of a less vivid narrative, becomes the easy candidate to push into the lower band — because the manager defending their rating in the forced-distribution conversation has the fewest rhetorical levers, and the rubric has already produced a number that gives cover.

Forced distribution converts every other bias in the review system into a binding outcome. A pattern of mild Style Capture under-rating, combined with mild Story Gap calibration drift, gets crystallized into a "below expectations" rating that wouldn't exist if the distribution weren't being forced. The employee then has a permanent record of underperformance that the rubric, viewed on its own, cannot defend.

The early signal is the demographic composition of the bottom band across cycles. If neurodivergent employees — where their identity is known to HR — are over-represented in the lower distribution band relative to their share of the workforce, the forced distribution is doing the work the rubric and calibration are setting it up to do. The same audit can be run on every other protected category the company is willing to track. The pattern repeats.

What changes the Forced Distribution Trap is dismantling forced distribution. The general management literature on stack ranking has been moving against it for years; the neurodivergent-impact case adds one more piece of evidence to a long file. For companies unwilling to remove forced distribution entirely, the floor mitigation is rigorous calibration that can override the distribution for documented reasons, plus a recurring audit of who lands in the bottom band. If the same demographic patterns appear cycle after cycle, the math is not neutral, and the company's defense of forced distribution as a meritocratic discipline does not survive scrutiny.

What changes the math

These three patterns don't share a manager-level fix. They share a system-level discipline.

Rubric redesign. Style dimensions either get cut or get behavior-anchored. The rubric measures observable behavior tied to outcomes, not professional-sounding proxies for social fluency. "Executive presence" without a behavioral definition is removed. "Effective communication" is redefined with reference to results, not to style.

Calibration discipline. Calibration defenses must include both data and narrative. Calibrators are trained to weight evidence over rhetoric. The calibration record captures which defense moved which rating, so the evidence-vs-rhetoric split is auditable. Anonymized calibration rounds are an experimental tool worth piloting in any company that has not yet audited its calibration rhetoric.

Distribution reform. Forced distribution is removed, or — at minimum — constrained by rigorous calibration with documented overrides. The audit of the bottom band runs every cycle, broken out by every protected category the company is willing to track.

The pattern this series has now described three times — that the fix is institutional, not individual — holds again here. Manager language work is necessary. It is not sufficient. The rubric, the calibration, and the distribution have to be doing the work the language fix is asking them to do. When they aren't, the system reaches a kinder version of the same outcome, and the neurodivergent retention problem the company is trying to solve remains.

Frequently asked questions

Why are our performance reviews disadvantaging our neurodivergent employees?

Three structural patterns commonly produce this outcome. The Style Capture (rubrics that weight neurotypical-default style dimensions equally with outcome dimensions), the Story Gap (calibration sessions that reward narrative fluency over evidence), and the Forced Distribution Trap (forced ranking that crystallizes mild biases into binding outcomes). Each is a system-design choice, not a manager-bias problem. Each can be unchosen. The fixes are at the rubric, the calibration discipline, and the distribution model.

How can we tell if our review system has a style-capture problem?

Look for a math pattern. Compare style-dimension scores to output-dimension scores for neurodivergent employees, where their identity is known. If style dimensions score consistently lower than output dimensions would predict — and the pattern holds across multiple employees and multiple managers — the rubric is capturing style as a proxy for neurotypical-default social fluency. If "executive presence" or "polish" has no behavioral definition on your rubric, you almost certainly have a Style Capture problem. The rubric is rating social legibility, not performance.

What's the link between calibration and neurodivergent retention?

Calibration is a rhetorical exercise as well as a numerical one. The manager who can tell a vivid story about their employee's work walks out of calibration with a higher rating for that employee. The manager defending a high-output neurodivergent employee with metrics alone — without the social storytelling the calibration process implicitly rewards — usually doesn't. Over multiple cycles, this drift produces a pattern of under-rating that contributes directly to the slow-motion managed-out exit. Structured calibration with auditable evidence-vs-rhetoric records is the fix.

Should we remove "executive presence" from our review rubric?

If it has no behavioral definition tied to job outcomes, yes. "Executive presence" as a review dimension typically measures whether the employee performs the texture of professionalism the rater finds familiar — which varies by culture, by generation, and substantially by neurotype. Removing it does not weaken the rubric. It strengthens the rubric by forcing the underlying constructs (communication effectiveness, decision-making, influence) to be defined in terms of observable behavior and outcomes. If executive presence is genuinely a job requirement at a senior level, define what it consists of in terms of what the employee does.

Does forced distribution disadvantage neurodivergent employees?

In most companies, yes — because forced distribution converts every other bias in the review system into a binding outcome. Mild Style Capture under-rating combined with mild Story Gap calibration drift, multiplied by a mandatory below-expectations percentage, produces a predictable result. The neurodivergent employee whose math was slightly off becomes the easy candidate to push into the lower band. Audit the demographic composition of your bottom band across cycles. If neurodivergent employees (where known) are over-represented, the math is not neutral.

What's the difference between this and the employee-side performance review guide?

The employee-side guide describes the experience of being on the receiving end of coded feedback and slow-motion managed-out — what the patterns look like, feel like, and what an employee can do about them. This pillar describes the system mechanics that produce those experiences — what's in the rubric, what's happening in calibration, what the math is doing. Both are accurate accounts of the same dynamic. The employee-side view is necessary for the people inside it. The system-side view is necessary for the HR leaders who can change the system.

About Spectrum Roadmap

Spectrum Roadmap exists to help two audiences who are usually treated as separate. We help neurodivergent individuals navigate corporate America — the disclosure decisions, the unwritten rules, the performance review patterns — through coaching, community, and a body of work built across three decades. We help the companies that employ them close the policy gap so everyone thrives — the rubrics, the manager language, the accommodation frameworks that decide whether the workplace is one neurodivergent talent stays in.

Both sides have to move for the conversation to change. We work with both.

Where to go from here

If you're recognizing one of these patterns in your own review history — or in the reviews you write as a manager — the conversation about what to do about it is happening with the people on both sides of it.

Join the Spectrum Roadmap Community

A private space with twice-monthly live calls with Debra Solomon. It's where neurodivergent professionals working through their own performance review cycles and the HR leaders and managers learning to recognize what their systems are producing are in the same conversation. From $95/month, billed annually. Cancel anytime.

Join the Community →

The newsletter

Get weekly insights on inclusive hiring

Practical guidance for navigating neurodiversity at work, delivered to your inbox by Debra Solomon. No fluff. Unsubscribe anytime.