How to quality-score your outbound outreach (the rubric that prevents drift)
How do you quality-score outbound outreach?
Short answer: with a 5–7 dimension rubric, applied weekly to a random sample of 10–20 outreach touches per rep. Quality drifts invisibly when nobody is watching. A consistent scoring discipline catches drift before it shows up in reply rate metrics.
TL;DR — the rubric dimensions
| Dimension | Scoring 1–5 |
|---|---|
| First-line specificity | Personalised vs generic |
| Value clarity | Specific outcome vs vague |
| Call-to-action quality | Specific vs vague |
| Email length | Under 90 vs over |
| Tone fit for persona | Calibrated vs off |
| Threading discipline | Reply-to-thread vs new email |
| Personalisation depth | Genuine vs templated-revealed |
The weekly sample
| Step | Detail |
|---|---|
| Sample size | 10–20 emails per rep per week |
| Sample method | Random selection from sequencer logs |
| Scorer | Sales manager or designated peer |
| Scoring time | 15 min per rep |
| Output | Documented scorecard + 1–2 coaching topics |
Sampling works because most sequencer tools log the actual sent emails — making post-hoc review trivial.
What to look for
Signs of good quality:
- First lines reference verifiable specific signals.
- Email body is under 90 words.
- CTA is time-bounded and specific.
- Reply-to-thread on follow-ups.
- Tone matches persona (senior / technical / junior).
Signs of degradation:
- Generic first lines ("I noticed your company is growing").
- Body length creeping up.
- Repetitive structure across emails to different personas.
- New subject lines on follow-ups (vs threading).
- Same email body word-for-word across personas.
Coaching from the scores
Each weekly scoring produces 1–2 coaching topics per rep. Examples:
- "Three of five sampled emails this week had generic first lines — let's role-play personalisation."
- "Follow-up emails 2 and 3 had new subject lines instead of threading — confirm Apollo threading is on."
- "Body length averaging 130 words — let's edit them down together."
The scoring is not punishment; it is the data input to coaching.
How quality scoring prevents drift
Outbound quality degrades slowly. A team that hit 5% reply rate in Q1 drifts to 4% by Q3 to 3% by Q1 next year — and nobody knows why. The cumulative effect of small quality drops compounds.
Weekly scoring keeps the bar visible. Reps know their work is reviewed; quality stays consistent.
For UAE & KSA teams
- Cultural calibration in the scoring rubric. "Tone fit for persona" should reflect MENA-appropriate formality.
- Arabic-language emails require Arabic-fluent scoring; ensure capability on the team.
- Regional case study reference rate can be a scored dimension — do reps mention local proof points?
What MAVEN does about it
Quality scoring is part of the Sales Process Program operational deliverables. We design the rubric, train the manager on scoring, and run the first 4–6 weeks alongside.
Frequently asked
How long does scoring take?
15–20 min per rep per week.
Should reps see their scores?
Yes — transparency drives improvement.
Can AI score outreach?
For some dimensions (length, threading), yes. For nuance (tone, personalisation depth), human judgement still wins.
How often should the rubric change?
Quarterly review; refresh as needed.
What's the worst quality-scoring mistake?
Scoring without coaching. Scores without conversation produce resentment, not improvement.
Post 92 of our outbound + sales OS series.
Related reading
Level Up Your Sales Career
Join The Sales Development Society — weekly live coaching, proven templates, and a community of ambitious B2B salespeople going from entry-level to enterprise.
Join the CommunityReady to install your sales engine?
Book a 30-minute Virtual Coffee. No deck, no pitch — just an honest read of where you are.
Book a Virtual Coffee